CVPR 2025 Accepted Papers

Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis Poster Session 5
Jiangyong Huang ⋅ Baoxiong Jia ⋅ Yan Wang ⋅ Ziyu Zhu ⋅ Xiongkun Linghu ⋅ Qing Li ⋅ Song-Chun Zhu ⋅ Siyuan Huang
ExHall D Poster #339
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World Poster Session 4
Bangyan Liao ⋅ Zhenjun Zhao ⋅ Haoang Li ⋅ Yi Zhou ⋅ Yingping Zeng ⋅ Hao Li ⋅ Peidong Liu
ExHall D Poster #102
Beyond Human Perception: Understanding Multi-Object World from Monocular View Poster Session 1
Keyu Guo ⋅ Yongle Huang ⋅ Shijie Sun ⋅ Xiangyu Song ⋅ Mingtao Feng ⋅ Zedong Liu ⋅ Huansheng Song ⋅ Tiantian Wang ⋅ Jianxin Li ⋅ Naveed Akhtar ⋅ Ajmal Mian
ExHall D Poster #341
Deep Fair Multi-View Clustering with Attention KAN Poster Session 1
HaiMing Xu ⋅ Qianqian Wang ⋅ Boyue Wang ⋅ Quanxue Gao
ExHall D Poster #468
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level Poster Session 2
Andong Deng ⋅ Tongjia Chen ⋅ Shoubin Yu ⋅ Taojiannan Yang ⋅ Lincoln Spencer ⋅ Yapeng Tian ⋅ Ajmal Mian ⋅ Mohit Bansal ⋅ Chen Chen
ExHall D Poster #311
Distinguish Then Exploit: Source-free Open Set Domain Adaptation via Weight Barcode Estimation and Sparse Label Assignment Poster Session 1
Weiming Liu ⋅ Jun Dan ⋅ Fan Wang ⋅ Xinting Liao ⋅ Junhao Dong ⋅ Hua Yu ⋅ Shunjie Dong ⋅ Lianyong Qi
ExHall D Poster #455
MatAnyone: Stable Video Matting with Consistent Memory Propagation Poster Session 2
Peiqing Yang ⋅ Shangchen Zhou ⋅ Jixin Zhao ⋅ Qingyi Tao ⋅ Chen Change Loy
ExHall D Poster #185
Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction Poster Session 6
Shiyu Zhao ⋅ Zhenting Wang ⋅ Felix Juefei-Xu ⋅ Xide Xia ⋅ Miao Liu ⋅ Xiaofang Wang ⋅ Mingfu Liang ⋅ Ning Zhang ⋅ Dimitris N. Metaxas ⋅ Licheng Yu
ExHall D Poster #356
Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning Poster Session 5
Can Küçüksözen ⋅ Yucel Yemez
ExHall D Poster #415
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics Poster Session 5
Lee Chae-Yeon ⋅ Oh Hyun-Bin ⋅ Han EunGi ⋅ Kim Sung-Bin ⋅ Suekyeong Nam ⋅ Tae-Hyun Oh
ExHall D Poster #2
MambaOut: Do We Really Need Mamba for Vision? Poster Session 1
Weihao Yu ⋅ Xinchao Wang
ExHall D Poster #414
GigaHands: A Massive Annotated Dataset of Bimanual Hand Activities Poster Session 4
Rao Fu ⋅ Dingxi Zhang ⋅ Alex Jiang ⋅ Wanjia Fu ⋅ Austin Funk ⋅ Daniel Ritchie ⋅ Srinath Sridhar
ExHall D Poster #159
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? Poster Session 2
Yunlong Tang ⋅ JunJia Guo ⋅ Hang Hua ⋅ Susan Liang ⋅ Mingqian Feng ⋅ Xinyang Li ⋅ Rui Mao ⋅ Chao Huang ⋅ Jing Bi ⋅ Zeliang Zhang ⋅ Pooyan Fazli ⋅ Chenliang Xu
ExHall D Poster #297
DIFIX3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Poster Session 6
Jay Zhangjie Wu ⋅ Yuxuan Zhang ⋅ Haithem Turki ⋅ Xuanchi Ren ⋅ Jun Gao ⋅ Mike Zheng Shou ⋅ Sanja Fidler ⋅ Žan Gojčič ⋅ Huan Ling
ExHall D Poster #57
K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences Poster Session 2
Zhikai Li ⋅ Xuewen Liu ⋅ Dongrong Joe Fu ⋅ Jianquan Li ⋅ Qingyi Gu ⋅ Kurt Keutzer ⋅ Zhen Dong
ExHall D Poster #359
A3: Few-shot Prompt Learning of Unlearnable Examples with Cross-Modal Adversarial Feature Alignment Poster Session 2
Xuan Wang ⋅ Xitong Gao ⋅ Dongping Liao ⋅ Tianrui Qin ⋅ Yu-liang Lu ⋅ Cheng-Zhong Xu
ExHall D Poster #394
Adapting Pre-trained 3D Models for Point Cloud Video Understanding via Cross-frame Spatio-temporal Perception Poster Session 3
Baixuan Lv ⋅ Yaohua Zha ⋅ Tao Dai ⋅ Xue Yuerong ⋅ Ke Chen ⋅ Shu-Tao Xia
ExHall D Poster #168
Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models Poster Session 4
Jiuming Liu ⋅ Jinru Han ⋅ Lihao Liu ⋅ Angelica I Aviles-Rivero ⋅ Chaokang Jiang ⋅ Zhe Liu ⋅ Hesheng Wang
ExHall D Poster #174
Birth and Death of a Rose Poster Session 6
Chen Geng ⋅ Yunzhi Zhang ⋅ Shangzhe Wu ⋅ Jiajun Wu
ExHall D Poster #11
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment Poster Session 6
Jiayi Guo ⋅ Zhao Junhao ⋅ Chaoqun Du ⋅ Yulin Wang ⋅ Chunjiang Ge ⋅ Zanlin Ni ⋅ Shiji Song ⋅ Humphrey Shi ⋅ Gao Huang
ExHall D Poster #417
MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification Poster Session 4
Jianwei Zhao ⋅ XIN LI ⋅ Fan Yang ⋅ Qiang Zhai ⋅ Ao Luo ⋅ Yang Zhao ⋅ Hong Cheng ⋅ Huazhu Fu
ExHall D Poster #474
Optimal Transport-Guided Source-Free Adaptation for Face Anti-Spoofing Poster Session 5
Zhuowei Li ⋅ Tianchen Zhao ⋅ Xiang Xu ⋅ Zheng Zhang ⋅ Zhihua Li ⋅ Xuanbai Chen ⋅ Qin ZHANG ⋅ Alessandro Bergamo ⋅ Anil Kumar Jain ⋅ Yifan Xing
ExHall D Poster #318
HyperFree: A Channel-adaptive and Tuning-free Foundation Model for Hyperspectral Remote Sensing Imagery Poster Session 5
Jingtao Li ⋅ Yingyi Liu ⋅ XINYU WANG ⋅ Yunning Peng ⋅ Chen Sun ⋅ Shaoyu Wang ⋅ Zhendong Sun ⋅ Tian Ke ⋅ Xiao Jiang ⋅ Tangwei Lu ⋅ Anran Zhao ⋅ Yanfei Zhong
ExHall D Poster #189
FastVLM: Efficient Vision Encoding for Vision Language Models Poster Session 4
Pavan Kumar Anasosalu Vasu ⋅ Fartash Faghri ⋅ Chun-Liang Li ⋅ Cem Koc ⋅ Nate True ⋅ Gokula Krishnan Santhanam ⋅ Albert Antony ⋅ James Gabriel ⋅ Peter Grasch ⋅ Oncel Tuzel ⋅ Hadi Pouransari
ExHall D Poster #378
Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images Poster Session 1
Junxian Wu ⋅ Minheng Chen ⋅ Xinyi Ke ⋅ Tianwang Xun ⋅ Xiaoming Jiang ⋅ Hongyu Zhou ⋅ Lizhi Shao ⋅ Youyong Kong
ExHall D Poster #476
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way Poster Session 3
Jiazi Bu ⋅ Pengyang Ling ⋅ Pan Zhang ⋅ Tong Wu ⋅ Xiaoyi Dong ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Dahua Lin ⋅ Jiaqi Wang
ExHall D Poster #224
Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems Poster Session 2
Song Xia ⋅ Yi Yu ⋅ Wenhan Yang ⋅ MEIWEN DING ⋅ Zhuo Chen ⋅ Ling-Yu Duan ⋅ Alex C. Kot ⋅ Xudong Jiang
ExHall D Poster #323
Flexible Frame Selection for Efficient Video Reasoning Poster Session 6
Shyamal Buch ⋅ Arsha Nagrani ⋅ Anurag Arnab ⋅ Cordelia Schmid
ExHall D Poster #280
Pippo: High-Resolution Multi-View Humans from a Single Image Poster Session 4
Yash Kant ⋅ Ethan Weber ⋅ Jin Kyu Kim ⋅ Rawal Khirodkar ⋅ Zhaoen Su ⋅ Julieta Martinez ⋅ Igor Gilitschenski ⋅ Shunsuke Saito ⋅ Timur Bagautdinov
ExHall D Poster #55
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences Poster Session 1
Hongyan Zhi ⋅ Peihao Chen ⋅ Junyan Li ⋅ Shuailei Ma ⋅ Xinyu Sun ⋅ Tianhang Xiang ⋅ Yinjie Lei ⋅ Mingkui Tan ⋅ Chuang Gan
ExHall D Poster #342
Scaling Down Text Encoders of Text-to-Image Diffusion Models Poster Session 4
Lifu Wang ⋅ Daqing Liu ⋅ Xinchen Liu ⋅ Xiaodong He
ExHall D Poster #253
FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification Poster Session 3
Zhengrui Guo ⋅ Conghao Xiong ⋅ Jiabo MA ⋅ Qichen Sun ⋅ Lishuang Feng ⋅ Jinzhuo Wang ⋅ Hao Chen
ExHall D Poster #473
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models Poster Session 3
Shenghao Fu ⋅ Qize Yang ⋅ Qijie Mo ⋅ Junkai Yan ⋅ Xihan Wei ⋅ Jingke Meng ⋅ Xiaohua Xie ⋅ Wei-Shi Zheng
ExHall D Poster #415
Shading Meets Motion: Self-supervised Indoor 3D Reconstruction Via Simultaneous Shape-from-Shading and Structure-from-Motion Poster Session 4
Guoyu Lu
ExHall D Poster #64
MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures Poster Session 3
Lucas Morin ⋅ Valery Weber ⋅ Ahmed Nassar ⋅ Gerhard Ingmar Meijer ⋅ Luc Van Gool ⋅ Yawei Li ⋅ Peter W. J. Staar
ExHall D Poster #368
Removing Reflections from RAW Photos Poster Session 1
Eric Kee ⋅ Adam Pikielny ⋅ Kevin Blackburn-Matzen ⋅ Marc Levoy
ExHall D Poster #169
RNG: Relightable Neural Gaussians Poster Session 6
Jiahui Fan ⋅ Fujun Luan ⋅ Jian Yang ⋅ Milos Hasan ⋅ Beibei Wang
ExHall D Poster #35
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model Poster Session 3
Jun Zhou ⋅ Jiahao Li ⋅ Zunnan Xu ⋅ Hanhui Li ⋅ Yiji Cheng ⋅ Fa-Ting Hong ⋅ Qin Lin ⋅ qinglin lu ⋅ Xiaodan Liang
ExHall D Poster #233
Descriptor-In-Pixel: Point-Feature Tracking For Pixel Processor Arrays Poster Session 2
Laurie Bose ⋅ Piotr Dudek ⋅ Jianing Chen
ExHall D Poster #88
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Poster Session 5
Ronghao Dang ⋅ Yuqian Yuan ⋅ Wenqi Zhang ⋅ Yifei Xin ⋅ Boqiang Zhang ⋅ Long Li ⋅ Liuyi Wang ⋅ qinyang zeng ⋅ Xin Li ⋅ Lidong Bing
ExHall D Poster #341
AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction Poster Session 5
Lingteng Qiu ⋅ Shenhao Zhu ⋅ Qi Zuo ⋅ Xiaodong Gu ⋅ Yuan Dong ⋅ Junfei Zhang ⋅ Chao Xu ⋅ Zhe Li ⋅ Weihao Yuan ⋅ Liefeng Bo ⋅ Guanying Chen ⋅ Zilong Dong
ExHall D Poster #10
Few-Shot Recognition via Stage-Wise Retrieval-Augmented Finetuning Poster Session 3
Tian Liu ⋅ Huixin Zhang ⋅ Shubham Parashar ⋅ Shu Kong
ExHall D Poster #425
Matrix-Free Shared Intrinsics Bundle Adjustment Poster Session 6
Daniel Safari
ExHall D Poster #83
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability Poster Session 3
Lei Wang ⋅ Senmao Li ⋅ Fei Yang ⋅ Jianye Wang ⋅ Ziheng Zhang ⋅ Yuhan Liu ⋅ Yaxing Wang ⋅ Jian Yang
ExHall D Poster #213
Doppelgängers and Adversarial Vulnerability Poster Session 2
George Kamberov
ExHall D Poster #464
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference Poster Session 6
Jinhong Deng ⋅ Yuhang Yang ⋅ Wen Li ⋅ Lixin Duan
ExHall D Poster #365
Federated Learning with Domain Shift Eraser Poster Session 1
Zheng Wang ⋅ Zihui Wang ⋅ Zheng Wang ⋅ Xiaoliang Fan ⋅ Cheng Wang
ExHall D Poster #460
Customized Condition Controllable Generation for Video Soundtrack Poster Session 5
Fan Qi ⋅ KunSheng Ma ⋅ Changsheng Xu
ExHall D Poster #277
ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation Poster Session 4
Ali Athar ⋅ Xueqing Deng ⋅ Liang-Chieh Chen
ExHall D Poster #308
SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons Poster Session 1
Yuanyou Xu ⋅ Zongxin Yang ⋅ Yi Yang
ExHall D Poster #14
AnimateAnything: Consistent and Controllable Animation for Video Generation Poster Session 6
guojun lei ⋅ Chi Wang ⋅ Rong Zhang ⋅ Yikai Wang ⋅ Hong Li ⋅ Weiwei Xu
ExHall D Poster #169
SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video Poster Session 6
Jongmin Park ⋅ Minh-Quan Viet Bui ⋅ Juan Luis Gonzalez Bello ⋅ Jaeho Moon ⋅ Jihyong Oh ⋅ Munchurl Kim
ExHall D Poster #69
Mask^2DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation Poster Session 4
Tianhao Qi ⋅ Jianlong Yuan ⋅ Wanquan Feng ⋅ Shancheng Fang ⋅ Jiawei Liu ⋅ SiYu Zhou ⋅ Qian HE ⋅ Hongtao Xie ⋅ Yongdong Zhang
ExHall D Poster #291
AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification Poster Session 1
Huy Nguyen ⋅ Kien Nguyen Thanh ⋅ Akila Pemasiri ⋅ Feng Liu ⋅ Sridha Sridharan ⋅ Clinton Fookes
ExHall D Poster #100
Anomize: Better Open Vocabulary Video Anomaly Detection Poster Session 6
Fei Li ⋅ Wenxuan Liu ⋅ Jingjing Chen ⋅ Ruixu Zhang ⋅ Yuran Wang ⋅ Xian Zhong ⋅ Zheng Wang
ExHall D Poster #292
CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology Poster Session 2
Yuxuan Sun ⋅ Yixuan Si ⋅ Chenglu Zhu ⋅ Xuan Gong ⋅ Kai Zhang ⋅ Pingyi Chen ⋅ Ye Zhang ⋅ Zhongyi Shui ⋅ Tao Lin ⋅ Lin Yang
ExHall D Poster #475
Uncertain Multimodal Intention and Emotion Understanding in the Wild Poster Session 5
Qu Yang ⋅ QingHongYa Shi ⋅ Tongxin Wang ⋅ Mang Ye
ExHall D Poster #351
LIM: Large Interpolator Model for Dynamic Reconstruction Poster Session 2
Remy Sabathier ⋅ Niloy J. Mitra ⋅ David Novotny
ExHall D Poster #68
DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows Poster Session 5
Mashrur M. Morshed ⋅ Vishnu Naresh Boddeti
ExHall D Poster #215
PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model Poster Session 2
Mingju Gao ⋅ Yike Pan ⋅ Huan-ang Gao ⋅ Zongzheng Zhang ⋅ Wenyi Li ⋅ Hao Dong ⋅ Hao Tang ⋅ Li Yi ⋅ Hao Zhao
ExHall D Poster #156
Reasoning in Visual Navigation of End-to-end Trained Agents: A Dynamical Systems Approach Poster Session 3
Steeven JANNY ⋅ Hervé Poirier ⋅ Leonid Antsfeld ⋅ Guillaume Bono ⋅ Gianluca Monaci ⋅ Boris Chidlovskii ⋅ Francesco Giuliari ⋅ Alessio Del Bue ⋅ Christian Wolf
ExHall D Poster #141
iSegMan: Interactive Segment-and-Manipulate 3D Gaussians Poster Session 1
Yian Zhao ⋅ Wanshi Xu ⋅ Ruochong Zheng ⋅ Pengchong Qiao ⋅ Chang Liu ⋅ Jie Chen
ExHall D Poster #46
FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation Poster Session 1
Fengyi Fu ⋅ Lei Zhang ⋅ Mengqi Huang ⋅ Zhendong Mao
ExHall D Poster #239
Symbolic Representation for Any-to-Any Generative Tasks Poster Session 6
Jiaqi Chen ⋅ Xiaoye Zhu ⋅ Yue Wang ⋅ Tianyang Liu ⋅ Xinhui Chen ⋅ Ying Chen ⋅ Chak Tou Leong ⋅ Yifei Ke ⋅ Joseph Liu ⋅ Yiwen Yuan ⋅ Julian McAuley ⋅ Li-jia Li
ExHall D Poster #157
Shape Abstraction via Marching Differentiable Support Functions Poster Session 4
Sunkyung Park ⋅ Jeongmin Lee ⋅ Dongjun Lee
ExHall D Poster #104
ILIAS: Instance-Level Image retrieval At Scale Poster Session 3
Giorgos Kordopatis-Zilos ⋅ Vladan Stojnić ⋅ Anna Manko ⋅ Pavel Suma ⋅ Nikolaos-Antonios Ypsilantis ⋅ Nikos Efthymiadis ⋅ Zakaria Laskar ⋅ Jiri Matas ⋅ Ondrej Chum ⋅ Giorgos Tolias
ExHall D Poster #395
Universal Scene Graph Generation Poster Session 3
Shengqiong Wu ⋅ Hao Fei ⋅ Tat-seng Chua
ExHall D Poster #336
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Poster Session 1
Miran Heo ⋅ Min-Hung Chen ⋅ De-An Huang ⋅ Sifei Liu ⋅ Subhashree Radhakrishnan ⋅ Seon Joo Kim ⋅ Yu-Chiang Frank Wang ⋅ Ryo Hachiuma
ExHall D Poster #357
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Poster Session 2
Xuanchi Ren ⋅ Tianchang Shen ⋅ Jiahui Huang ⋅ Huan Ling ⋅ Yifan Lu ⋅ Merlin Nimier-David ⋅ Thomas Müller ⋅ Alexander Keller ⋅ Sanja Fidler ⋅ Jun Gao
ExHall D Poster #65
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation Poster Session 6
Seokil Ham ⋅ Hee-Seon Kim ⋅ Sangmin Woo ⋅ Changick Kim
ExHall D Poster #378
CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering Poster Session 4
Tianyu Huai ⋅ Jie Zhou ⋅ Xingjiao Wu ⋅ Qin Chen ⋅ Qingchun Bai ⋅ Zezhou ⋅ Liang He
ExHall D Poster #362
DiskVPS: Vanishing Point Detector via Hough Transform in a Disk Region Poster Session 6
Jianping Wu
ExHall D Poster #86
RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation Poster Session 6
Mingfei Han ⋅ Liang Ma ⋅ Kamila Zhumakhanova ⋅ Ekaterina Radionova ⋅ Jingyi Zhang ⋅ Xiaojun Chang ⋅ Xiaodan Liang ⋅ Ivan Laptev
ExHall D Poster #136
Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing Poster Session 1
Zhedong Zhang ⋅ Liang Li ⋅ Chenggang Yan ⋅ Chunshan Liu ⋅ Anton van den Hengel ⋅ Yuankai Qi
ExHall D Poster #1
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition Poster Session 4
Caoshuo Li ⋅ Tanzhe Li ⋅ Xiaobin Hu ⋅ Donghao Luo ⋅ Taisong Jin
ExHall D Poster #415
MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation Poster Session 3
Mingcheng Li ⋅ Xiaolu Hou ⋅ Ziyang Liu ⋅ Dingkang Yang ⋅ Ziyun Qian ⋅ Jiawei Chen ⋅ Jinjie Wei ⋅ Yue Jiang ⋅ Qingyao Xu ⋅ Lihua Zhang
ExHall D Poster #249
Data Distributional Properties As Inductive Bias for Systematic Generalization Poster Session 5
Felipe del Rio ⋅ Alain Raymond ⋅ Daniel Florea ⋅ Rodrigo Toro Icarte ⋅ Julio Hurtado ⋅ Cristian Buc Calderon ⋅ Alvaro Soto
ExHall D Poster #435
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training Poster Session 6
Haicheng Wang ⋅ Chen Ju ⋅ Weixiong Lin ⋅ Mengting Chen ⋅ Shuai Xiao ⋅ Yixuan Huang ⋅ Chang Liu ⋅ mingshuai Yao ⋅ Jinsong Lan ⋅ Ying Chen ⋅ Qingwen Liu ⋅ Yanfeng Wang
ExHall D Poster #349
Hypergraph Vision Transformers: Images are More than Nodes, More than Edges Poster Session 2
Joshua Fixelle
ExHall D Poster #417
Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment Poster Session 2
Huakai Lai ⋅ Guoxin Xiong ⋅ Huayu Mai ⋅ Xiang Liu ⋅ Tianzhu Zhang
ExHall D Poster #368
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding Poster Session 3
Zining Wang ⋅ Tongkun Guan ⋅ Pei Fu ⋅ Chen Duan ⋅ Qianyi Jiang ⋅ Zhentao Guo ⋅ Shan Guo ⋅ Junfeng Luo ⋅ Wei Shen ⋅ Xiaokang Yang
ExHall D Poster #364
GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction Poster Session 5
Li Zhang ⋅ mingliang xu ⋅ Jianan Wang ⋅ Qiaojun Yu ⋅ Lixin Yang ⋅ Yonglu Li ⋅ Cewu Lu ⋅ Rujing Wang ⋅ Liu Liu
ExHall D Poster #150
SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection Poster Session 1
Phi Vu Tran
ExHall D Poster #431
Generalized Gaussian Entropy Model for Point Cloud Attribute Compression with Dynamic Likelihood Intervals Poster Session 3
Changhao Peng
ExHall D Poster #109
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Poster Session 3
Xi Chen ⋅ Zhifei Zhang ⋅ He Zhang ⋅ Yuqian Zhou ⋅ Soo Ye Kim ⋅ Qing Liu ⋅ Yijun Li ⋅ Jianming Zhang ⋅ Nanxuan Zhao ⋅ Yilin Wang ⋅ Hui Ding ⋅ Zhe Lin ⋅ Hengshuang Zhao
ExHall D Poster #176
Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics Poster Session 1
Chen Liu ⋅ Liying Yang ⋅ Peike Li ⋅ Dadong Wang ⋅ Lincheng Li ⋅ Xin Yu
ExHall D Poster #284
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors Poster Session 2
Fanqi Pu ⋅ Yifan Wang ⋅ Jiru Deng ⋅ Wenming Yang
ExHall D Poster #109
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception Poster Session 3
Yuanchen Wu ⋅ Lu Zhang ⋅ Hang Yao ⋅ Junlong Du ⋅ Ke Yan ⋅ Shouhong Ding ⋅ Yunsheng Wu ⋅ Xiaoqiang Li
ExHall D Poster #383
Apollo: An Exploration of Video Understanding in Large Multimodal Models Poster Session 4
Orr Zohar ⋅ Xiaohan Wang ⋅ Yann Dubois ⋅ Nikhil Mehta ⋅ Tong Xiao ⋅ Philippe Hansen-Estruch ⋅ Licheng Yu ⋅ Xiaofang Wang ⋅ Felix Juefei-Xu ⋅ Ning Zhang ⋅ Serena Yeung ⋅ Xide Xia
ExHall D Poster #296
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis Poster Session 4
Jiaqi Liu ⋅ Jichao Zhang ⋅ Paolo Rota ⋅ Nicu Sebe
ExHall D Poster #15
EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling Poster Session 1
Songpengcheng Xia ⋅ Yu Zhang ⋅ Zhuo Su ⋅ Xiaozheng Zheng ⋅ Zheng Lv ⋅ Guidong Wang ⋅ Yongjie Zhang ⋅ Qi Wu ⋅ Lei Chu ⋅ Ling Pei
ExHall D Poster #155
Visual Lexicon: Rich Image Features in Language Space Poster Session 4
XuDong Wang ⋅ Xingyi Zhou ⋅ Alireza Fathi ⋅ Trevor Darrell ⋅ Cordelia Schmid
ExHall D Poster #375
Global-Local Tree Search in VLMs for 3D Indoor Scene Generation Poster Session 2
Wei Deng ⋅ Mengshi Qi ⋅ Huadong Ma
ExHall D Poster #345
Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge Poster Session 5
Yaqi Zhao ⋅ Yuanyang Yin ⋅ Lin Li ⋅ Mingan Lin ⋅ Victor Shea-Jay Huang ⋅ Siwei Chen ⋅ Weipeng Chen ⋅ Baoqun Yin ⋅ Zenan Zhou ⋅ Wentao Zhang
ExHall D Poster #374
Volumetric Surfaces: Representing Fuzzy Geometries with Layered Meshes Poster Session 5
Stefano Esposito ⋅ Anpei Chen ⋅ Christian Reiser ⋅ Samuel Rota Bulò ⋅ Lorenzo Porzi ⋅ Katja Schwarz ⋅ Christian Richardt ⋅ Michael Zollhoefer ⋅ Peter Kontschieder ⋅ Andreas Geiger
ExHall D Poster #31
Curriculum Coarse-to-Fine Selection for High-IPC Dataset Distillation Poster Session 4
Yanda Chen ⋅ Gongwei Chen ⋅ Miao Zhang ⋅ Weili Guan ⋅ Liqiang Nie
ExHall D Poster #441
HumanMM: Global Human Motion Recovery from Multi-shot Videos Poster Session 1
Yuhong Zhang ⋅ Guanlin Wu ⋅ Ling-Hao Chen ⋅ Zhuokai Zhao ⋅ Jing Lin ⋅ Xiaoke Jiang ⋅ Jiamin WU ⋅ Zhuoheng Li ⋅ Hao Frank Yang ⋅ Haoqian Wang ⋅ Lei Zhang
ExHall D Poster #167
VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step Poster Session 4
Hanyang Wang ⋅ Fangfu Liu ⋅ Jiawei Chi ⋅ Yueqi Duan
ExHall D Poster #61
RORem: Training a Robust Object Remover with Human-in-the-Loop Poster Session 3
Ruibin Li ⋅ Tao Yang ⋅ Song Guo ⋅ Lei Zhang
ExHall D Poster #323
FineVQ: Fine-Grained User Generated Content Video Quality Assessment Poster Session 1
Huiyu Duan ⋅ Qiang Hu ⋅ Wang Jiarui ⋅ Liu Yang ⋅ Zitong Xu ⋅ Lu Liu ⋅ Xiongkuo Min ⋅ Chunlei Cai ⋅ Tianxiao Ye ⋅ Xiaoyun Zhang ⋅ Guangtao Zhai
ExHall D Poster #291
Uncertainty Weighted Gradients for Model Calibration Poster Session 3
Jinxu Lin ⋅ Linwei Tao ⋅ Minjing Dong ⋅ Chang Xu
ExHall D Poster #464
ATA: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting Poster Session 4
Yizhe Tang ⋅ Zhimin Sun ⋅ Yuzhen Du ⋅ Ran Yi ⋅ Guangben Lu ⋅ Teng Hu ⋅ LUYING LI ⋅ Lizhuang Ma ⋅ FangYuan Zou
ExHall D Poster #242
SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model Poster Session 1
Shuhan Tan ⋅ John Wheatley Lambert ⋅ Hong Jeon ⋅ Sakshum Kulshrestha ⋅ Yijing Bai ⋅ Jing Luo ⋅ Dragomir Anguelov ⋅ Mingxing Tan ⋅ Chiyu “Max” Jiang
ExHall D Poster #131
MINIMA: Modality Invariant Image Matching Poster Session 5
Jiangwei Ren ⋅ Xingyu Jiang ⋅ Zizhuo Li ⋅ Dingkang Liang ⋅ Xin Zhou ⋅ Xiang Bai
ExHall D Poster #190
Gaussian Eigen Models for Human Heads Poster Session 4
Wojciech Zielonka ⋅ Timo Bolkart ⋅ Thabo Beeler ⋅ Justus Thies
ExHall D Poster #7
CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model Poster Session 4
Ziyu Yao ⋅ Xuxin Cheng ⋅ Zhiqi Huang ⋅ Lei Li
ExHall D Poster #319
SapiensID: Foundation for Human Recognition Poster Session 3
Minchul Kim ⋅ Dingqiang Ye ⋅ Yiyang Su ⋅ Feng Liu ⋅ Xiaoming Liu
ExHall D Poster #314
Causal Composition Diffusion Model for Closed-loop Traffic Generation Poster Session 6
Haohong Lin ⋅ Xin Huang ⋅ Tung Phan-Minh ⋅ David S Hayden ⋅ Huan Zhang ⋅ DING ZHAO ⋅ Siddhartha Srinivasa ⋅ Eric M. Wolff ⋅ Hongge Chen
ExHall D Poster #132
BADGR: Bundle Adjustment Diffusion Conditioned by Gradients for Wide-Baseline Floor Plan Reconstruction Poster Session 4
Yuguang Li ⋅ Ivaylo Boyadzhiev ⋅ Zixuan Liu ⋅ Linda Shapiro ⋅ Alex Colburn
ExHall D Poster #92
Scaling Mesh Generation via Compressive Tokenization Poster Session 3
Haohan Weng ⋅ Zibo Zhao ⋅ Biwen Lei ⋅ Xianghui Yang ⋅ Jian Liu ⋅ Zeqiang Lai ⋅ Zhuo Chen ⋅ Liu Yuhong ⋅ Jie Jiang ⋅ Chunchao Guo ⋅ Tong Zhang ⋅ Shenghua Gao ⋅ C.L.Philip Chen
ExHall D Poster #42
Odd-One-Out: Anomaly Detection by Comparing with Neighbors Poster Session 4
Ankan Kumar Bhunia ⋅ Changjian Li ⋅ Hakan Bilen
ExHall D Poster #437
Pay Attention to the Foreground in Object-Centric Learning Poster Session 6
Pinzhuo Tian ⋅ Shengjie Yang ⋅ Hang Yu ⋅ Alex C. Kot
ExHall D Poster #396
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion Poster Session 3
Jongseong Bae ⋅ Junwoo Ha ⋅ Ha Young Kim
ExHall D Poster #125
MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis Poster Session 6
Tianyu Wang ⋅ Jianming Zhang ⋅ Haitian Zheng ⋅ Zhihong Ding ⋅ Scott Cohen ⋅ Zhe Lin ⋅ Wei Xiong ⋅ Chi-Wing Fu ⋅ Luis Figueroa ⋅ Soo Ye Kim
ExHall D Poster #199
RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety Poster Session 1
Andrei Dumitriu ⋅ Florin Tatui ⋅ Florin Miron ⋅ Aakash Ralhan ⋅ Radu Tudor Ionescu ⋅ Radu Timofte
ExHall D Poster #311
Low-Rank Adaptation in Multilinear Operator Networks for Security-Preserving Incremental Learning Poster Session 5
Huu Binh Ta ⋅ Duc Nguyen ⋅ Quyen Tran ⋅ Toan Tran ⋅ Tung Pham
ExHall D Poster #317
MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World Poster Session 3
Ankit Dhiman ⋅ Manan Shah ⋅ R. Venkatesh Babu
ExHall D Poster #56
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models Poster Session 1
Qingqing Zhao ⋅ Yao Lu ⋅ Moo Jin Kim ⋅ Zipeng Fu ⋅ Zhuoyang Zhang ⋅ Yecheng Wu ⋅ Max Li ⋅ Qianli Ma ⋅ Song Han ⋅ Chelsea Finn ⋅ Ankur Handa ⋅ Tsung-Yi Lin ⋅ Gordon Wetzstein ⋅ Ming-Yu Liu ⋅ Donglai Xiang
ExHall D Poster #143
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation Poster Session 1
Yichen Xie ⋅ Runsheng Xu ⋅ Tong He ⋅ Jyh-Jing Hwang ⋅ Katie Z Luo ⋅ Jingwei Ji ⋅ Hubert Lin ⋅ Letian Chen ⋅ Yiren Lu ⋅ Zhaoqi Leng ⋅ Dragomir Anguelov ⋅ Mingxing Tan
ExHall D Poster #136
T-FAKE: Synthesizing Thermal Images for Facial Landmarking Poster Session 6
Philipp Flotho ⋅ Moritz Piening ⋅ Anna Kukleva ⋅ Gabriele Steidl
ExHall D Poster #16
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models Poster Session 5
Xinghui Li ⋅ Qichao Sun ⋅ Pengze Zhang ⋅ Fulong Ye ⋅ Zhichao Liao ⋅ Wanquan Feng ⋅ Songtao Zhao ⋅ Qian HE
ExHall D Poster #258
Self-Supervised Spatial Correspondence Across Modalities Poster Session 2
Ayush Shrivastava ⋅ Andrew Owens
ExHall D Poster #96
Mind the Gap: Confidence Discrepancy Can Guide Federated Semi-Supervised Learning Across Pseudo-Mismatch Poster Session 2
Yijie Liu ⋅ Xinyi Shang ⋅ Yiqun Zhang ⋅ Yang Lu ⋅ Chen Gong ⋅ Jing-Hao Xue ⋅ Hanzi Wang
ExHall D Poster #457
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views Poster Session 5
Shangzhan Zhang ⋅ Jianyuan Wang ⋅ Yinghao Xu ⋅ Nan Xue ⋅ Christian Rupprecht ⋅ Xiaowei Zhou ⋅ Yujun Shen ⋅ Gordon Wetzstein
ExHall D Poster #84
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model Poster Session 4
Yingying Fan ⋅ Quanwei Yang ⋅ Kaisiyuan Wang ⋅ Hang Zhou ⋅ Yingying Li ⋅ Haocheng Feng ⋅ Errui Ding ⋅ Yu Wu ⋅ Jingdong Wang
ExHall D Poster #167
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models Poster Session 2
Yoojin Jung ⋅ Byung Cheol Song
ExHall D Poster #412
VisionArena: 230k Real World User-VLM Conversations with Preference Labels Poster Session 1
Christopher Chou ⋅ Lisa Dunlap ⋅ Wei-Lin Chiang ⋅ Koki Mashita ⋅ Krishna Mandal ⋅ Trevor Darrell ⋅ Ion Stoica ⋅ Joseph Gonzalez
ExHall D Poster #353
SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers Poster Session 2
Nikaan Nikzad ⋅ YI LIAO ⋅ Yongsheng Gao ⋅ Jun Zhou
ExHall D Poster #415
A Theory of Learning Unified Model via Knowledge Integration from Label Space Varying Domains Poster Session 2
Dexuan Zhang ⋅ Thomas Westfechtel ⋅ Tatsuya Harada
ExHall D Poster #454
Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning Poster Session 3
Isma Hadji ⋅ Mehdi Noroozi ⋅ Victor Escorcia ⋅ Anestis Zaganidis ⋅ Brais Martinez ⋅ Georgios Tzimiropoulos
ExHall D Poster #204
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving Poster Session 1
R.D. Lin ⋅ Pengcheng Weng ⋅ Yinqiao Wang ⋅ Han Ding ⋅ Jinsong Han ⋅ Fei Wang
ExHall D Poster #118
Spiking Transformer with Spatial-Temporal Attention Poster Session 3
Donghyun Lee ⋅ Yuhang Li ⋅ Youngeun Kim ⋅ Shiting Xiao ⋅ Priyadarshini Panda
ExHall D Poster #315
Geometric Knowledge-Guided Localized Global Distribution Alignment for Federated Learning Poster Session 5
Yanbiao Ma ⋅ Wei Dai ⋅ Wenke Huang ⋅ Jiayi Chen
ExHall D Poster #443
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture Poster Session 1
Qianlong Xiang ⋅ Miao Zhang ⋅ Yuzhang Shang ⋅ Jianlong Wu ⋅ Yan Yan ⋅ Liqiang Nie
ExHall D Poster #267
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Poster Session 5
Jan Held ⋅ Renaud Vandeghen ⋅ Abdullah J Hamdi ⋅ Anthony Cioppa ⋅ Adrien Deliege ⋅ Silvio Giancola ⋅ Andrea Vedaldi ⋅ Bernard Ghanem ⋅ Marc Van Droogenbroeck
ExHall D Poster #30
Focal Split: Untethered Snapshot Depth from Differential Defocus Poster Session 6
Junjie Luo ⋅ John Mamish ⋅ Alan Fu ⋅ Thomas Concannon ⋅ Josiah Hester ⋅ Emma Alexander ⋅ Qi Guo
ExHall D Poster #78
Reversible Decoupling Network for Single Image Reflection Removal Poster Session 6
Hao Zhao ⋅ Mingjia Li ⋅ Qiming Hu ⋅ Xiaojie Guo
ExHall D Poster #23
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model Poster Session 4
Zongjian Li ⋅ Bin Lin ⋅ Yang Ye ⋅ Liuhan Chen ⋅ Xinhua Cheng ⋅ Shenghai Yuan ⋅ Li Yuan
ExHall D Poster #188
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling Poster Session 5
Yifang Men ⋅ Yuan Yao ⋅ Miaomiao Cui ⋅ Liefeng Bo
ExHall D Poster #13
MLLM-as-a-Judge for Image Safety without Human Labeling Poster Session 3
Zhenting Wang ⋅ Shuming Hu ⋅ Shiyu Zhao ⋅ Xiaowen Lin ⋅ Felix Juefei-Xu ⋅ Zhuowei Li ⋅ Ligong Han ⋅ Harihar Subramanyam ⋅ Li Chen ⋅ Jianfa Chen ⋅ nan jiang ⋅ Lingjuan Lyu ⋅ Shiqing Ma ⋅ Dimitris N. Metaxas ⋅ Ankit Jain
ExHall D Poster #384
Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior Poster Session 3
Chanhui Lee ⋅ Yeonghwan Song ⋅ Jeany Son
ExHall D Poster #311
DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes Poster Session 2
Chensheng Peng ⋅ Chengwei Zhang ⋅ Yixiao Wang ⋅ Chenfeng Xu ⋅ Yichen Xie ⋅ Wenzhao Zheng ⋅ Kurt Keutzer ⋅ Masayoshi Tomizuka ⋅ Wei Zhan
ExHall D Poster #135
Satellite to GroundScape - Large-scale Consistent Ground View Generation from Satellite Views Poster Session 2
Ningli Xu ⋅ Rongjun Qin
ExHall D Poster #60
Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation Poster Session 4
Byung Hyun Lee ⋅ Sungjin Lim ⋅ Se Young Chun
ExHall D Poster #269
Continuous Space-Time Video Resampling with Invertible Motion Steganography Poster Session 1
Yuantong zhang ⋅ Zhenzhong Chen
ExHall D Poster #183
Point Clouds Meets Physics: Dynamic Acoustic Field Fitting Network for Point Cloud Understanding Poster Session 5
Changshuo Wang ⋅ Shuting He ⋅ Xiang Fang ⋅ Jiawei Han ⋅ Zhonghang Liu ⋅ Xin Ning ⋅ Weijun Li ⋅ Prayag Tiwari
ExHall D Poster #108
Conical Visual Concentration for Efficient Large Vision-Language Models Poster Session 3
Long Xing ⋅ Qidong Huang ⋅ Xiaoyi Dong ⋅ Jiajie Lu ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Conghui He ⋅ Jiaqi Wang ⋅ Feng Wu ⋅ Dahua Lin
ExHall D Poster #378
The Scene Language: Representing Scenes with Programs, Words, and Embeddings Poster Session 5
Yunzhi Zhang ⋅ Zizhang Li ⋅ Matt Zhou ⋅ Shangzhe Wu ⋅ Jiajun Wu
ExHall D Poster #344
Hyperbolic Uncertainty-Aware Few-Shot Incremental Point Cloud Segmentation Poster Session 3
Tanuj Sur ⋅ Samrat Mukherjee ⋅ Kaizer Rahaman ⋅ Subhasis Chaudhuri ⋅ Muhammad Haris Khan ⋅ Biplab Banerjee
ExHall D Poster #113
Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM Poster Session 2
Qiyuan Dai ⋅ Sibei Yang
ExHall D Poster #397
OralXrays-9: Towards Hospital-Scale Panoramic X-ray Anomaly Detection via Personalized Multi-Object Query-Aware Mining Poster Session 3
Bingzhi Chen ⋅ Sisi Fu ⋅ Xiaocheng Fang ⋅ Jieyi Cai ⋅ Boya Zhang ⋅ Minhua Lu ⋅ Yishu Liu
ExHall D Poster #471
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer Poster Session 2
Shaofei Huang ⋅ Rui Ling ⋅ Tianrui Hui ⋅ Hongyu Li ⋅ Xu Zhou ⋅ Shifeng Zhang ⋅ Si Liu ⋅ Richang Hong ⋅ Meng Wang
ExHall D Poster #285
MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction Poster Session 1
Xiaohao Xu ⋅ Feng Xue ⋅ Shibo Zhao ⋅ Yike Pan ⋅ Sebastian Scherer ⋅ Xiaonan Huang
ExHall D Poster #64
Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear Attention Poster Session 6
Saad Wazir ⋅ Daeyoung Kim
ExHall D Poster #451
DRiVE: Diffusion-based Rigging Empowers Generation of Versatile and Expressive Characters Poster Session 5
Mingze Sun ⋅ Junting Dong ⋅ Junhao Chen ⋅ Yurun Chen ⋅ Xinyu Jiang ⋅ Shiwei Mao ⋅ Puhua Jiang ⋅ Jingbo Wang ⋅ Bo Dai ⋅ Ruqi Huang
ExHall D Poster #12
SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception Poster Session 1
Yaniv Benny ⋅ Lior Wolf
ExHall D Poster #72
Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models Poster Session 5
Yankai Jiang ⋅ Peng Zhang ⋅ Donglin Yang ⋅ Yuan Tian ⋅ Hai Lin ⋅ Xiaosong Wang
ExHall D Poster #474
Tiled Diffusion Poster Session 2
Or Madar ⋅ Ohad Fried
ExHall D Poster #232
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better Poster Session 5
Zihang Lai ⋅ Andrea Vedaldi
ExHall D Poster #167
BOE-ViT: Boosting Orientation Estimation with Equivariance in Self-Supervised 3D Subtomogram Alignment Poster Session 6
Runmin Jiang ⋅ Jackson Daggett ⋅ Shriya Pingulkar ⋅ Yizhou Zhao ⋅ Priyanshu Dhingra ⋅ Daniel Brown ⋅ Qifeng Wu ⋅ Xiangrui Zeng ⋅ Xingjian Li ⋅ Min Xu
ExHall D Poster #306
Sufficient Invariant Learning for Distribution Shift Poster Session 1
Taero Kim ⋅ Subeen Park ⋅ Sungjun Lim ⋅ Yonghan Jung ⋅ Krikamol Muandet ⋅ Kyungwoo Song
ExHall D Poster #458
SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments Poster Session 5
Yue Cao ⋅ Yun Xing ⋅ Jie Zhang ⋅ Di Lin ⋅ Tianwei Zhang ⋅ Ivor Tsang ⋅ Yang Liu ⋅ Qing Guo
ExHall D Poster #383
HUNet: Homotopy Unfolding Network for Image Compressive Sensing Poster Session 3
Feiyang Shen ⋅ Hongping Gan
ExHall D Poster #205
ViUniT: Visual Unit Tests for More Robust Visual Programming Poster Session 5
Artemis Panagopoulou ⋅ Honglu Zhou ⋅ Silvio Savarese ⋅ Caiming Xiong ⋅ Chris Callison-Burch ⋅ Mark Yatskar ⋅ Juan Carlos Niebles
ExHall D Poster #346
The Devil is in Low-Level Features for Cross-Domain Few-Shot Segmentation Poster Session 1
Yuhan Liu ⋅ Yixiong Zou ⋅ Yuhua Li ⋅ Ruixuan Li
ExHall D Poster #426
Let Humanoids Hike! Integrative Skill Development on Complex Trails Poster Session 5
Kwan-Yee Lin ⋅ Stella X. Yu
ExHall D Poster #137
MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects Poster Session 3
Kevin Zhang ⋅ Jia-Bin Huang ⋅ Jose Echevarria ⋅ Stephen DiVerdi ⋅ Aaron Hertzmann
ExHall D Poster #25
Audio-Visual Semantic Graph Network for Audio-Visual Event Localization Poster Session 5
Liang Liu ⋅ Shuaiyong Li ⋅ Yongqiang Zhu
ExHall D Poster #281
Gaussian Splatting for Efficient Satellite Image Photogrammetry Poster Session 2
Luca Savant Aira ⋅ Gabriele Facciolo ⋅ Thibaud Ehret
ExHall D Poster #49
LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning Poster Session 3
Xuan Liu ⋅ Xiaobin Chang
ExHall D Poster #446
MATCHA: Towards Matching Anything Poster Session 6
Fei Xue ⋅ Sven Elflein ⋅ Laura Leal-Taixe ⋅ Qunjie Zhou
ExHall D Poster #89
CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-Resolution Poster Session 4
Xin Liu ⋅ Jie Liu ⋅ Jie Tang ⋅ Gangshan Wu
ExHall D Poster #200
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration Poster Session 1
Chaojun Ni ⋅ Guosheng Zhao ⋅ Xiaofeng Wang ⋅ Zheng Zhu ⋅ Wenkang Qin ⋅ Guan Huang ⋅ Chen Liu ⋅ Yuyin Chen ⋅ Yida Wang ⋅ Xueyang Zhang ⋅ Yifei Zhan ⋅ Kun Zhan ⋅ Peng Jia ⋅ XianPeng Lang ⋅ Xingang Wang ⋅ Wenjun Mei
ExHall D Poster #130
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy Poster Session 1
You Li ⋅ Fan Ma ⋅ Yi Yang
ExHall D Poster #363
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation Poster Session 4
Bolin Lai ⋅ Felix Juefei-Xu ⋅ Miao Liu ⋅ Xiaoliang Dai ⋅ Nikhil Mehta ⋅ Chenguang Zhu ⋅ Zeyi Huang ⋅ James Rehg ⋅ Sangmin Lee ⋅ Ning Zhang ⋅ Tong Xiao
ExHall D Poster #243
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing Poster Session 5
Hanhui Wang ⋅ Yihua Zhang ⋅ Ruizheng Bai ⋅ Yue Zhao ⋅ Sijia Liu ⋅ Zhengzhong Tu
ExHall D Poster #267
A Physics-Informed Blur Learning Framework for Imaging Systems Poster Session 3
liqun.chen ⋅ Yuxuan Li ⋅ Jun Dai ⋅ Jinwei Gu ⋅ Tianfan Xue
ExHall D Poster #24
Infighting in the Dark: Multi-Label Backdoor Attack in Federated Learning Poster Session 5
Ye Li ⋅ Yanchao Zhao ⋅ Chengcheng Zhu ⋅ Jiale Zhang
ExHall D Poster #453
Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors Poster Session 4
Zhengfei Kuang ⋅ Tianyuan Zhang ⋅ Kai Zhang ⋅ Hao Tan ⋅ Sai Bi ⋅ Yiwei Hu ⋅ Zexiang Xu ⋅ Milos Hasan ⋅ Gordon Wetzstein ⋅ Fujun Luan
ExHall D Poster #177
PSHuman: Photorealistic Single-image 3D Human Reconstruction using Cross-Scale Multiview Diffusion and Explicit Remeshing Poster Session 4
Peng Li ⋅ Wangguandong Zheng ⋅ Yuan Liu ⋅ Tao Yu ⋅ Yangguang Li ⋅ Xingqun Qi ⋅ Xiaowei Chi ⋅ Siyu Xia ⋅ Yan-Pei Cao ⋅ Wei Xue ⋅ Wenhan Luo ⋅ Yike Guo
ExHall D Poster #14
Tartan IMU: A Light Foundation Model for Inertial Positioning in Robotics Poster Session 5
Shibo Zhao ⋅ Sifan Zhou ⋅ Raphael Blanchard ⋅ Yuheng Qiu ⋅ Wenshan Wang ⋅ Sebastian Scherer
ExHall D Poster #139
Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions Poster Session 4
He Zhu ⋅ Quyu Kong ⋅ Kechun Xu ⋅ Xunlong Xia ⋅ Bing Deng ⋅ Jieping Ye ⋅ Rong Xiong ⋅ Yue Wang
ExHall D Poster #148
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding Poster Session 1
Wenhui Liao ⋅ Jiapeng Wang ⋅ Hongliang Li ⋅ Chengyu Wang ⋅ Jun Huang ⋅ Lianwen Jin
ExHall D Poster #368
DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation Poster Session 3
Guosheng Zhao ⋅ Chaojun Ni ⋅ Xiaofeng Wang ⋅ Zheng Zhu ⋅ Xueyang Zhang ⋅ Yida Wang ⋅ Guan Huang ⋅ xinze chen ⋅ Boyuan Wang ⋅ Youyi Zhang ⋅ Wenjun Mei ⋅ Xingang Wang
ExHall D Poster #132
Handling Spatial-Temporal Data Heterogeneity for Federated Continual Learning via Tail Anchor Poster Session 1
Hao Yu ⋅ Xin Yang ⋅ Le Zhang ⋅ Hanlin Gu ⋅ Tianrui Li ⋅ Lixin Fan ⋅ Qiang Yang
ExHall D Poster #450
Hiding Images in Diffusion Models by Editing Learned Score Functions Poster Session 4
Haoyu Chen ⋅ Yunqiao Yang ⋅ Nan Zhong ⋅ Kede Ma
ExHall D Poster #275
End-to-End HOI Reconstruction Transformer with Graph-based Encoding Poster Session 6
Zhenrong Wang ⋅ Qi Zheng ⋅ Sihan Ma ⋅ Maosheng Ye ⋅ Yibing Zhan ⋅ Dongjiang Li
ExHall D Poster #147
WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion Poster Session 4
Yang Wu ⋅ Yun Zhu ⋅ Kaihua Zhang ⋅ Jianjun Qian ⋅ Jin Xie ⋅ Jian Yang
ExHall D Poster #115
IDOL: Instant Photorealistic 3D Human Creation from a Single Image Poster Session 6
Yiyu Zhuang ⋅ Jiaxi Lv ⋅ Hao Wen ⋅ Qing Shuai ⋅ Ailing Zeng ⋅ Hao Zhu ⋅ Shifeng Chen ⋅ Yujiu Yang ⋅ Xun Cao ⋅ Wei Liu
ExHall D Poster #10
Decouple Distortion from Perception: Region Adaptive Diffusion for Extreme-low Bitrate Perception Image Compression Poster Session 4
Jinchang Xu ⋅ Shaokang Wang ⋅ Jintao Chen ⋅ Zhe Li ⋅ Peidong Jia ⋅ Fei Zhao ⋅ Guoqing Xiang ⋅ Zhijian Hao ⋅ Shanghang Zhang ⋅ Xiaodong Xie
ExHall D Poster #214
SketchVideo: Sketch-based Video Generation and Editing Poster Session 5
Feng-Lin Liu ⋅ Hongbo Fu ⋅ Xintao Wang ⋅ Weicai Ye ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Lin Gao
ExHall D Poster #222
PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations? Poster Session 3
Martin Spitznagel ⋅ Jan Vaillant ⋅ Janis Keuper
ExHall D Poster #45
Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting Poster Session 5
Maochen Yang ⋅ Zekun Li ⋅ Jian Zhang ⋅ Lei Qi ⋅ Yinghuan Shi
ExHall D Poster #326
Balanced Direction from Multifarious Choices: Arithmetic Meta-Learning for Domain Generalization Poster Session 6
Xiran Wang ⋅ Jian Zhang ⋅ Lei Qi ⋅ Yinghuan Shi
ExHall D Poster #424
Hybrid Concept Bottleneck Models Poster Session 4
Yang Liu ⋅ Tianwei Zhang ⋅ Shi Gu
ExHall D Poster #417
Leveraging Perturbation Robustness to Enhance Out-of-Distribution Detection Poster Session 1
Wenxi Chen ⋅ Raymond A. Yeh ⋅ Shaoshuai Mou ⋅ Yan Gu
ExHall D Poster #436
Neural Motion Simulator Pushing the Limit of World Models in Reinforcement Learning Poster Session 6
Chenjie Hao ⋅ Weyl Lu ⋅ Yifan Xu ⋅ Yubei Chen
ExHall D Poster #138
Adversarial Diffusion Compression for Real-World Image Super-Resolution Poster Session 6
Bin Chen ⋅ Gehui Li ⋅ Rongyuan Wu ⋅ Xindong Zhang ⋅ Jie Chen ⋅ Jian Zhang ⋅ Lei Zhang
ExHall D Poster #195
BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting Poster Session 4
Yiren Lu ⋅ Yunlai Zhou ⋅ Disheng Liu ⋅ Tuo Liang ⋅ Yu Yin
ExHall D Poster #66
Effortless Active Labeling for Long-Term Test-Time Adaptation Poster Session 5
Guowei Wang ⋅ Changxing Ding
ExHall D Poster #439
Towards Universal AI-Generated Image Detection by Variational Information Bottleneck Network Poster Session 5
Haifeng Zhang ⋅ Qinghui He ⋅ Xiuli Bi ⋅ Weisheng Li ⋅ Bo Liu ⋅ Bin Xiao
ExHall D Poster #269
V2V3D: View-to-View Denoised 3D Reconstruction for Light Field Microscopy Poster Session 6
Jiayin Zhao ⋅ Zhenqi Fu ⋅ Tao Yu ⋅ Hui Qiao
ExHall D Poster #25
A Unified Framework for Heterogeneous Semi-supervised Learning Poster Session 3
Marzi Heidari ⋅ Abdullah Alchihabi ⋅ Hao Yan ⋅ Yuhong Guo
ExHall D Poster #452
DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels Poster Session 3
Erjian Guo ⋅ Zhen Zhao ⋅ Zicheng Wang ⋅ Tong Chen ⋅ YUNYI LIU ⋅ Luping Zhou
ExHall D Poster #352
Splatter-360: Generalizable 360 Gaussian Splatting for Wide-baseline Panoramic Images Poster Session 5
Zheng Chen ⋅ Chenming Wu ⋅ Zhelun Shen ⋅ Chen Zhao ⋅ Weicai Ye ⋅ Haocheng Feng ⋅ Errui Ding ⋅ Song-Hai Zhang
ExHall D Poster #51
ShowMak3r: Compositional TV Show Reconstruction Poster Session 1
Sangmin Kim ⋅ Seunguk Do ⋅ Jaesik Park
ExHall D Poster #65
Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion Poster Session 6
ZhiFei Chen ⋅ Tianshuo Xu ⋅ Wenhang Ge ⋅ Leyi Wu ⋅ Dongyu Yan ⋅ Jing He ⋅ Luozhou Wang ⋅ Lu Zeng ⋅ Shunsi Zhang ⋅ Ying-Cong Chen
ExHall D Poster #33
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark Poster Session 1
Li Lin ⋅ Santosh Santosh ⋅ Mingyang Wu ⋅ Xin Wang ⋅ Shu Hu
ExHall D Poster #318
Learning to Filter Outlier Edges in Global SfM Poster Session 3
Nicole Damblon ⋅ Marc Pollefeys ⋅ Daniel Barath
ExHall D Poster #87
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation Poster Session 2
Qihui Zhang ⋅ Munan Ning ⋅ Zheyuan Liu ⋅ Yanbo Wang ⋅ Jiayi Ye ⋅ Yue Huang ⋅ Shuo Yang ⋅ Xiao Chen ⋅ Yibing Song ⋅ Li Yuan
ExHall D Poster #362
S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors Poster Session 4
Xingyu Ren ⋅ Jiankang Deng ⋅ Yuhao Cheng ⋅ Wenhan Zhu ⋅ Yichao Yan ⋅ Xiaokang Yang ⋅ Stefanos Zafeiriou ⋅ Chao Ma
ExHall D Poster #18
Interactive Medical Image Analysis with Concept-based Similarity Reasoning Poster Session 6
Ta Duc Huy ⋅ Sen Kim Tran ⋅ Phan Nguyen ⋅ Nguyen Hoang Tran ⋅ Tran Bao Sam ⋅ Anton van den Hengel ⋅ Zhibin Liao ⋅ Johan Verjans ⋅ Minh-Son To ⋅ Vu Minh Hieu Phan
ExHall D Poster #445
FSBench: A Figure Skating Benchmark for Advancing Artistic Sports Understanding Poster Session 3
Rong Gao ⋅ Xin Liu ⋅ Zhuozhao Hu ⋅ Bohao Xing ⋅ Baiqiang XIA ⋅ Zitong YU ⋅ Heikki Kälviäinen
ExHall D Poster #281
MLVU: Benchmarking Multi-task Long Video Understanding Poster Session 3
Junjie Zhou ⋅ Yan Shu ⋅ Bo Zhao ⋅ Boya Wu ⋅ Zhengyang Liang ⋅ Shitao Xiao ⋅ Minghao Qin ⋅ Xi Yang ⋅ yongping xiong ⋅ Bo Zhang ⋅ Tiejun Huang ⋅ Zheng Liu
ExHall D Poster #291
Towards Understanding How Knowledge Evolves in Large Vision-Language Models Poster Session 6
Sudong Wang ⋅ Yunjian Zhang ⋅ Yao Zhu ⋅ Jianing Li ⋅ Zizhe Wang ⋅ Yanwei Liu ⋅ Xiangyang Ji
ExHall D Poster #355
A Unified, Resilient, and Explainable Adversarial Patch Detector Poster Session 6
Vishesh Kumar ⋅ Akshay Agarwal
ExHall D Poster #406
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise Poster Session 1
Ryan Burgert ⋅ Yuancheng Xu ⋅ Wenqi Xian ⋅ Oliver Pilarski ⋅ Pascal Clausen ⋅ Mingming He ⋅ Li Ma ⋅ Yitong Deng ⋅ Lingxiao Li ⋅ Mohsen Mousavi ⋅ Michael Ryoo ⋅ Paul Debevec ⋅ Ning Yu
ExHall D Poster #174
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation Poster Session 1
Weiming Ren ⋅ Huan Yang ⋅ Jie Min ⋅ Cong Wei ⋅ Wenhu Chen
ExHall D Poster #346
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? Poster Session 4
Junbo Niu ⋅ Yifei Li ⋅ Ziyang Miao ⋅ Chunjiang Ge ⋅ Zhou Yuanhang ⋅ Qihao He ⋅ Xiaoyi Dong ⋅ Haodong Duan ⋅ Shuangrui Ding ⋅ Rui Qian ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Conghui He ⋅ Jiaqi Wang
ExHall D Poster #297
CoMapGS: Covisibility Map-based Gaussian Splatting for Sparse Novel View Synthesis Poster Session 6
Youngkyoon Jang ⋅ Eduardo Pérez-Pellitero
ExHall D Poster #61
NLPrompt: Noise-Label Prompt Learning for Vision-Language Models Poster Session 4
Bikang Pan ⋅ Qun Li ⋅ Xiaoying Tang ⋅ Wei Huang ⋅ Zhen Fang ⋅ Feng Liu ⋅ Jingya Wang ⋅ Jingyi Yu ⋅ Ye Shi
ExHall D Poster #397
PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields Poster Session 3
Sean Wu ⋅ Shamik Basu ⋅ Tim Broedermann ⋅ Luc Van Gool ⋅ Christos Sakaridis
ExHall D Poster #31
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation Poster Session 1
Yuanbo Yang ⋅ Jiahao Shao ⋅ Xinyang Li ⋅ Yujun Shen ⋅ Andreas Geiger ⋅ Yiyi Liao
ExHall D Poster #258
No Pains, More Gains: Recycling Sub-Salient Patches for Efficient High-Resolution Image Recognition Poster Session 3
Rong Qin ⋅ Xin Liu ⋅ Xingyu Liu ⋅ Jiaxuan Liu ⋅ Jinglei Shi ⋅ Liang Lin ⋅ Jufeng Yang
ExHall D Poster #413
HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization Poster Session 1
Zitang Zhou ⋅ Ke Mei ⋅ Yu Lu ⋅ Tianyi Wang ⋅ Fengyun Rao
ExHall D Poster #286
Rethinking Diffusion for Text-Driven Human Motion Generation: Redundant Representations, Evaluation, and Masked Autoregression Poster Session 6
Zichong Meng ⋅ Yiming Xie ⋅ Xiaogang Peng ⋅ Zeyu Han ⋅ Huaizu Jiang
ExHall D Poster #161
OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking Poster Session 1
Xuanyu Zhang ⋅ Zecheng Tang ⋅ Zhipei Xu ⋅ Runyi Li ⋅ Youmin Xu ⋅ Bin Chen ⋅ Feng Gao ⋅ Jian Zhang
ExHall D Poster #272
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large Language Models Poster Session 3
Hao Yin ⋅ Guangzong Si ⋅ Zilei Wang
ExHall D Poster #381
OW-OVD: Unified Open World and Open Vocabulary Object Detection Poster Session 5
Xing Xi ⋅ Yangyang Huang ⋅ Ronghua Luo ⋅ Yu Qiu
ExHall D Poster #421
Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction Poster Session 5
Dongxu Wei ⋅ Zhiqi Li ⋅ Peidong Liu
ExHall D Poster #121
Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing Poster Session 5
Bingliang Zhang ⋅ Wenda Chu ⋅ Julius Berner ⋅ Chenlin Meng ⋅ Anima Anandkumar ⋅ Yang Song
ExHall D Poster #200
VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification Poster Session 1
Xianwei Zhuang ⋅ Zhihong Zhu ⋅ Yuxin Xie ⋅ Liming Liang ⋅ Yuexian Zou
ExHall D Poster #384
SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models Poster Session 1
Kevin Miller ⋅ Aditya Gangrade ⋅ Samarth Mishra ⋅ Kate Saenko ⋅ Venkatesh Saligrama
ExHall D Poster #398
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation Poster Session 3
Kunpeng Qiu ⋅ Zhiqiang Gao ⋅ Zhiying Zhou ⋅ MINGJIE SUN ⋅ Yongxin Guo
ExHall D Poster #481
DefectFill: Realistic Defect Generation with Inpainting Diffusion Model for Visual Inspection Poster Session 4
Jaewoo Song ⋅ Daemin Park ⋅ Kanghyun Baek ⋅ Sangyub Lee ⋅ Jooyoung Choi ⋅ Eunji Kim ⋅ Sungroh Yoon
ExHall D Poster #280
FedMIA: An Effective Membership Inference Attack Exploiting "All for One" Principle in Federated Learning Poster Session 4
Gongxi Zhu ⋅ Donghao Li ⋅ Hanlin Gu ⋅ Yuan Yao ⋅ Lixin Fan ⋅ Yuxing Han
ExHall D Poster #460
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework Poster Session 4
Henrique Morimitsu ⋅ Xiaobin Zhu ⋅ Roberto M. Cesar Jr ⋅ Xiangyang Ji ⋅ Xu-Cheng Yin
ExHall D Poster #191
Ferret: An Efficient Online Continual Learning Framework under Varying Memory Constraints Poster Session 1
Yuhao Zhou ⋅ Yuxin Tian ⋅ Jindi Lv ⋅ Mingjia Shi ⋅ Yuanxi Li ⋅ Qing Ye ⋅ Shuhao Zhang ⋅ Jiancheng Lv
ExHall D Poster #448
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling Poster Session 3
Junha Hyung ⋅ Kinam Kim ⋅ Susung Hong ⋅ Min-Jung Kim ⋅ Jaegul Choo
ExHall D Poster #34
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Poster Session 1
Xudong LU ⋅ Yinghao Chen ⋅ chencheng Chen ⋅ Hui Tan ⋅ Boheng Chen ⋅ yina xie ⋅ Rui Hu ⋅ Guanxin tan ⋅ Renshou Wu ⋅ Yan Hu ⋅ Yi Zeng ⋅ Lei Wu ⋅ Liuyang Bian ⋅ Zhaoxiong Wang ⋅ Long Liu ⋅ Yanzhou Yang ⋅ Han Xiao ⋅ Aojun Zhou ⋅ Yafei Wen ⋅ Xiaoxin Chen ⋅ Shuai Ren ⋅ Hongsheng Li
ExHall D Poster #379
Taming Teacher Forcing for Masked Autoregressive Video Generation Poster Session 2
Deyu Zhou ⋅ Quan Sun ⋅ Yuang Peng ⋅ Kun Yan ⋅ Runpei Dong ⋅ Duomin Wang ⋅ Zheng Ge ⋅ Nan Duan ⋅ Xiangyu Zhang
ExHall D Poster #192
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE Poster Session 6
YONGWEI CHEN ⋅ Yushi Lan ⋅ Shangchen Zhou ⋅ Tengfei Wang ⋅ Xingang Pan
ExHall D Poster #210
Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation Poster Session 5
Yiftach Edelstein ⋅ Or Patashnik ⋅ Dana Cohen-Bar ⋅ Lihi Zelnik-Manor
ExHall D Poster #39
URWKV: Unified RWKV Model with Multi-state Perspective for Low-light Image Restoration Poster Session 5
Rui Xu ⋅ Yuzhen Niu ⋅ Yuezhou Li ⋅ Huangbiao Xu ⋅ Wenxi Liu ⋅ Yuzhong Chen
ExHall D Poster #21
Revisiting Backdoor Attacks against Large Vision-Language Models from Domain Shift Poster Session 2
Siyuan Liang ⋅ Jiawei Liang ⋅ Tianyu Pang ⋅ Chao Du ⋅ Aishan Liu ⋅ Mingli Zhu ⋅ Xiaochun Cao ⋅ Dacheng Tao
ExHall D Poster #391
Condensing Action Segmentation Datasets via Generative Network Inversion Poster Session 4
Guodong Ding ⋅ Rongyu Chen ⋅ Angela Yao
ExHall D Poster #184
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation Poster Session 2
Kaiyue Sun ⋅ Kaiyi Huang ⋅ Xian Liu ⋅ Yue Wu ⋅ Zihan Xu ⋅ Zhenguo Li ⋅ Xihui Liu
ExHall D Poster #290
Self-Evolving Visual Concept Library using Vision-Language Critics Poster Session 3
Atharva Sehgal ⋅ Patrick Yuan ⋅ Ziniu Hu ⋅ Yisong Yue ⋅ Jennifer J. Sun ⋅ Swarat Chaudhuri
ExHall D Poster #236
TFCustom: Customized Image Generation with Time-Aware Frequency Feature Guidance Poster Session 1
Mushui Liu ⋅ Dong She ⋅ Qihan Huang ⋅ Jiacheng Ying ⋅ Wanggui He ⋅ Jingxuan Pang ⋅ Yuanlei Hou ⋅ Siming Fu
ExHall D Poster #244
SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models Poster Session 1
Subhadeep Koley ⋅ Tapas Kumar Dutta ⋅ Aneeshan Sain ⋅ Pinaki Nath Chowdhury ⋅ Ayan Kumar Bhunia ⋅ Yi-Zhe Song
ExHall D Poster #229
Invisible Backdoor Attack against Self-supervised Learning Poster Session 5
Hanrong Zhang ⋅ Zhenting Wang ⋅ Boheng Li ⋅ Fulin Lin ⋅ Tingxu Han ⋅ Mingyu Jin ⋅ Chenlu Zhan ⋅ Mengnan Du ⋅ Hongwei Wang ⋅ Shiqing Ma
ExHall D Poster #455
BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with Transformer Poster Session 5
Yuzhou Liu ⋅ Lingjie Zhu ⋅ Hanqiao Ye ⋅ Shangfeng Huang ⋅ Xiang Gao ⋅ Xianwei Zheng ⋅ Shuhan Shen
ExHall D Poster #111
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models Poster Session 5
Jinjin Zhang ⋅ qiuyu Huang ⋅ Junjie Liu ⋅ Xiefan Guo ⋅ Di Huang
ExHall D Poster #230
CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement Poster Session 1
Yun Liu ⋅ Chengwen Zhang ⋅ Ruofan Xing ⋅ Bingda Tang ⋅ Bowen Yang ⋅ Li Yi
ExHall D Poster #149
PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram Poster Session 6
Sifan Zhou ⋅ Zhihang Yuan ⋅ Dawei Yang ⋅ Ziyu Zhao ⋅ Jian Qian ⋅ Xing Hu
ExHall D Poster #113
POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality Poster Session 1
Joey Wilson ⋅ Marcelino M. de Almeida ⋅ Sachit Mahajan ⋅ Martin Labrie ⋅ Maani Ghaffari ⋅ Omid Ghasemalizadeh ⋅ Min Sun ⋅ Cheng-Hao Kuo ⋅ Arnab Sen
ExHall D Poster #331
StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements Poster Session 5
Mingkun Lei ⋅ Xue Song ⋅ Beier Zhu ⋅ Hao Wang ⋅ Chi Zhang
ExHall D Poster #228
Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility Poster Session 4
Yidi Li ⋅ Jun Xiao ⋅ Zhengda Lu ⋅ Yiqun Wang ⋅ Haiyong Jiang
ExHall D Poster #263
Semantic and Expressive Variations in Image Captions Across Languages Poster Session 6
Andre Ye ⋅ Sebastin Santy ⋅ Jena D. Hwang ⋅ Amy X Zhang ⋅ Ranjay Krishna
ExHall D Poster #337
ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models Poster Session 5
Xubing Ye ⋅ Yukang Gan ⋅ Yixiao Ge ⋅ Xiao-Ping Zhang ⋅ Yansong Tang
ExHall D Poster #376
CroCoDL: Cross-device Collaborative Dataset for Localization Poster Session 6
Hermann Blum ⋅ Alessandro Mercurio ⋅ Joshua O'Reilly ⋅ Tim Engelbracht ⋅ Mihai Dusmanu ⋅ Marc Pollefeys ⋅ Zuria Bauer
ExHall D Poster #121
Glossy Object Reconstruction with Cost-effective Polarized Acquisition Poster Session 1
Bojian Wu ⋅ YIFAN PENG ⋅ Ruizhen Hu ⋅ Xiaowei Zhou
ExHall D Poster #24
PURA: Parameter Update-Recovery Test-Time Adaption for RGB-T Tracking Poster Session 5
Zekai Shao ⋅ Yufan Hu ⋅ Bin Fan ⋅ Hongmin Liu
ExHall D Poster #99
Generalizable Object Keypoint Localization from Generative Priors Poster Session 4
Dongkai Wang ⋅ Jiang Duan ⋅ Liangjian Wen ⋅ Shiyu Xuan ⋅ Hao CHEN ⋅ Shiliang Zhang
ExHall D Poster #425
L-SWAG: Layer-Sample Wise Activation with Gradients Information for Zero-Shot NAS on Vision Transformers Poster Session 1
Sofia Casarin ⋅ Sergio Escalera ⋅ Oswald Lanz
ExHall D Poster #410
Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts Poster Session 2
Qizhou Chen ⋅ Chengyu Wang ⋅ Dakan Wang ⋅ Taolin Zhang ⋅ Wangyue Li ⋅ Xiaofeng He
ExHall D Poster #389
PartGen: Part-level 3D Generation and Reconstruction with Multi-view Diffusion Models Poster Session 2
Minghao Chen ⋅ Roman Shapovalov ⋅ Iro Laina ⋅ Tom Monnier ⋅ Jianyuan Wang ⋅ David Novotny ⋅ Andrea Vedaldi
ExHall D Poster #42
Chebyshev Attention Depth Permutation Texture Network with Latent Texture Attribute Loss Poster Session 5
Ravishankar Evani ⋅ Deepu Rajan ⋅ Shangbo Mao
ExHall D Poster #226
FedCALM: Conflict-aware Layer-wise Mitigation for Selective Aggregation in Deeper Personalized Federated Learning Poster Session 3
Hao Zheng ⋅ Zhigang Hu ⋅ Boyu Wang ⋅ Liu Yang ⋅ Meiguang Zheng ⋅ Aikun Xu
ExHall D Poster #459
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting Poster Session 4
Hengyu Liu ⋅ Yuehao Wang ⋅ Chenxin Li ⋅ Ruisi Cai ⋅ Kevin Wang ⋅ Wuyang Li ⋅ Pavlo Molchanov ⋅ Peihao Wang ⋅ Zhangyang Wang
ExHall D Poster #47
Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters Poster Session 6
Yuan Wang ⋅ Ouxiang Li ⋅ Tingting Mu ⋅ Yanbin Hao ⋅ Kuien Liu ⋅ Xiang Wang ⋅ Xiangnan He
ExHall D Poster #247
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation Poster Session 5
Kun Liu ⋅ Qi Liu ⋅ Xinchen Liu ⋅ Jie Li ⋅ Yongdong Zhang ⋅ Jiebo Luo ⋅ Xiaodong He ⋅ Wu Liu
ExHall D Poster #285
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation Poster Session 3
Lijun Li ⋅ Zhelun Shi ⋅ Xuhao Hu ⋅ Bowen Dong ⋅ Yiran Qin ⋅ Xihui Liu ⋅ Lu Sheng ⋅ Jing Shao
ExHall D Poster #260
Order-One Rolling Shutter Cameras Poster Session 6
Marvin Anas Hahn ⋅ Kathlén Kohn ⋅ Orlando Marigliano ⋅ Tomas Pajdla
ExHall D Poster #82
Synthetic Prior for Few-Shot Drivable Head Avatar Inversion Poster Session 3
Wojciech Zielonka ⋅ Stephan J. Garbin ⋅ Alexandros Lattas ⋅ George Kopanas ⋅ Paulo Gotardo ⋅ Thabo Beeler ⋅ Justus Thies ⋅ Timo Bolkart
ExHall D Poster #8
Focusing on Tracks for Online Multi-Object Tracking Poster Session 3
Kyujin Shim ⋅ Kangwook Ko ⋅ YuJin Yang ⋅ Changick Kim
ExHall D Poster #100
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment Poster Session 5
Yan Li ⋅ Yifei Xing ⋅ Xiangyuan Lan ⋅ Xin Li ⋅ Haifeng Chen ⋅ Dongmei Jiang
ExHall D Poster #358
VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models Poster Session 6
Dahun Kim ⋅ AJ Piergiovanni ⋅ Ganesh Satish Mallya ⋅ Anelia Angelova
ExHall D Poster #279
MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing Poster Session 2
Feifei Shao ⋅ Ping Liu ⋅ Zhao Wang ⋅ Yawei Luo ⋅ Hongwei Wang ⋅ Jun Xiao
ExHall D Poster #119
Can Text-to-Video Generation help Video-Language Alignment? Poster Session 5
Luca Zanella ⋅ Massimiliano Mancini ⋅ Willi Menapace ⋅ Sergey Tulyakov ⋅ Yiming Wang ⋅ Elisa Ricci
ExHall D Poster #294
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition Poster Session 2
Yifei Zhang ⋅ Chang Liu ⋅ Jin Wei ⋅ Xiaomeng Yang ⋅ Yu ZHOU ⋅ Can Ma ⋅ Xiangyang Ji
ExHall D Poster #376
Scaling up Image Segmentation across Data and Tasks Poster Session 1
Pei Wang ⋅ Zhaowei Cai ⋅ Hao Yang ⋅ Ashwin Swaminathan ⋅ R. Manmatha ⋅ Stefano Soatto
ExHall D Poster #422
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation Poster Session 5
Shuwei Shi ⋅ Biao Gong ⋅ Xi Chen ⋅ DanDan Zheng ⋅ Shuai Tan ⋅ Zizheng Yang ⋅ Yuyuan Li ⋅ Jingwen He ⋅ Kecheng Zheng ⋅ Jingdong Chen ⋅ Ming Yang ⋅ Yinqiang Zheng
ExHall D Poster #172
Take the Bull by the Horns: Learning to Segment Hard Samples Poster Session 3
Yuan Guo ⋅ Jingyu Kong ⋅ Yu Wang ⋅ Yuping Duan
ExHall D Poster #478
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning Poster Session 2
Bozhou Zhang ⋅ Nan Song ⋅ Xin Jin ⋅ Li Zhang
ExHall D Poster #142
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation Poster Session 4
Diljeet Jagpal ⋅ Xi Chen ⋅ Vinay P. Namboodiri
ExHall D Poster #231
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts Poster Session 2
Yu Cao ⋅ Zengqun Zhao ⋅ Ioannis Patras ⋅ Shaogang Gong
ExHall D Poster #224
Cross-View Completion Models are Zero-shot Correspondence Estimators Poster Session 1
Honggyu An ⋅ Jin Hyeon Kim ⋅ Seonghoon Park ⋅ Sunghwan Hong ⋅ Jaewoo Jung ⋅ Jisang Han ⋅ Seungryong Kim
ExHall D Poster #87
Multi-party Collaborative Attention Control for Image Customization Poster Session 2
Han Yang ⋅ Chuanguang Yang ⋅ Qiuli Wang ⋅ Zhulin An ⋅ Weilun Feng ⋅ Libo Huang ⋅ Yongjun Xu
ExHall D Poster #246
Reproducible Vision-Language Models Meet Concepts Out of Pre-Training Poster Session 3
Ziliang Chen ⋅ Xin Huang ⋅ Xiaoxuan Fan ⋅ Keze Wang ⋅ Yuyu Zhou ⋅ Quanlong Guan ⋅ Liang Lin
ExHall D Poster #388
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation Poster Session 4
Guy Yariv ⋅ Yuval Kirstain ⋅ Amit Zohar ⋅ Shelly Sheynin ⋅ Yaniv Taigman ⋅ Yossi Adi ⋅ Sagie Benaim ⋅ Adam Polyak
ExHall D Poster #228
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data Poster Session 6
Zengqun Zhao ⋅ Ziquan Liu ⋅ Yu Cao ⋅ Shaogang Gong ⋅ Ioannis Patras
ExHall D Poster #246
ReWind: Understanding Long Videos with Instructed Learnable Memory Poster Session 3
Anxhelo Diko ⋅ Tinghuai Wang ⋅ Wassim Swaileh ⋅ Shiyan Sun ⋅ Ioannis Patras
ExHall D Poster #295
Segment Anything, Even Occluded Poster Session 6
Wei-En Tai ⋅ Yu-Lin Shih ⋅ Cheng Sun ⋅ Yu-Chiang Frank Wang ⋅ Hwann-Tzong Chen
ExHall D Poster #309
Advancing Multiple Instance Learning with Continual Learning for Whole Slide Imaging Poster Session 4
Xianrui Li ⋅ Yufei Cui ⋅ Jun Li ⋅ Antoni B. Chan
ExHall D Poster #475
ABBSPO: Adaptive Bounding Box Scaling and Symmetric Prior based Orientation Prediction for Detecting Aerial Image Objects Poster Session 2
Woojin Lee ⋅ Hyugjae Chang ⋅ Jaeho Moon ⋅ Jaehyup Lee ⋅ Munchurl Kim
ExHall D Poster #332
Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks Poster Session 4
Yu Zhou ⋅ Dian Zheng ⋅ Qijie Mo ⋅ Ren-Jie Lu ⋅ Kun-Yu Lin ⋅ Wei-Shi Zheng
ExHall D Poster #433
TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions Poster Session 3
Wang Yu-Hang ⋅ Junkang Guo ⋅ Aolei Liu ⋅ Kaihao Wang ⋅ Zaitong Wu ⋅ Zhenyu Liu ⋅ Wenfei Yin ⋅ Jian Liu
ExHall D Poster #462
LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty Poster Session 2
Christoforos N. Spartalis ⋅ Theodoros Semertzidis ⋅ Efstratios Gavves ⋅ Petros Daras
ExHall D Poster #445
DiffFNO: Diffusion Fourier Neural Operator Poster Session 1
Xiaoyi Liu ⋅ Hao Tang
ExHall D Poster #195
SEC-Prompt: SEmantic Complementary Prompting for Few-Shot Class-Incremental Learning Poster Session 5
Ye Liu ⋅ Meng Yang
ExHall D Poster #440
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner Poster Session 2
Weiyu Li ⋅ Jiarui Liu ⋅ Hongyu Yan ⋅ Rui Chen ⋅ Yixun Liang ⋅ Xuelin Chen ⋅ Ping Tan ⋅ Xiaoxiao Long
ExHall D Poster #40
Synchronized Video-to-Audio Generation via Mel Quantization-Continuum Decomposition Poster Session 1
Juncheng Wang ⋅ Chao Xu ⋅ Cheng Yu ⋅ Lei Shang ⋅ Zhe Hu ⋅ Shujun Wang ⋅ Liefeng Bo
ExHall D Poster #282
Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition Poster Session 3
Anqi Zhu ⋅ Jingmin Zhu ⋅ James Bailey ⋅ Mingming Gong ⋅ Qiuhong Ke
ExHall D Poster #308
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting Poster Session 3
Chengyou Jia ⋅ Changliang Xia ⋅ Zhuohang Dang ⋅ Weijia Wu ⋅ Hangwei Qian ⋅ Minnan Luo
ExHall D Poster #251
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders Poster Session 4
Rui Chen ⋅ Jianfeng Zhang ⋅ Yixun Liang ⋅ Guan Luo ⋅ Weiyu Li ⋅ Jiarui Liu ⋅ Xiu Li ⋅ Xiaoxiao Long ⋅ Jiashi Feng ⋅ Ping Tan
ExHall D Poster #38
VEU-Bench: Towards Comprehensive Understanding of Video Editing Poster Session 3
Bozheng Li ⋅ Yongliang Wu ⋅ YI LU ⋅ Jiashuo Yu ⋅ Licheng Tang ⋅ Jiawang Cao ⋅ Wenqing Zhu ⋅ Yuyang Sun ⋅ Jay Wu ⋅ Wenbo Zhu
ExHall D Poster #289
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution Poster Session 4
Shian Du ⋅ Menghan Xia ⋅ Chang Liu ⋅ Xintao Wang ⋅ Jing Wang ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Xiangyang Ji
ExHall D Poster #190
FluxSpace: Disentangled Semantic Editing in Rectified Flow Models Poster Session 3
Yusuf Dalva ⋅ Kavana Venkatesh ⋅ Pinar Yanardag
ExHall D Poster #232
Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views Poster Session 4
Chong Bao ⋅ Xiyu Zhang ⋅ Zehao Yu ⋅ Jiale Shi ⋅ Guofeng Zhang ⋅ Songyou Peng ⋅ Zhaopeng Cui
ExHall D Poster #51
Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction Poster Session 2
Yuanbo Wang ⋅ Zhaoxuan Zhang ⋅ Jiajin Qiu ⋅ Dilong Sun ⋅ Zhengyu Meng ⋅ Xiaopeng Wei ⋅ Xin Yang
ExHall D Poster #20
ProAPO: Progressively Automatic Prompt Optimization for Visual Classification Poster Session 5
Xiangyan Qu ⋅ Gaopeng Gou ⋅ Jiamin Zhuang ⋅ Jing Yu ⋅ Kun Song ⋅ Qihao Wang ⋅ Yili Li ⋅ Gang Xiong
ExHall D Poster #392
ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts Poster Session 3
Dmitrii M Petrov ⋅ Pradyumn Goyal ⋅ Divyansh Shivashok ⋅ Yuanming Tao ⋅ Melinos Averkiou ⋅ Evangelos Kalogerakis
ExHall D Poster #253
Black Swan: Abductive and Defeasible Video Reasoning in Unpredictable Events Poster Session 5
Aditya Chinchure ⋅ Sahithya Ravi ⋅ Raymond Ng ⋅ Vered Shwartz ⋅ Boyang Li ⋅ Leonid Sigal
ExHall D Poster #304
TANGO: Training-free Embodied AI Agents for Open-world Tasks Poster Session 5
Filippo Ziliotto ⋅ Tommaso Campari ⋅ Luciano Serafini ⋅ Lamberto Ballan
ExHall D Poster #342
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation Poster Session 3
Yuan Gan ⋅ Jiaxu Miao ⋅ Yunze Wang ⋅ Yi Yang
ExHall D Poster #266
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning Poster Session 3
Bardia Safaei ⋅ Faizan Siddiqui ⋅ Jiacong Xu ⋅ Vishal M. Patel ⋅ Shao-Yuan Lo
ExHall D Poster #344
DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness Poster Session 5
Yiming Zhong ⋅ Qi Jiang ⋅ Jingyi Yu ⋅ Yuexin Ma
ExHall D Poster #145
Vision-Language Gradient Descent-driven All-in-One Deep Unfolding Networks Poster Session 2
Haijin Zeng ⋅ Xiangming Wang ⋅ Yongyong Chen ⋅ Jingyong Su ⋅ Jie Liu
ExHall D Poster #206
3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer Poster Session 1
Jiajun Deng ⋅ Tianyu He ⋅ Li Jiang ⋅ Tianyu Wang ⋅ Feras Dayoub ⋅ Ian Reid
ExHall D Poster #343
VL2Lite: Task-Specific Knowledge Distillation from Large Vision-Language Models to Lightweight Networks Poster Session 6
Jinseong Jang ⋅ Chunfei Ma ⋅ Byeongwon Lee
ExHall D Poster #375
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction Poster Session 4
Yuan Zhou ⋅ Qingshan Xu ⋅ Jiequan Cui ⋅ Junbao Zhou ⋅ Jing Zhang ⋅ Richang Hong ⋅ Hanwang Zhang
ExHall D Poster #413
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation Poster Session 5
Zilyu Ye ⋅ Zhiyang Chen ⋅ Tiancheng Li ⋅ Zemin Huang ⋅ Weijian Luo ⋅ Guo-Jun Qi
ExHall D Poster #225
All-directional Disparity Estimation for Real-world QPD Images Poster Session 5
Hongtao Yu ⋅ Shaohui Song ⋅ Lihu Sun ⋅ Wenkai Su ⋅ Xiaodong Yang ⋅ Chengming Liu
ExHall D Poster #75
LC-Mamba: Local and Continuous Mamba with Shifted Windows for Frame Interpolation Poster Session 4
Min Wu Jeong ⋅ Chae Eun Rhee
ExHall D Poster #178
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment Poster Session 4
Edson Araujo ⋅ Andrew Rouditchenko ⋅ Yuan Gong ⋅ Saurabhchand Bhati ⋅ Samuel Thomas ⋅ Brian Kingsbury ⋅ Leonid Karlinsky ⋅ Rogerio Feris ⋅ James Glass ⋅ Hilde Kuehne
ExHall D Poster #287
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Adaptation Poster Session 4
Arnav Mohanty Das ⋅ Gantavya Bhatt ⋅ Lilly Kumari ⋅ Sahil Verma ⋅ Jeff Bilmes
ExHall D Poster #450
Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera Poster Session 6
Zhengdi Yu ⋅ Stefanos Zafeiriou ⋅ Tolga Birdal
ExHall D Poster #148
MFogHub: Bridging Multi-Regional and Multi-Satellite Data for Global Marine Fog Detection and Forecasting Poster Session 3
Mengqiu XU ⋅ Kaixin Chen ⋅ Heng Guo ⋅ Yixiang Huang ⋅ Ming Wu ⋅ Zhenwei Shi ⋅ Chuang Zhang ⋅ Jun Guo
ExHall D Poster #190
Dual Consolidation for Pre-Trained Model-Based Domain-Incremental Learning Poster Session 4
Da-Wei Zhou ⋅ Zi-Wen Cai ⋅ Han-Jia Ye ⋅ Lijun Zhang ⋅ De-Chuan Zhan
ExHall D Poster #451
EmoEdit: Evoking Emotions through Image Manipulation Poster Session 5
Jingyuan Yang ⋅ Jiawei Feng ⋅ Weibin Luo ⋅ Dani Lischinski ⋅ Daniel Cohen-Or ⋅ Hui Huang
ExHall D Poster #350
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D Poster Session 1
Wei Cheng ⋅ Juncheng Mu ⋅ Xianfang Zeng ⋅ Xin Chen ⋅ Anqi Pang ⋅ Chi Zhang ⋅ Zhibin Wang ⋅ Bin Fu ⋅ Gang Yu ⋅ Ziwei Liu ⋅ Liang Pan
ExHall D Poster #39
PIDSR: Complementary Polarized Image Demosaicing and Super-Resolution Poster Session 4
Shuangfan Zhou ⋅ Chu Zhou ⋅ Youwei Lyu ⋅ Heng Guo ⋅ Zhanyu Ma ⋅ Boxin Shi ⋅ Imari Sato
ExHall D Poster #21
Video-Bench: Human-Aligned Video Generation Benchmark Poster Session 4
Hui Han ⋅ Siyuan Li ⋅ Jiaqi Chen ⋅ Yiwen Yuan ⋅ Yuling Wu ⋅ Yufan Deng ⋅ Chak Tou Leong ⋅ Hanwen Du ⋅ Junchen Fu ⋅ Youhua Li ⋅ Jie Zhang ⋅ Chi Zhang ⋅ Li-jia Li ⋅ Yongxin Ni
ExHall D Poster #293
Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning Poster Session 3
Kunyu Wang ⋅ Xueyang Fu ⋅ Xin Lu ⋅ Chengjie Ge ⋅ Chengzhi Cao ⋅ Wei Zhai ⋅ Zheng-Jun Zha
ExHall D Poster #419
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning Poster Session 3
Xin Wen ⋅ Bingchen Zhao ⋅ Yilun Chen ⋅ Jiangmiao Pang ⋅ Xiaojuan Qi
ExHall D Poster #144
TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition Poster Session 1
Yilong Wang ⋅ Zilin Gao ⋅ Qilong Wang ⋅ Zhaofeng Chen ⋅ Peihua Li ⋅ Qinghua Hu
ExHall D Poster #313
BioX-CPath: Biologically-driven Explainable Diagnostics for Multistain IHC Computational Pathology Poster Session 2
Amaya Gallagher-Syed ⋅ Henry Senior ⋅ Omnia Alwazzan ⋅ Elena Pontarini ⋅ Michele Bombardieri ⋅ Costantino Pitzalis ⋅ Myles J. Lewis ⋅ Michael R Barnes ⋅ Luca Rossi ⋅ Greg Slabaugh
ExHall D Poster #476
Dynamic Pseudo Labeling via Gradient Cutting for High-Low Entropy Exploration Poster Session 4
Jae Hyeon Park ⋅ Joo Hyeon Jeon ⋅ Jae Yun Lee ⋅ Sangyeon Ahn ⋅ MinHee Cha ⋅ Min Geol Kim ⋅ Hyeok Nam ⋅ Sung In Cho
ExHall D Poster #456
VODiff: Controlling Object Visibility Order in Text-to-Image Generation Poster Session 4
Dong Liang ⋅ Jinyuan Jia ⋅ Yuhao Liu ⋅ Zhanghan Ke ⋅ Hongbo Fu ⋅ Rynson W.H. Lau
ExHall D Poster #246
CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation Poster Session 4
Jiahao Li ⋅ Weijian Ma ⋅ Xueyang Li ⋅ Yunzhong Lou ⋅ Guichun Zhou ⋅ Xiangdong Zhou
ExHall D Poster #266
Seeing the Abstract: Translating the Abstract Language for Vision Language Models Poster Session 2
Davide Talon ⋅ Federico Girella ⋅ Ziyue Liu ⋅ Marco Cristani ⋅ Yiming Wang
ExHall D Poster #370
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models Poster Session 5
Yifan Liu ⋅ Keyu Fan ⋅ Weihao Yu ⋅ Chenxin Li ⋅ Hao Lu ⋅ Yixuan Yuan
ExHall D Poster #49
SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving Poster Session 3
Xuesong Chen ⋅ Linjiang Huang ⋅ Tao Ma ⋅ Rongyao Fang ⋅ Shaoshuai Shi ⋅ Hongsheng Li
ExHall D Poster #137
Shift the Lens: Environment-Aware Unsupervised Camouflaged Object Detection Poster Session 4
Ji Du ⋅ Fangwei Hao ⋅ Mingyang Yu ⋅ Desheng Kong ⋅ Jiesheng Wu ⋅ Bin Wang ⋅ Jing XU ⋅ Ping Li
ExHall D Poster #331
Probability Density Geodesics in Image Diffusion Latent Space Poster Session 6
Qingtao Yu ⋅ Jaskirat Singh ⋅ Zhaoyuan Yang ⋅ Peter Henry Tu ⋅ Jing Zhang ⋅ Richard Hartley ⋅ Hongdong Li ⋅ Dylan Campbell
ExHall D Poster #173
High-quality Point Cloud Oriented Normal Estimation via Hybrid Angular and Euclidean Distance Encoding Poster Session 1
Yuanqi Li ⋅ Jingcheng Huang ⋅ Hongshen Wang ⋅ Peiyuan Lv ⋅ Yansong Liu ⋅ Jiuming Zheng ⋅ Jie Guo ⋅ Yanwen Guo
ExHall D Poster #104
DriveScape: High-Resolution Driving Video Generation by Multi-View Feature Fusion Poster Session 4
Wei Wu ⋅ Xi Guo ⋅ Weixuan TANG ⋅ Tingxuan Huang ⋅ Chiyu Wang ⋅ Chenjing Ding
ExHall D Poster #131
Advancing Manga Analysis: Comprehensive Segmentation Annotations for the Manga109 Dataset Poster Session 2
Minshan Xie ⋅ Jian Lin ⋅ Hanyuan Liu ⋅ Chengze Li ⋅ Tien-Tsin Wong
ExHall D Poster #335
Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights Poster Session 3
Ondrej Tybl ⋅ Lukas Neumann
ExHall D Poster #404
Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond Poster Session 4
Guanyao Wu ⋅ Haoyu Liu ⋅ Hongming Fu ⋅ Yichuan Peng ⋅ Jinyuan Liu ⋅ Xin Fan ⋅ Risheng Liu
ExHall D Poster #198
EgoLife: Towards Egocentric Life Assistant Poster Session 6
Jingkang Yang ⋅ Shuai Liu ⋅ Hongming Guo ⋅ Yuhao Dong ⋅ Xiamengwei Zhang ⋅ Sicheng Zhang ⋅ Pengyun Wang ⋅ Zitang Zhou ⋅ Binzhu Xie ⋅ Ziyue Wang ⋅ Bei Ouyang ⋅ Zhengyu Lin ⋅ Marco Cominelli ⋅ Zhongang Cai ⋅ Bo Li ⋅ Yuanhan Zhang ⋅ Peiyuan Zhang ⋅ Fangzhou Hong ⋅ Joerg Widmer ⋅ Francesco Gringoli ⋅ Lei Yang ⋅ Ziwei Liu
ExHall D Poster #259
RAEncoder: A Label-Free Reversible Adversarial Examples Encoder for Dataset Intellectual Property Protection Poster Session 4
Fan Xing ⋅ Zhuo Tian ⋅ Xuefeng Fan ⋅ Xiaoyi Zhou
ExHall D Poster #462
NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics Poster Session 1
Kun Yang ⋅ Yuxiang Liu ⋅ Zeyu Cui ⋅ Yu Liu ⋅ Maojun Zhang ⋅ Shen Yan ⋅ Qing Wang
ExHall D Poster #49
BrepGiff: Lightweight Generation of Complex B-rep with 3D GAT Diffusion Poster Session 6
Hao Guo ⋅ Xiaoshui Huang ⋅ Hao jiacheng ⋅ Yunpeng Bai ⋅ Hongping Gan ⋅ Yilei Shi
ExHall D Poster #41
Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition Poster Session 6
ZHANG LINTONG ⋅ Kang Yin ⋅ Seong-Whan Lee
ExHall D Poster #373
Progressive Correspondence Regenerator for Robust 3D Registration Poster Session 1
Guiyu Zhao ⋅ Sheng Ao ⋅ Ye Zhang ⋅ Kai Xu ⋅ Yulan Guo
ExHall D Poster #97
Reference-Based 3D-Aware Image Editing with Triplanes Poster Session 2
Bahri Batuhan Bilecen ⋅ Yiğit Yalın ⋅ Ning Yu ⋅ Aysegul Dundar
ExHall D Poster #44
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior Poster Session 2
Junfeng Ni ⋅ Yu Liu ⋅ Ruijie Lu ⋅ ZiRui Zhou ⋅ Song-Chun Zhu ⋅ Yixin Chen ⋅ Siyuan Huang
ExHall D Poster #55
Unlocking Generalization Power in LiDAR Point Cloud Registration Poster Session 5
Zhenxuan Zeng ⋅ Qiao Wu ⋅ Xiyu Zhang ⋅ Lin Yuanbo Wu ⋅ Pei An ⋅ Jiaqi Yang ⋅ Ji Wang ⋅ Peng Wang
ExHall D Poster #114
Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection Poster Session 4
Fuyun Wang ⋅ Tong Zhang ⋅ Yuanzhi Wang ⋅ Yide Qiu ⋅ Xin Liu ⋅ Xu Guo ⋅ Zhen Cui
ExHall D Poster #439
Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models Poster Session 1
Hao Cheng ⋅ Erjia Xiao ⋅ Jiayan Yang ⋅ Jiahang Cao ⋅ Qiang Zhang ⋅ Jize Zhang ⋅ Kaidi Xu ⋅ Jindong Gu ⋅ Renjing Xu
ExHall D Poster #271
3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting Poster Session 6
Qi Wu ⋅ Janick Martinez Esturo ⋅ Ashkan Mirzaei ⋅ Nicolas Moënne-Loccoz ⋅ Žan Gojčič
ExHall D Poster #28
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering Poster Session 1
Sheng Zhou ⋅ Junbin Xiao ⋅ Qingyun Li ⋅ Yicong Li ⋅ Xun Yang ⋅ Dan Guo ⋅ Meng Wang ⋅ Tat-seng Chua ⋅ Angela Yao
ExHall D Poster #305
V²Dial: Unification of Video and Visual Dialog via Multimodal Experts Poster Session 2
Adnen Abdessaied ⋅ Anna Rohrbach ⋅ Marcus Rohrbach ⋅ Andreas Bulling
ExHall D Poster #312
Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives Poster Session 5
Alex Hanson ⋅ Allen Tu ⋅ Geng Lin ⋅ Vasu Singla ⋅ Matthias Zwicker ⋅ Tom Goldstein
ExHall D Poster #46
Mamba-Reg: Vision Mamba Also Needs Registers Poster Session 3
Feng Wang ⋅ Jiahao Wang ⋅ Sucheng Ren ⋅ Guoyizhe Wei ⋅ Jieru Mei ⋅ Wei Shao ⋅ Yuyin Zhou ⋅ Alan L. Yuille ⋅ Cihang Xie
ExHall D Poster #411
It’s a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data Poster Session 5
Dominik Schnaus ⋅ Nikita Araslanov ⋅ Daniel Cremers
ExHall D Poster #377
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Poster Session 4
Enric Corona ⋅ Andrei Zanfir ⋅ Eduard Gabriel Bazavan ⋅ Nikos Kolotouros ⋅ Thiemo Alldieck ⋅ Cristian Sminchisescu
ExHall D Poster #4
SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning Poster Session 5
Seokju Yun ⋅ Seunghye Chae ⋅ Dongheon Lee ⋅ Youngmin Ro
ExHall D Poster #436
DynaMoDe-NeRF: Motion-aware Deblurring Neural Radiance Field for Dynamic Scenes Poster Session 5
Ashish Kumar ⋅ A. N. Rajagopalan
ExHall D Poster #64
Composing Parts for Expressive Object Generation Poster Session 3
Harsh Rangwani ⋅ Aishwarya Agarwal ⋅ Kuldeep Kulkarni ⋅ R. Venkatesh Babu ⋅ Srikrishna Karanam
ExHall D Poster #244
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration Poster Session 5
Chao Wang ⋅ Hehe Fan ⋅ Huichen Yang ⋅ Sarvnaz Karimi ⋅ Lina Yao ⋅ Yi Yang
ExHall D Poster #237
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images Poster Session 1
Jiuchen Chen ⋅ Xinyu Yan ⋅ Qizhi Xu ⋅ Kaiqi Li
ExHall D Poster #197
UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image Poster Session 5
Xingyu Liu ⋅ Gu Wang ⋅ Ruida Zhang ⋅ Chenyangguang Zhang ⋅ Federico Tombari ⋅ Xiangyang Ji
ExHall D Poster #93
Tuning the Frequencies: Robust Training for Sinusoidal Neural Networks Poster Session 1
Tiago Novello ⋅ Diana Aldana Moreno ⋅ André Araujo ⋅ Luiz Velho
ExHall D Poster #278
Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures Poster Session 1
Guoxing Sun ⋅ Rishabh Dabral ⋅ Heming Zhu ⋅ Pascal Fua ⋅ Christian Theobalt ⋅ Marc Habermann
ExHall D Poster #37
Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection Poster Session 4
Jiangyi Wang ⋅ Na Zhao
ExHall D Poster #431
CASAGPT: Cuboid Arrangement and Scene Assembly for Interior Design Poster Session 6
Weitao Feng ⋅ Hang Zhou ⋅ Jing Liao ⋅ Li Cheng ⋅ Wenbo Zhou
ExHall D Poster #289
Identity-Clothing Similarity Modeling for Unsupervised Clothing Change Person Re-Identification Poster Session 4
Zhiqi Pang ⋅ Junjie Wang ⋅ Lingling Zhao ⋅ Chunyu Wang
ExHall D Poster #329
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation Poster Session 5
Ziqin Huang ⋅ Gu Wang ⋅ Chenyangguang Zhang ⋅ Ruida Zhang ⋅ Xiu Li ⋅ Xiangyang Ji
ExHall D Poster #96
3D Prior Is All You Need: Cross-Task Few-shot 2D Gaze Estimation Poster Session 5
Yihua Cheng ⋅ Hengfei Wang ⋅ Zhongqun Zhang ⋅ Yang Yue ⋅ Boeun Kim ⋅ Feng Lu ⋅ Hyung Jin Chang
ExHall D Poster #275
Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving Poster Session 3
Alexey Nekrasov ⋅ Malcolm Burdorf ⋅ Stewart Worrall ⋅ Bastian Leibe ⋅ Julie Stephany Berrio Perez
ExHall D Poster #119
Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Large Language Models through Egocentric Instruction Tuning Poster Session 3
JiHyeok Jung ⋅ EunTae Kim ⋅ SeoYeon Kim ⋅ Joo Ho Lee ⋅ Bumsoo Kim ⋅ Buru Chang
ExHall D Poster #345
GCC: Generative Color Constancy via Diffusing a Color Checker Poster Session 3
Chen-Wei Chang ⋅ Cheng-De Fan ⋅ Chia-Che Chang ⋅ Yi-Chen Lo ⋅ Yu-Chee Tseng ⋅ Jiun-Long Huang ⋅ Yu-Lun Liu
ExHall D Poster #20
Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model Poster Session 5
Shuyun Wang ⋅ Hu Zhang ⋅ Xin Shen ⋅ Dadong Wang ⋅ Xin Yu
ExHall D Poster #182
On Denoising Walking Videos for Gait Recognition Poster Session 3
Dongyang Jin ⋅ Chao Fan ⋅ Jingzhe Ma ⋅ Jingkai Zhou ⋅ Weihua Chen ⋅ Shiqi Yu
ExHall D Poster #162
Conformal Prediction for Zero-Shot Models Poster Session 4
Julio Silva-Rodríguez ⋅ Ismail Ben Ayed ⋅ Jose Dolz
ExHall D Poster #393
PhysAnimator: Physics-Guided Generative Cartoon Animation Poster Session 3
Tianyi Xie ⋅ Yiwei Zhao ⋅ Ying Jiang ⋅ Chenfanfu Jiang
ExHall D Poster #13
SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding Poster Session 6
Chenkai Zhang ⋅ Yiming Lei ⋅ Zeming Liu ⋅ Haitao Leng ⋅ Shaoguo Liu ⋅ Tingting Gao ⋅ Qingjie Liu ⋅ Yunhong Wang
ExHall D Poster #273
Weakly Supervised Temporal Action Localization via Dual-Prior Collaborative Learning Guided by Multimodal Large Language Models Poster Session 5
Quan Zhang ⋅ Jinwei Fang ⋅ Rui Yuan ⋅ Xi Tang ⋅ Yuxin Qi ⋅ Ke Zhang ⋅ Chun Yuan
ExHall D Poster #298
HotSpot: Signed Distance Function Optimization with an Asymptotically Sufficient Condition Poster Session 1
Zimo Wang ⋅ Cheng Wang ⋅ Taiki Yoshino ⋅ Sirui Tao ⋅ Ziyang Fu ⋅ Tzu-Mao Li
ExHall D Poster #103
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs Poster Session 3
Zhantao Yang ⋅ Ruili Feng ⋅ Keyu Yan ⋅ Huangji Wang ⋅ Zhicai Wang ⋅ Shangwen Zhu ⋅ Han Zhang ⋅ Jie Xiao ⋅ Pingyu Wu ⋅ Kai Zhu ⋅ Jixuan Chen ⋅ Chen-Wei Xie ⋅ Yue Yang ⋅ Hongyang Zhang ⋅ Yu Liu ⋅ Fan Cheng
ExHall D Poster #356
EntitySAM: Segment Everything in Video Poster Session 5
Mingqiao Ye ⋅ Seoung Wug Oh ⋅ Lei Ke ⋅ Joon-Young Lee
ExHall D Poster #307
Scene-agnostic Pose Regression for Visual Localization Poster Session 6
Junwei Zheng ⋅ Ruiping Liu ⋅ Yufan Chen ⋅ Zhenfang Chen ⋅ Kailun Yang ⋅ Jiaming Zhang ⋅ Rainer Stiefelhagen
ExHall D Poster #90
GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction Poster Session 5
Jinguang Tong ⋅ Xuesong Li ⋅ Fahira Afzal Maken ⋅ Sundaram Muthu ⋅ Lars Petersson ⋅ Chuong Nguyen ⋅ Hongdong Li
ExHall D Poster #47
Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model Poster Session 2
Longrong Yang ⋅ Dong Shen ⋅ Chaoxiang Cai ⋅ Kaibing Chen ⋅ Fan Yang ⋅ Tingting Gao ⋅ Di ZHANG ⋅ Xi Li
ExHall D Poster #384
GenFusion: Closing the Loop between Reconstruction and Generation via Videos Poster Session 2
Sibo Wu ⋅ Congrong Xu ⋅ Binbin Huang ⋅ Andreas Geiger ⋅ Anpei Chen
ExHall D Poster #61
VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis Poster Session 3
Zhifeng Wang ⋅ Renjiao Yi ⋅ Xin Wen ⋅ Chenyang Zhu ⋅ Kai Xu
ExHall D Poster #483
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting Poster Session 3
Cheng Zhang ⋅ Haofei Xu ⋅ Qianyi Wu ⋅ Camilo Cruz Gambardella ⋅ Dinh Phung ⋅ Jianfei Cai
ExHall D Poster #74
RaSS: Improving Denoising Diffusion Samplers with Reinforced Active Sampling Scheduler Poster Session 3
Xin Ding ⋅ Lei Yu ⋅ Xin Li ⋅ Zhijun Tu ⋅ Hanting Chen ⋅ Jie Hu ⋅ Zhibo Chen
ExHall D Poster #217
MixerMDM: Learnable Composition of Human Motion Diffusion Models Poster Session 3
Pablo Ruiz-Ponce ⋅ German Barquero ⋅ Cristina Palmero ⋅ Sergio Escalera ⋅ Jose Garcia-Rodriguez
ExHall D Poster #165
LEDiff: Latent Exposure Diffusion for HDR Generation Poster Session 1
Chao Wang ⋅ Zhihao Xia ⋅ Thomas Leimkuehler ⋅ Karol Myszkowski ⋅ Xuaner Zhang
ExHall D Poster #27
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Poster Session 5
Sili Chen ⋅ Hengkai Guo ⋅ Shengnan Zhu ⋅ Feihu Zhang ⋅ Zilong Huang ⋅ Jiashi Feng ⋅ Bingyi Kang
ExHall D Poster #169
Show and Segment: Universal Medical Image Segmentation via In-Context Learning Poster Session 4
Yunhe Gao ⋅ Di Liu ⋅ Zhuowei Li ⋅ Yunsheng Li ⋅ Dongdong Chen ⋅ Mu Zhou ⋅ Dimitris N. Metaxas
ExHall D Poster #478
Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models Poster Session 2
Sangwon Jang ⋅ June Suk Choi ⋅ Jaehyeong Jo ⋅ Kimin Lee ⋅ Sung Ju Hwang
ExHall D Poster #270
Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh Poster Session 5
Xiangjun Gao ⋅ Xiaoyu Li ⋅ Yiyu Zhuang ⋅ Qi Zhang ⋅ Wenbo Hu ⋅ Chaopeng Zhang ⋅ Yao Yao ⋅ Ying Shan ⋅ Long Quan
ExHall D Poster #33
Towards Consistent Multi-Task Learning: Unlocking the Potential of Task-Specific Parameters Poster Session 2
Xiaohan Qin ⋅ Xiaoxing Wang ⋅ Junchi Yan
ExHall D Poster #447
Rectified Diffusion Guidance for Conditional Generation Poster Session 3
Mengfei Xia ⋅ Nan Xue ⋅ Yujun Shen ⋅ Ran Yi ⋅ Tieliang Gong ⋅ Yong-Jin Liu
ExHall D Poster #259
Towards In-the-wild 3D Plane Reconstruction from a Single Image Poster Session 6
Jiachen Liu ⋅ Rui Yu ⋅ Sili Chen ⋅ Sharon X. Huang ⋅ Hengkai Guo
ExHall D Poster #84
FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis Poster Session 1
Wonjoon Jin ⋅ Qi Dai ⋅ Chong Luo ⋅ Seung-Hwan Baek ⋅ Sunghyun Cho
ExHall D Poster #176
RAD: Region-Aware Diffusion Models for Image Inpainting Poster Session 1
Sora Kim ⋅ Sungho Suh ⋅ Minsik Lee
ExHall D Poster #216
Supervising Sound Localization by In-the-wild Egomotion Poster Session 5
Anna Min ⋅ Ziyang Chen ⋅ Hang Zhao ⋅ Andrew Owens
ExHall D Poster #279
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning Poster Session 5
Yuheng Xu ⋅ Shijie Yang ⋅ Xin Liu ⋅ Jie Liu ⋅ Jie Tang ⋅ Gangshan Wu
ExHall D Poster #197
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Poster Session 3
Weixi Feng ⋅ Chao Liu ⋅ Sifei Liu ⋅ William Yang Wang ⋅ Arash Vahdat ⋅ Weili Nie
ExHall D Poster #223
Scaling Vision Pre-Training to 4K Resolution Poster Session 2
Baifeng Shi ⋅ Boyi Li ⋅ Han Cai ⋅ Yao Lu ⋅ Sifei Liu ⋅ Marco Pavone ⋅ Jan Kautz ⋅ Song Han ⋅ Trevor Darrell ⋅ Pavlo Molchanov ⋅ Danny Yin
ExHall D Poster #406
NVILA: Efficient Frontier Visual Language Models Poster Session 1
Zhijian Liu ⋅ Ligeng Zhu ⋅ Baifeng Shi ⋅ Zhuoyang Zhang ⋅ Yuming Lou ⋅ Shang Yang ⋅ Haocheng Xi ⋅ Shiyi Cao ⋅ Yuxian Gu ⋅ Dacheng Li ⋅ Xiuyu Li ⋅ Haotian Tang ⋅ Yunhao Fang ⋅ Yukang Chen ⋅ Cheng-Yu Hsieh ⋅ De-An Huang ⋅ An-Chieh Cheng ⋅ Jinyi Hu ⋅ Sifei Liu ⋅ Ranjay Krishna ⋅ Pavlo Molchanov ⋅ Jan Kautz ⋅ Danny Yin ⋅ Song Han ⋅ Yao Lu
ExHall D Poster #377
LidarGait++: Learning Local Features and Size Awareness from LiDAR Point Clouds for 3D Gait Recognition Poster Session 2
Chuanfu Shen ⋅ Rui Wang ⋅ Lixin Duan ⋅ Shiqi Yu
ExHall D Poster #120
UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation Poster Session 6
Yichong Lu ⋅ Yichi Cai ⋅ Shangzhan Zhang ⋅ Hongyu Zhou ⋅ Haoji Hu ⋅ Huimin Yu ⋅ Andreas Geiger ⋅ Yiyi Liao
ExHall D Poster #130
Diff-Palm: Realistic Palmprint Generation with Polynomial Creases and Intra-Class Variation Controllable Diffusion Models Poster Session 6
Jianlong Jin ⋅ Chenglong Zhao ⋅ Ruixin Zhang ⋅ Sheng Shang ⋅ Jianqing Xu ⋅ Jingyun Zhang ⋅ ShaoMing Wang ⋅ Yang Zhao ⋅ Shouhong Ding ⋅ Wei Jia ⋅ Yunsheng Wu
ExHall D Poster #17
Arbitrary-steps Image Super-resolution via Diffusion Inversion Poster Session 5
Zongsheng Yue ⋅ Kang Liao ⋅ Chen Change Loy
ExHall D Poster #199
UniNet: A Contrastive Learning-guided Unified Framework with Feature Selection for Anomaly Detection Poster Session 2
Shun Wei ⋅ Jielin Jiang ⋅ Xiaolong Xu
ExHall D Poster #440
3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement Poster Session 4
Yihang Luo ⋅ Shangchen Zhou ⋅ Yushi Lan ⋅ Xingang Pan ⋅ Chen Change Loy
ExHall D Poster #56
Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding Poster Session 5
Jiaxin Shi ⋅ Mingyue Xiang ⋅ Hao Sun ⋅ Yixuan Huang ⋅ Zhi Weng
ExHall D Poster #338
On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach Poster Session 4
Baoshun Tong ⋅ Hanjiang Lai ⋅ Yan Pan ⋅ Jian Yin
ExHall D Poster #392
Towards General Visual-Linguistic Face Forgery Detection Poster Session 4
Ke Sun ⋅ Shen Chen ⋅ Taiping Yao ⋅ Ziyin Zhou ⋅ Jiayi Ji ⋅ Xiaoshuai Sun ⋅ Chia-Wen Lin ⋅ Rongrong Ji
ExHall D Poster #359
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration Poster Session 1
Jianyi Wang ⋅ Zhijie Lin ⋅ Meng Wei ⋅ Yang Zhao ⋅ Ceyuan Yang ⋅ Chen Change Loy ⋅ Lu Jiang
ExHall D Poster #187
Movie Weaver: Tuning-Free Multi-Concept Video Personalization with Anchored Prompts Poster Session 3
Feng Liang ⋅ Haoyu Ma ⋅ Zecheng He ⋅ Tingbo Hou ⋅ Ji Hou ⋅ Kunpeng Li ⋅ Xiaoliang Dai ⋅ Felix Juefei-Xu ⋅ Samaneh Azadi ⋅ Animesh Sinha ⋅ Peizhao Zhang ⋅ Peter Vajda ⋅ Diana Marculescu
ExHall D Poster #238
MoEdit: On Learning Quantity Perception for Multi-object Image Editing Poster Session 1
Yanfeng Li ⋅ Ka-Hou Chan ⋅ Yue Sun ⋅ Chan-Tong Lam ⋅ Tong Tong ⋅ Zitong YU ⋅ Keren Fu ⋅ Xiaohong Liu ⋅ Tao Tan
ExHall D Poster #241
Seeing More with Less: Human-like Representations in Vision Models Poster Session 1
Andrey Gizdov ⋅ Shimon Ullman ⋅ Daniel Harari
ExHall D Poster #407
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding Poster Session 3
Thanh-Dat Truong ⋅ Utsav Prabhu ⋅ Bhiksha Raj ⋅ Jackson Cothren ⋅ Khoa Luu
ExHall D Poster #423
Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification Poster Session 2
Jiayu Jiang ⋅ Changxing Ding ⋅ Wentao Tan ⋅ Junhong Wang ⋅ JIN Tao ⋅ Xiangmin Xu
ExHall D Poster #367
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Poster Session 3
Jihan Yang ⋅ Shusheng Yang ⋅ Anjali W. Gupta ⋅ Rilyn Han ⋅ Li Fei-Fei ⋅ Saining Xie
ExHall D Poster #287
Nested Diffusion Models Using Hierarchical Latent Priors Poster Session 1
Xiao Zhang ⋅ Ruoxi Jiang ⋅ Rebecca Willett ⋅ Michael Maire
ExHall D Poster #224
Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning Poster Session 2
Jiange Yang ⋅ Haoyi Zhu ⋅ Yating Wang ⋅ Gangshan Wu ⋅ Tong He ⋅ Limin Wang
ExHall D Poster #152
Localizing Events in Videos with Multimodal Queries Poster Session 1
Gengyuan Zhang ⋅ Mang Ling Ada Fok ⋅ Jialu Ma ⋅ Yan Xia ⋅ Philip H.S. Torr ⋅ Daniel Cremers ⋅ Volker Tresp ⋅ Jindong Gu
ExHall D Poster #303
DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting Poster Session 6
Liao Shen ⋅ Tianqi Liu ⋅ Huiqiang Sun ⋅ Jiaqi Li ⋅ Zhiguo Cao ⋅ Wei Li ⋅ Chen Change Loy
ExHall D Poster #26
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention Poster Session 3
Yuhan Wang ⋅ Fangzhou Hong ⋅ Shuai Yang ⋅ Liming Jiang ⋅ Wayne Wu ⋅ Chen Change Loy
ExHall D Poster #61
F-LMM: Grounding Frozen Large Multimodal Models Poster Session 5
Size Wu ⋅ Sheng Jin ⋅ Wenwei Zhang ⋅ Lumin Xu ⋅ Wentao Liu ⋅ Wei Li ⋅ Chen Change Loy
ExHall D Poster #352
EdgeTAM: On-Device Track Anything Model Poster Session 3
Chong Zhou ⋅ Chenchen Zhu ⋅ Yunyang Xiong ⋅ Saksham Suri ⋅ Fanyi Xiao ⋅ Lemeng Wu ⋅ Raghuraman Krishnamoorthi ⋅ Bo Dai ⋅ Chen Change Loy ⋅ Vikas Chandra ⋅ Bilge Soran
ExHall D Poster #304
CleanDIFT: Diffusion Features without Noise Poster Session 1
Nick Stracke ⋅ Stefan Andreas Baumann ⋅ Kolja Bauer ⋅ Frank Fundel ⋅ Björn Ommer
ExHall D Poster #218
Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction Poster Session 5
Xiaolu Liu ⋅ Ruizi Yang ⋅ Song Wang ⋅ Wentong Li ⋅ Junbo Chen ⋅ Jianke Zhu
ExHall D Poster #125
Don't Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving Poster Session 5
Ziying Song ⋅ Caiyan Jia ⋅ Lin Liu ⋅ Hongyu Pan ⋅ Yongchang Zhang ⋅ Junming Wang ⋅ Xingyu Zhang ⋅ Shaoqing Xu ⋅ Lei Yang ⋅ Yadan Luo
ExHall D Poster #131
UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior Poster Session 4
I-Hsiang (Aaron) Chen ⋅ Wei-Ting Chen ⋅ Yu-Wei Liu ⋅ Yuan-Chun Chiang ⋅ Sy-Yen Kuo ⋅ Ming-Hsuan Yang
ExHall D Poster #206
Complexity Experts are Task-Discriminative Learners for Any Image Restoration Poster Session 3
Eduard Zamfir ⋅ Zongwei Wu ⋅ Nancy Mehta ⋅ Yuedong Tan ⋅ Danda Paudel ⋅ Yulun Zhang ⋅ Radu Timofte
ExHall D Poster #201
Generative Omnimatte: Learning to Decompose Video into Layers Poster Session 3
Yao-Chih Lee ⋅ Erika Lu ⋅ Sarah Rumbley ⋅ Michal Geyer ⋅ Jia-Bin Huang ⋅ Tali Dekel ⋅ Forrester Cole
ExHall D Poster #178
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks Poster Session 4
Dongshuo Yin ⋅ Leiyi Hu ⋅ Bin Li ⋅ Youqun Zhang ⋅ Xue Yang
ExHall D Poster #407
Precise Event Spotting in Sports Videos: Solving Long-Range Dependency and Class Imbalance Poster Session 1
Sanchayan Santra ⋅ Vishal Chudasama ⋅ Pankaj Wasnik ⋅ Vineeth Balasubramanian
ExHall D Poster #287
KAC: Kolmogorov-Arnold Classifier for Continual Learning Poster Session 3
Yusong Hu ⋅ Zichen Liang ⋅ Fei Yang ⋅ Qibin Hou ⋅ Xialei Liu ⋅ Ming-Ming Cheng
ExHall D Poster #445
Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation Poster Session 1
Qinghe Ma ⋅ Jian Zhang ⋅ Zekun Li ⋅ Lei Qi ⋅ Qian Yu ⋅ Yinghuan Shi
ExHall D Poster #479
ATP: Adaptive Threshold Pruning for Efficient Data Encoding in Quantum Neural Networks Poster Session 4
Mohamed Afane ⋅ Gabrielle Ebbrecht ⋅ Ying Wang ⋅ Juntao Chen ⋅ Junaid Farooq
ExHall D Poster #440
dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis Poster Session 2
Luyuan Xie ⋅ Tianyu Luan ⋅ Wenyuan Cai ⋅ Guochen Yan ⋅ Zhaoyu Chen ⋅ Nan Xi ⋅ Yuejian Fang ⋅ Qingni Shen ⋅ Zhonghai Wu ⋅ Junsong Yuan
ExHall D Poster #460
AFL: A Single-Round Analytic Approach for Federated Learning with Pre-trained Models Poster Session 1
Run He ⋅ Kai Tong ⋅ Di Fang ⋅ Han Sun ⋅ Ziqian Zeng ⋅ Haoran Li ⋅ Tianyi Chen ⋅ Huiping Zhuang
ExHall D Poster #461
MIMO: A Medical Vision Language Model with Visual Referring Multimodal Input and Pixel Grounding Multimodal Output Poster Session 5
Yanyuan Chen ⋅ Dexuan Xu ⋅ Yu Huang ⋅ Songkun Zhan ⋅ Hanpin Wang ⋅ Dongxue Chen ⋅ Xueping Wang ⋅ Meikang Qiu ⋅ Hang Li
ExHall D Poster #354
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery? Poster Session 3
Fengxiang Wang ⋅ Hongzhen Wang ⋅ Zonghao Guo ⋅ Di Wang ⋅ Yulin Wang ⋅ Mingshuo Chen ⋅ Qiang Ma ⋅ Long Lan ⋅ Wenjing Yang ⋅ Jing Zhang ⋅ Zhiyuan Liu ⋅ Maosong Sun
ExHall D Poster #351
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding Poster Session 1
Shuming Liu ⋅ Chen Zhao ⋅ Tianqi Xu ⋅ Bernard Ghanem
ExHall D Poster #301
Efficient Data Driven Mixture-of-Expert Extraction from Trained Networks Poster Session 4
Uranik Berisha ⋅ Jens Mehnert ⋅ Alexandru Paul Condurache
ExHall D Poster #408
Structured 3D Latents for Scalable and Versatile 3D Generation Poster Session 5
Jianfeng XIANG ⋅ Zelong Lv ⋅ Sicheng Xu ⋅ Yu Deng ⋅ Ruicheng Wang ⋅ Bowen Zhang ⋅ Dong Chen ⋅ Xin Tong ⋅ Jiaolong Yang
ExHall D Poster #40
Towards All-in-One Medical Image Re-Identification Poster Session 6
Yuan Tian ⋅ Kaiyuan Ji ⋅ Rongzhao Zhang ⋅ Yankai Jiang ⋅ Chunyi Li ⋅ Xiaosong Wang ⋅ Guangtao Zhai
ExHall D Poster #443
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories Poster Session 1
Muzhi Zhu ⋅ Yuzhuo Tian ⋅ Hao Chen ⋅ Chunluan Zhou ⋅ Qingpei Guo ⋅ Yang Liu ⋅ Ming Yang ⋅ Chunhua Shen
ExHall D Poster #335
SceneCrafter: Controllable Multi-View Driving Scene Editing Poster Session 2
Zehao Zhu ⋅ Yuliang Zou ⋅ Chiyu “Max” Jiang ⋅ Bo Sun ⋅ Vincent Casser ⋅ XIUKUN HUANG ⋅ Jiahao Wang ⋅ Zhenpei Yang ⋅ Ruiqi Gao ⋅ Leonidas Guibas ⋅ Mingxing Tan ⋅ Dragomir Anguelov
ExHall D Poster #138
AMO Sampler: Enhancing Text Rendering with Overshooting Poster Session 3
Xixi Hu ⋅ Keyang Xu ⋅ Bo Liu ⋅ Hongliang Fei ⋅ Qiang Liu
ExHall D Poster #239
Disentangling Safe and Unsafe Image Corruptions via Anisotropy and Locality Poster Session 2
Ramchandran Muthukumar ⋅ Ambar Pal ⋅ Jeremias Sulam ⋅ Rene Vidal
ExHall D Poster #436
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model Poster Session 2
Chenjie Cao ⋅ Chaohui Yu ⋅ Shang Liu ⋅ Fan Wang ⋅ Xiangyang Xue ⋅ Yanwei Fu
ExHall D Poster #58
Bayesian Test-Time Adaptation for Vision-Language Models Poster Session 6
Lihua Zhou ⋅ Mao Ye ⋅ Shuaifeng Li ⋅ Nianxin Li ⋅ Xiatian Zhu ⋅ Lei Deng ⋅ Hongbin Liu ⋅ Zhen Lei
ExHall D Poster #368
ACE: Anti-Editing Concept Erasure in Text-to-Image Models Poster Session 5
Zihao Wang ⋅ Yuxiang Wei ⋅ Fan Li ⋅ Renjing Pei ⋅ Hang Xu ⋅ Wangmeng Zuo
ExHall D Poster #234
Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective Poster Session 5
Duowang Zhu ⋅ Xiaohu Huang ⋅ Haiyan Huang ⋅ Hao Zhou ⋅ Zhenfeng Shao
ExHall D Poster #286
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality Poster Session 3
Sanghyeok Lee ⋅ Joonmyung Choi ⋅ Hyunwoo J. Kim
ExHall D Poster #408
A4A: Adapter for Adapter Transfer via All-for-All Mapping for Cross-Architecture Models Poster Session 4
Keyu Tu ⋅ Mengqi Huang ⋅ Zhuowei Chen ⋅ Zhendong Mao
ExHall D Poster #258
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation Poster Session 2
Vladan Stojnić ⋅ Yannis Kalantidis ⋅ Jiri Matas ⋅ Giorgos Tolias
ExHall D Poster #421
Neural Inverse Rendering from Propagating Light Poster Session 3
Anagh Malik ⋅ Benjamin Attal ⋅ Andrew Xie ⋅ Matthew O’Toole ⋅ David B. Lindell
ExHall D Poster #30
LUCAS: Layered Universal Codec Avatars Poster Session 5
Di Liu ⋅ Teng Deng ⋅ Giljoo Nam ⋅ Yu Rong ⋅ Stanislav Pidhorskyi ⋅ Junxuan Li ⋅ Jason Saragih ⋅ Dimitris N. Metaxas ⋅ Chen Cao
ExHall D Poster #8
A Universal Scale-Adaptive Deformable Transformer for Image Restoration across Diverse Artifacts Poster Session 3
Xuyi He ⋅ Yuhui Quan ⋅ Ruotao Xu ⋅ Hui Ji
ExHall D Poster #199
CASP: Consistency-aware Audio-induced Saliency Prediction Model for Omnidirectional Video Poster Session 3
Zhaolin Wan ⋅ Han Qin ⋅ Zhiyang Li ⋅ Xiaopeng Fan ⋅ Wangmeng Zuo ⋅ Debin Zhao
ExHall D Poster #187
WISE: A Framework for Gigapixel Whole-Slide-Image Lossless Compression Poster Session 6
Yu Mao ⋅ Jun Wang ⋅ Nan Guan ⋅ Chun Jason Xue
ExHall D Poster #305
LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate Poster Session 4
Haoyan Gong ⋅ Zhenrong Zhang ⋅ Yuzheng Feng ⋅ Anh Nguyen ⋅ Hongbin Liu
ExHall D Poster #193
Gromov–Wasserstein Problem with Cyclic Symmetry Poster Session 5
Shoichiro Takeda ⋅ Yasunori Akagi
ExHall D Poster #69
HuPerFlow: A Comprehensive Benchmark for Human vs. Machine Motion Estimation Comparison Poster Session 5
Yung-Hao Yang ⋅ Zitang Sun ⋅ Taiki Fukiage ⋅ Shin'ya Nishida
ExHall D Poster #166
IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images Poster Session 1
Chih-Hao Lin ⋅ Jia-Bin Huang ⋅ Zhengqin Li ⋅ Zhao Dong ⋅ Christian Richardt ⋅ Michael Zollhoefer ⋅ Tuotuo Li ⋅ Johannes Kopf ⋅ Shenlong Wang ⋅ Changil Kim
ExHall D Poster #28
RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images Poster Session 2
Junjin Xiao ⋅ Qing Zhang ⋅ Yongwei Nie ⋅ Lei Zhu ⋅ Wei-Shi Zheng
ExHall D Poster #51
SDBF: Steep-Decision-Boundary Fingerprinting for Hard-Label Tampering Detection of DNN Models Poster Session 6
Xiaofan Bai ⋅ Shixin Li ⋅ Xiaojing Ma ⋅ Bin Benjamin Zhu ⋅ Dongmei Zhang ⋅ Linchen Yu
ExHall D Poster #299
EnliveningGS: Active Locomotion of 3DGS Poster Session 1
Siyuan Shen ⋅ Tianjia Shao ⋅ Kun Zhou ⋅ Chenfanfu Jiang ⋅ Yin Yang
ExHall D Poster #68
Secret Lies in Color: Enhancing AI-Generated Images Detection with Color Distribution Analysis Poster Session 3
Zexi Jia ⋅ Chuanwei Huang ⋅ Yeshuang Zhu ⋅ Hongyan Fei ⋅ Xiaoyue Duan ⋅ Yuan Zhiqiang ⋅ Ying Deng ⋅ Jiapei Zhang ⋅ Jinchao Zhang ⋅ Jie Zhou
ExHall D Poster #267
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking Poster Session 4
Wenrui Cai ⋅ Qingjie Liu ⋅ Yunhong Wang
ExHall D Poster #100
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos Poster Session 4
Felix Wimbauer ⋅ Weirong Chen ⋅ Dominik Muhle ⋅ Christian Rupprecht ⋅ Daniel Cremers
ExHall D Poster #85
ExpertAF: Expert Actionable Feedback from Video Poster Session 3
Kumar Ashutosh ⋅ Tushar Nagarajan ⋅ Georgios Pavlakos ⋅ Kris Kitani ⋅ Kristen Grauman
ExHall D Poster #280
Can Machines Understand Composition? Dataset and Benchmark for Photographic Image Composition Embedding and Understanding Poster Session 3
Zhaoran Zhao ⋅ Peng Lu ⋅ Anran Zhang ⋅ Pei Pei Li ⋅ Xia Li ⋅ Xuannan Liu ⋅ Yang Hu ⋅ Shiyi Chen ⋅ liweiwang ⋅ Wenhao Guo
ExHall D Poster #360
Exploring Timeline Control for Facial Motion Generation Poster Session 1
Yifeng Ma ⋅ Jinwei Qi ⋅ Chaonan Ji ⋅ Peng Zhang ⋅ Bang Zhang ⋅ Zhidong Deng ⋅ Liefeng Bo
ExHall D Poster #164
Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation Poster Session 6
Xiaoqi Li ⋅ Lingyun Xu ⋅ Mingxu Zhang ⋅ Jiaming Liu ⋅ Yan Shen ⋅ Iaroslav Ponomarenko ⋅ Jiahui Xu ⋅ Liang Heng ⋅ Siyuan Huang ⋅ Shanghang Zhang ⋅ Hao Dong
ExHall D Poster #141
PhysGen3D: Crafting a Miniature Interactive World from a Single Image Poster Session 2
Boyuan Chen ⋅ Hanxiao Jiang ⋅ Shaowei Liu ⋅ Saurabh Gupta ⋅ Yunzhu Li ⋅ Hao Zhao ⋅ Shenlong Wang
ExHall D Poster #71
MuTri: Multi-view Tri-alignment for OCT to OCTA 3D Image Translation Poster Session 4
Zhuangzhuang Chen ⋅ Hualiang Wang ⋅ Chubin Ou ⋅ Xiaomeng Li
ExHall D Poster #483
Learning Visual Generative Priors without Text Poster Session 2
Shuailei Ma ⋅ Kecheng Zheng ⋅ Ying Wei ⋅ Wei Wu ⋅ Fan Lu ⋅ Yifei Zhang ⋅ Chen-Wei Xie ⋅ Biao Gong ⋅ Jiapeng Zhu ⋅ Yujun Shen
ExHall D Poster #256
Image Quality Assessment: Investigating Causal Perceptual Effects with Abductive Counterfactual Inference Poster Session 4
Wenhao Shen ⋅ Mingliang Zhou ⋅ Yu Chen ⋅ Xuekai WEI ⋅ Yong Feng ⋅ Huayan Pu ⋅ Weijia Jia
ExHall D Poster #208
OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad Poster Session 5
Luyao Tang ⋅ Chaoqi Chen ⋅ Yuxuan Yuan ⋅ Zeyu Zhang ⋅ Yue Huang ⋅ Kun Zhang
ExHall D Poster #418
Let's Chorus: Partner-aware Hybrid Song-Driven 3D Head Animation Poster Session 2
Xiumei Xie ⋅ Zikai Huang ⋅ Wenhao Xu ⋅ Peng Xiao ⋅ Xuemiao Xu ⋅ Huaidong Zhang
ExHall D Poster #2
SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Poster Session 6
Yutao Tang ⋅ Yuxiang Guo ⋅ Deming Li ⋅ Cheng Peng
ExHall D Poster #64
VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation Poster Session 6
Hanzhi Chen ⋅ Boyang Sun ⋅ Anran Zhang ⋅ Marc Pollefeys ⋅ Stefan Leutenegger
ExHall D Poster #143
FilmComposer: LLM-Driven Music Production for Silent Film Clips Poster Session 3
Zhifeng Xie ⋅ Qile He ⋅ Youjia Zhu ⋅ Qiwei He ⋅ Mengtian Li
ExHall D Poster #274
Learning Person-Specific Animatable Face Models from In-the-Wild Images via a Shared Base Model Poster Session 2
Yuxiang Mao ⋅ Zhenfeng Fan ⋅ Zhijie Zhang ⋅ Zhiheng Zhang ⋅ Shihong Xia
ExHall D Poster #15
TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation Poster Session 2
Yabiao Wang ⋅ Shuo Wang ⋅ Jiangning Zhang ⋅ Ke Fan ⋅ Jiafu Wu ⋅ Xuezhucun Xue ⋅ Yong Liu
ExHall D Poster #173
Factored-NeuS: Reconstructing Surfaces, Illumination, and Materials of Possibly Glossy Objects Poster Session 5
Yue Fan ⋅ Ningjing Fan ⋅ Ivan Skorokhodov ⋅ Oleg Voynov ⋅ Savva Ignatyev ⋅ Evgeny Burnaev ⋅ Peter Wonka ⋅ Yiqun Wang
ExHall D Poster #26
Which Viewpoint Shows it Best? Language for Weakly Supervising View Selection in Multi-view Instructional Videos Poster Session 6
Sagnik Majumder ⋅ Tushar Nagarajan ⋅ Ziad Al-Halah ⋅ Reina Pradhan ⋅ Kristen Grauman
ExHall D Poster #275
Hybrid Reciprocal Transformer with Triplet Feature Alignment for Scene Graph Generation Poster Session 2
Jiawei Fu ⋅ ZHANG Tiantian ⋅ Kai Chen ⋅ Qi Dou
ExHall D Poster #343
TransPixeler: Advancing Text-to-Video Generation with Transparency Poster Session 4
Luozhou Wang ⋅ Yijun Li ⋅ ZhiFei Chen ⋅ Jui-Hsien Wang ⋅ Zhifei Zhang ⋅ He Zhang ⋅ Zhe Lin ⋅ Ying-Cong Chen
ExHall D Poster #232
Adaptive Keyframe Sampling for Long Video Understanding Poster Session 6
Xi Tang ⋅ Jihao Qiu ⋅ Lingxi Xie ⋅ Yunjie Tian ⋅ Jianbin Jiao ⋅ Qixiang Ye
ExHall D Poster #284
Person De-reidentification: A Variation-guided Identity Shift Modeling Poster Session 6
Yi-Xing Peng ⋅ Yu-Ming Tang ⋅ Kun-Yu Lin ⋅ Qize Yang ⋅ Jingke Meng ⋅ Xihan Wei ⋅ Wei-Shi Zheng
ExHall D Poster #304
FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes Poster Session 3
Lue Fan ⋅ Hao ZHANG ⋅ Qitai Wang ⋅ Hongsheng Li ⋅ Zhaoxiang Zhang
ExHall D Poster #131
Gradient Inversion Attacks on Parameter-Efficient Fine-Tuning Poster Session 2
Hasin Us Sami ⋅ Swapneel Sen ⋅ Amit K. Roy-Chowdhury ⋅ Srikanth Krishnamurthy ⋅ Basak Guler
ExHall D Poster #462
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer Poster Session 5
Ho-Joong Kim ⋅ Yearang Lee ⋅ Jung-Ho Hong ⋅ Seong-Whan Lee
ExHall D Poster #312
Making Old Film Great Again: Degradation-aware State Space Model for Old Film Restoration Poster Session 6
Yudong Mao ⋅ Hao Luo ⋅ Zhiwei Zhong ⋅ Peilin CHEN ⋅ Zhijiang Zhang ⋅ Shiqi Wang
ExHall D Poster #178
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Poster Session 2
Runtao Liu ⋅ Haoyu Wu ⋅ Zheng Ziqiang ⋅ Chen Wei ⋅ Yingqing He ⋅ Renjie Pi ⋅ Qifeng Chen
ExHall D Poster #252
Iterative Predictor-Critic Code Decoding for Real-World Image Dehazing Poster Session 3
Jiayi Fu ⋅ Siyu Liu ⋅ Zikun Liu ⋅ Chun-Le Guo ⋅ Hyunhee Park ⋅ Rui-Qi Wu ⋅ Guoqing Wang ⋅ Chongyi Li
ExHall D Poster #196
FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering Poster Session 6
Guofeng Feng ⋅ Siyan Chen ⋅ Rong Fu ⋅ Zimu Liao ⋅ Yi Wang ⋅ Tao Liu ⋅ Boni Hu ⋅ Linning Xu ⋅ PeiZhilin ⋅ Hengjie Li ⋅ Xiuhong Li ⋅ Ninghui Sun ⋅ Xingcheng Zhang ⋅ Bo Dai
ExHall D Poster #47
MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception Poster Session 2
Wenzhuo Liu ⋅ Wenshuo Wang ⋅ Yicheng Qiao ⋅ Qiannan Guo ⋅ Jiayin Zhu ⋅ Pengfei Li ⋅ Zilong Chen ⋅ Huiming Yang ⋅ Zhiwei Li ⋅ Lening Wang ⋅ Tiao Tan ⋅ Huaping Liu
ExHall D Poster #143
Realistic Test-Time Adaptation of Vision-Language Models Poster Session 5
Maxime Zanella ⋅ Clément Fuchs ⋅ Christophe De Vleeschouwer ⋅ Ismail Ben Ayed
ExHall D Poster #388
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting Poster Session 5
Gyeongjin Kang ⋅ Jisang Yoo ⋅ Jihyeon Park ⋅ Seungtae Nam ⋅ Hyeonsoo Im ⋅ Shin Sangheon ⋅ Sangpil Kim ⋅ Eunbyung Park
ExHall D Poster #92
Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling Poster Session 5
Nannan Li ⋅ Kevin Shih ⋅ Bryan A. Plummer
ExHall D Poster #18
RDD: Robust Feature Detector and Descriptor using Deformable Transformer Poster Session 2
Gonglin Chen ⋅ Tianwen Fu ⋅ Haiwei Chen ⋅ Wenbin Teng ⋅ Hanyuan Xiao ⋅ Yajie Zhao
ExHall D Poster #97
GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs Poster Session 4
Yi Fang ⋅ Bowen Jin ⋅ Jiacheng Shen ⋅ Sirui Ding ⋅ Qiaoyu Tan ⋅ Jiawei Han
ExHall D Poster #349
Erasing Undesirable Influence in Diffusion Models Poster Session 6
Jing Wu ⋅ Trung Le ⋅ Munawar Hayat ⋅ Mehrtash Harandi
ExHall D Poster #200
LT3SD: Latent Trees for 3D Scene Diffusion Poster Session 1
Quan Meng ⋅ Lei Li ⋅ Matthias Nießner ⋅ Angela Dai
ExHall D Poster #45
Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation Poster Session 3
Hao Zhu ⋅ Yan Zhu ⋅ Jiayu Xiao ⋅ Tianxiang Xiao ⋅ Yike Ma ⋅ Yucheng Zhang ⋅ Feng Dai
ExHall D Poster #324
Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection Poster Session 3
Jikang Cheng ⋅ Zhiyuan Yan ⋅ Ying Zhang ⋅ Li Hao ⋅ Jiaxin Ai ⋅ Qin Zou ⋅ Chen Li ⋅ Zhongyuan Wang
ExHall D Poster #313
Closest Neighbors are Harmful for Lightweight Masked Auto-encoders Poster Session 5
Jian Meng ⋅ Ahmed Hasssan ⋅ Li Yang ⋅ Deliang Fan ⋅ Jinwoo Shin ⋅ Jae-sun Seo
ExHall D Poster #400
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian Poster Session 2
Tao Xie ⋅ Xi Chen ⋅ Zhen Xu ⋅ Yiman Xie ⋅ Yudong Jin ⋅ Yujun Shen ⋅ Sida Peng ⋅ Hujun Bao ⋅ Xiaowei Zhou
ExHall D Poster #28
OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP Poster Session 2
Mohamad Hassan N C ⋅ Divyam Gupta ⋅ Mainak Singha ⋅ SAI BHARGAV RONGALI ⋅ Ankit Jha ⋅ Muhammad Haris Khan ⋅ Biplab Banerjee
ExHall D Poster #451
Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy Poster Session 2
Zaijing Li ⋅ Yuquan Xie ⋅ Rui Shao ⋅ Gongwei Chen ⋅ Dongmei Jiang ⋅ Liqiang Nie
ExHall D Poster #351
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control Poster Session 5
Mariam Hassan ⋅ Sebastian Stapf ⋅ Ahmad Rahimi ⋅ Pedro M B Rezende ⋅ Yasaman Haghighi ⋅ David Brüggemann ⋅ Isinsu Katircioglu ⋅ Lin Zhang ⋅ Xiaoran Chen ⋅ Suman Saha ⋅ Marco Cannici ⋅ Elie Aljalbout ⋅ Botao Ye ⋅ Xi Wang ⋅ Aram Davtyan ⋅ Mathieu Salzmann ⋅ Davide Scaramuzza ⋅ Marc Pollefeys ⋅ Paolo Favaro ⋅ Alex Alahi
ExHall D Poster #129
Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis Poster Session 1
Arpita Chowdhury ⋅ Dipanjyoti Paul ⋅ Zheda Mai ⋅ Jianyang Gu ⋅ Ziheng Zhang ⋅ Kazi Sajeed Mehrab ⋅ Elizabeth Campolongo ⋅ Daniel Rubenstein ⋅ Charles Stewart ⋅ Anuj Karpatne ⋅ Tanya Berger-Wolf ⋅ Yu Su ⋅ Wei-Lun Chao
ExHall D Poster #404
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching Poster Session 4
Bin Wang ⋅ Fan Wu ⋅ Linke Ouyang ⋅ Zhuangcheng Gu ⋅ Rui Zhang ⋅ Renqiu Xia ⋅ Botian Shi ⋅ Bo Zhang ⋅ Conghui He
ExHall D Poster #369
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Poster Session 5
Linke Ouyang ⋅ Yuan Qu ⋅ Hongbin Zhou ⋅ Jiawei Zhu ⋅ Rui Zhang ⋅ Qunshu Lin ⋅ Bin Wang ⋅ Zhiyuan Zhao ⋅ Man Jiang ⋅ Xiaomeng Zhao ⋅ Jin Shi ⋅ Fan Wu ⋅ Pei Chu ⋅ Minghao Liu ⋅ Zhenxiang Li ⋅ Chao Xu ⋅ Bo Zhang ⋅ Botian Shi ⋅ Zhongying Tu ⋅ Conghui He
ExHall D Poster #364
EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion Poster Session 6
Haotian Wang ⋅ Yuzhe Weng ⋅ Yueyan Li ⋅ Zilu Guo ⋅ Jun Du ⋅ Shutong Niu ⋅ Jiefeng Ma ⋅ Shan He ⋅ Wu Xiaoyan ⋅ Qiming Hu ⋅ Bing Yin ⋅ Cong Liu ⋅ Qingfeng Liu
ExHall D Poster #1
Docopilot: Improving Multimodal Models for Document-Level Understanding Poster Session 1
Yuchen Duan ⋅ Zhe Chen ⋅ Yusong Hu ⋅ Weiyun Wang ⋅ Shenglong Ye ⋅ Botian Shi ⋅ Lewei Lu ⋅ Qibin Hou ⋅ Tong Lu ⋅ Hongsheng Li ⋅ Jifeng Dai ⋅ Wenhai Wang
ExHall D Poster #367
Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation Poster Session 4
Libiao Chen ⋅ Dong Nie ⋅ Junjun Pan ⋅ Jing Yan ⋅ Zhenyu Tang
ExHall D Poster #427
M3amba: Memory Mamba is All You Need for Whole Slide Image Classification Poster Session 3
Tingting Zheng ⋅ Kui Jiang ⋅ Yi Xiao ⋅ Sicheng Zhao ⋅ Hongxun Yao
ExHall D Poster #474
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving Poster Session 1
Zebin Xing ⋅ Xingyu Zhang ⋅ Yang Hu ⋅ Bo Jiang ⋅ Tong He ⋅ Qian Zhang ⋅ Xiaoxiao Long ⋅ Wei Yin
ExHall D Poster #134
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary Poster Session 1
Kevin Qinghong Lin ⋅ Mike Zheng Shou
ExHall D Poster #292
Learnable Infinite Taylor Gaussian for Dynamic View Rendering Poster Session 6
Bingbing Hu ⋅ Yanyan Li ⋅ Rui Xie ⋅ Bo Xu ⋅ Haoye Dong ⋅ Junfeng Yao ⋅ Gim Hee Lee
ExHall D Poster #67
HyperPose: Hypernetwork-Infused Camera Pose Localization and an Extended Cambridge Landmarks Dataset Poster Session 3
Ron Ferens ⋅ Yosi Keller
ExHall D Poster #86
EAP-GS: Efficient Augmentation of Pointcloud for 3D Gaussian Splatting in Few-shot Scene Reconstruction Poster Session 4
Dongrui Dai ⋅ Yuxiang Xing
ExHall D Poster #63
Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields Poster Session 3
Navami Kairanda ⋅ Marc Habermann ⋅ Shanthika Shankar Naik ⋅ Christian Theobalt ⋅ Vladislav Golyanik
ExHall D Poster #68
Magma: A Foundation Model for Multimodal AI Agents Poster Session 3
Jianwei Yang ⋅ Reuben Tan ⋅ Qianhui Wu ⋅ Ruijie Zheng ⋅ Baolin Peng ⋅ Yongyuan Liang ⋅ Yu Gu ⋅ Mu Cai ⋅ Seonghyeon Ye ⋅ Joel Jang ⋅ Yuquan Deng ⋅ Jianfeng Gao
ExHall D Poster #340
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing Poster Session 4
Niu Lian ⋅ Jun Li ⋅ Jinpeng Wang ⋅ Ruisheng Luo ⋅ Yaowei Wang ⋅ Shu-Tao Xia ⋅ Bin Chen
ExHall D Poster #295
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering Poster Session 2
Federico Cocchi ⋅ Nicholas Moratelli ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Rita Cucchiara
ExHall D Poster #365
DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution Poster Session 2
Xingyuan Li ⋅ Zirui Wang ⋅ Yang Zou ⋅ Zhixin Chen ⋅ Jun Ma ⋅ Zhiying Jiang ⋅ Long Ma ⋅ Jinyuan Liu
ExHall D Poster #207
OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation Poster Session 1
Pengfei Zhou ⋅ Xiaopeng Peng ⋅ Jiajun Song ⋅ Chuanhao Li ⋅ Zhaopan Xu ⋅ Yue Yang ⋅ Ziyao Guo ⋅ Hao Zhang ⋅ Yuqi Lin ⋅ Yefei He ⋅ Lirui Zhao ⋅ Shuo Liu ⋅ Tianhua Li ⋅ Yuxuan Xie ⋅ Xiaojun Chang ⋅ Yu Qiao ⋅ Wenqi Shao ⋅ Kaipeng Zhang
ExHall D Poster #245
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions Poster Session 2
Kai Chen ⋅ Yunhao Gou ⋅ Runhui Huang ⋅ Zhili Liu ⋅ Daxin Tan ⋅ Jing Xu ⋅ Chunwei Wang ⋅ Yi Zhu ⋅ Yihan Zeng ⋅ Kuo Yang ⋅ Dingdong WANG ⋅ Kun Xiang ⋅ Haoyuan Li ⋅ Haoli Bai ⋅ Jianhua Han ⋅ Xiao-Hui Li ⋅ Weike Jin ⋅ Nian Xie ⋅ Yu Zhang ⋅ James Kwok ⋅ Hengshuang Zhao ⋅ Xiaodan Liang ⋅ Dit-Yan Yeung ⋅ Xiao Chen ⋅ Zhenguo Li ⋅ Wei Zhang ⋅ Qun Liu ⋅ Lanqing Hong ⋅ Lu Hou ⋅ Hang Xu
ExHall D Poster #1
Dual Exposure Stereo for Extended Dynamic Range 3D Imaging Poster Session 2
Juhyung Choi ⋅ Jinneyong Kim ⋅ Seokjun Choi ⋅ Jinwoo Lee ⋅ Samuel Brucker ⋅ Mario Bijelic ⋅ Felix Heide ⋅ Seung-Hwan Baek
ExHall D Poster #83
A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations Poster Session 1
Theo Bodrito ⋅ Olivier Flasseur ⋅ Julien Mairal ⋅ Jean Ponce ⋅ Maud Langlois ⋅ Anne-Marie Lagrange
ExHall D Poster #99
CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians Poster Session 4
Chongjian GE ⋅ Chenfeng Xu ⋅ Yuanfeng Ji ⋅ Chensheng Peng ⋅ Masayoshi Tomizuka ⋅ Ping Luo ⋅ Mingyu Ding ⋅ Varun Jampani ⋅ Wei Zhan
ExHall D Poster #261
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters Poster Session 6
Jianping Jiang ⋅ Weiye Xiao ⋅ Zhengyu Lin ⋅ Huaizhong Zhang ⋅ Tianxiang Ren ⋅ Yang Gao ⋅ Zhiqian Lin ⋅ Zhongang Cai ⋅ Lei Yang ⋅ Ziwei Liu
ExHall D Poster #71
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research Poster Session 4
James Burgess ⋅ Jeffrey J Nirschl ⋅ Laura Bravo-Sánchez ⋅ Alejandro Lozano ⋅ Sanket Rajan Gupte ⋅ Jesus G. Galaz-Montoya ⋅ Yuhui Zhang ⋅ Yuchang Su ⋅ Disha Bhowmik ⋅ Zachary Coman ⋅ Sarina M. Hasan ⋅ Alexandra Johannesson ⋅ William D. Leineweber ⋅ Malvika G Nair ⋅ Ridhi Yarlagadda ⋅ Connor Zuraski ⋅ Wah Chiu ⋅ Sarah Cohen ⋅ Jan N. Hansen ⋅ Manuel D Leonetti ⋅ Chad Liu ⋅ Emma Lundberg ⋅ Serena Yeung
ExHall D Poster #357
Learning Temporally Consistent Video Depth from Video Diffusion Priors Poster Session 5
Jiahao Shao ⋅ Yuanbo Yang ⋅ Hongyu Zhou ⋅ Youmin Zhang ⋅ Yujun Shen ⋅ Vitor Guizilini ⋅ Yue Wang ⋅ Matteo Poggi ⋅ Yiyi Liao
ExHall D Poster #170
Generative Image Layer Decomposition with Visual Effects Poster Session 2
Jinrui Yang ⋅ Qing Liu ⋅ Yijun Li ⋅ Soo Ye Kim ⋅ Daniil Pakhomov ⋅ Mengwei Ren ⋅ Jianming Zhang ⋅ Zhe Lin ⋅ Cihang Xie ⋅ Yuyin Zhou
ExHall D Poster #217
AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion Poster Session 2
Mingzhen Sun ⋅ Weining Wang ⋅ Li ⋅ Jiawei Liu ⋅ Jiahui Sun ⋅ Wanquan Feng ⋅ Shanshan Lao ⋅ SiYu Zhou ⋅ Qian HE ⋅ Jing Liu
ExHall D Poster #191
ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping Poster Session 3
Youxin Pang ⋅ Ruizhi Shao ⋅ Jiajun Zhang ⋅ Hanzhang Tu ⋅ Yun Liu ⋅ Boyao Zhou ⋅ Hongwen Zhang ⋅ Yebin Liu
ExHall D Poster #150
FlexUOD: The Answer to Real-world Unsupervised Image Outlier Detection Poster Session 3
Zhonghang Liu ⋅ Kun Zhou ⋅ Changshuo Wang ⋅ Daniel Lin ⋅ Jiangbo Lu
ExHall D Poster #434
Robust Multi-Object 4D Generation for In-the-wild Videos Poster Session 5
Wen-Hsuan Chu ⋅ Lei Ke ⋅ Jianmeng Liu ⋅ Mingxiao Huo ⋅ Pavel Tokmakov ⋅ Katerina Fragkiadaki
ExHall D Poster #97
MUSt3R: Multi-view Network for Stereo 3D Reconstruction Poster Session 1
Yohann Cabon ⋅ Lucas Stoffl ⋅ Leonid Antsfeld ⋅ Gabriela Csurka ⋅ Boris Chidlovskii ⋅ Jerome Revaud ⋅ Vincent Leroy
ExHall D Poster #82
SOAP: Vision-Centric 3D Semantic Scene Completion with Scene-Adaptive Decoder and Occluded Region-Aware View Projection Poster Session 4
Hyo-Jun Lee ⋅ Yeong Jun Koh ⋅ Hanul Kim ⋅ Hyunseop Kim ⋅ Yonguk Lee ⋅ Jinu Lee
ExHall D Poster #127
Taxonomy-Aware Evaluation of Vision-Language Models Poster Session 2
Vésteinn Snæbjarnarson ⋅ Kevin Du ⋅ Niklas Stoehr ⋅ Serge Belongie ⋅ Ryan Cotterell ⋅ Nico Lang ⋅ Stella Frank
ExHall D Poster #357
Yo’Chameleon: Personalized Vision and Language Generation Poster Session 3
Thao Nguyen ⋅ Krishna Kumar Singh ⋅ Jing Shi ⋅ Trung Bui ⋅ Yong Jae Lee ⋅ Yuheng Li
ExHall D Poster #362
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning Poster Session 2
Fida Mohammad Thoker ⋅ Letian Jiang ⋅ Chen Zhao ⋅ Bernard Ghanem
ExHall D Poster #293
Video Language Model Pretraining with Spatio-temporal Masking Poster Session 2
Yue Wu ⋅ Zhaobo Qi ⋅ Junshu Sun ⋅ Yaowei Wang ⋅ Qingming Huang ⋅ Shuhui Wang
ExHall D Poster #304
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training Poster Session 3
Sanghwan Kim ⋅ Rui Xiao ⋅ Iuliana Georgescu ⋅ Stephan Alaniz ⋅ Zeynep Akata
ExHall D Poster #387
Lifting Motion to the 3D World via 2D Diffusion Poster Session 4
Jiaman Li ⋅ Karen Liu ⋅ Jiajun Wu
ExHall D Poster #164
TAPT: Test-Time Adversarial Prompt Tuning for Robust Inference in Vision-Language Models Poster Session 4
Xin Wang ⋅ Kai Chen ⋅ Jiaming Zhang ⋅ Jingjing Chen ⋅ Xingjun Ma
ExHall D Poster #391
Your ViT is Secretly an Image Segmentation Model Poster Session 5
Tommie Kerssies ⋅ Niccolò Cavagnero ⋅ Alexander Hermans ⋅ Narges Norouzi ⋅ Giuseppe Averta ⋅ Bastian Leibe ⋅ Gijs Dubbelman ⋅ Daan de Geus
ExHall D Poster #407
Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition Poster Session 6
Hongda Liu ⋅ Yunfan Liu ⋅ Min Ren ⋅ Hao Wang ⋅ Yunlong Wang ⋅ Zhenan Sun
ExHall D Poster #296
Cross-Rejective Open-Set SAR Image Registration Poster Session 5
Shasha Mao ⋅ Shiming Lu ⋅ Zhaolong Du ⋅ Licheng Jiao ⋅ Shuiping Gou ⋅ Luntian Mou ⋅ Xuequan Lu ⋅ Lin Xiong ⋅ Yimeng Zhang
ExHall D Poster #187
Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion Poster Session 2
Xiangfeng Xu ⋅ Pinyi Zhang ⋅ Wenxuan Huang ⋅ Yunhang Shen ⋅ Haosheng Chen ⋅ Jingzhong Lin ⋅ Wei Li ⋅ Gaoqi He ⋅ Jiao Xie ⋅ Shaohui Lin
ExHall D Poster #424
Binarized Neural Network for Multi-spectral Image Fusion Poster Session 1
Junming Hou ⋅ Xiaoyu Chen ⋅ Ran Ran ⋅ Xiaofeng Cong ⋅ Xinyang Liu ⋅ Jian Wei You ⋅ Liang-Jian Deng
ExHall D Poster #194
CRISP: Object Pose and Shape Estimation with Test-Time Adaptation Poster Session 3
Jingnan Shi ⋅ Rajat Talak ⋅ Harry Zhang ⋅ David Jin ⋅ Luca Carlone
ExHall D Poster #96
GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior Poster Session 1
Zichen Tang ⋅ Yuan Yao ⋅ Miaomiao Cui ⋅ Liefeng Bo ⋅ Hongyu Yang
ExHall D Poster #17
Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting Poster Session 4
Hanxi Liu ⋅ Yifang Men ⋅ Zhouhui Lian
ExHall D Poster #11
Any-Resolution AI-Generated Image Detection by Spectral Learning Poster Session 4
Dimitrios Karageorgiou ⋅ Symeon Papadopoulos ⋅ Ioannis Kompatsiaris ⋅ Efstratios Gavves
ExHall D Poster #279
An Image-like Diffusion Method for Human-Object Interaction Detection Poster Session 3
Xiaofei Hui ⋅ Haoxuan Qu ⋅ Hossein Rahmani ⋅ Jun Liu
ExHall D Poster #321
MammAlps: A Multi-view Video Behavior Monitoring Dataset of Wild Mammals in the Swiss Alps Poster Session 3
Valentin Gabeff ⋅ Haozhe Qi ⋅ Brendan Flaherty ⋅ Gencer Sumbul ⋅ Alexander Mathis ⋅ Devis Tuia
ExHall D Poster #306
Diffusion-based Realistic Listening Head Generation via Hybrid Motion Modeling Poster Session 4
Yinuo Wang ⋅ Yanbo Fan ⋅ Xuan Wang ⋅ Yu Guo ⋅ Fei Wang
ExHall D Poster #3
PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers Poster Session 5
Wooju Lee ⋅ Juhye Park ⋅ Dasol Hong ⋅ Changki Sung ⋅ Youngwoo Seo ⋅ DongWan Kang ⋅ Hyun Myung
ExHall D Poster #89
SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens Poster Session 4
Chi Su ⋅ Xiaoxuan Ma ⋅ Jiajun Su ⋅ Yizhou Wang
ExHall D Poster #93
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens Poster Session 6
Kaihang Pan ⋅ Wang Lin ⋅ Zhongqi Yue ⋅ Tenglong Ao ⋅ Liyu Jia ⋅ Wei Zhao ⋅ Juncheng Li ⋅ Siliang Tang ⋅ Hanwang Zhang
ExHall D Poster #334
PICD: Versatile Perceptual Image Compression with Diffusion Rendering Poster Session 6
Tongda Xu ⋅ Jiahao Li ⋅ Bin Li ⋅ Yan Wang ⋅ Ya-Qin Zhang ⋅ Yan Lu
ExHall D Poster #217
Wonderland: Navigating 3D Scenes from a Single Image Poster Session 1
Hanwen Liang ⋅ Junli Cao ⋅ Vidit Goel ⋅ Guocheng Qian ⋅ Sergei Korolev ⋅ Demetri Terzopoulos ⋅ Konstantinos N. Plataniotis ⋅ Sergey Tulyakov ⋅ Jian Ren
ExHall D Poster #59
Learning from Streaming Video with Orthogonal Gradients Poster Session 3
Tengda Han ⋅ Dilara Gokay ⋅ Joseph Heyward ⋅ Chuhan Zhang ⋅ Daniel Zoran ⋅ Viorica Patraucean ⋅ Joao Carreira ⋅ Dima Damen ⋅ Andrew Zisserman
ExHall D Poster #286
SuperLightNet: Lightweight Parameter Aggregation Network for Multimodal Brain Tumor Segmentation Poster Session 1
Feng Yu ⋅ Jiacheng Cao ⋅ Li Liu ⋅ Minghua Jiang
ExHall D Poster #481
Black-Box Forgery Attacks on Semantic Watermarks for Diffusion Models Poster Session 5
Andreas Müller ⋅ Denis Lukovnikov ⋅ Jonas Thietke ⋅ Asja Fischer ⋅ Erwin Quiring
ExHall D Poster #256
VidSeg: Training-free Video Semantic Segmentation based on Diffusion Models Poster Session 5
Qian Wang ⋅ Abdelrahman Eldesokey ⋅ Mohit Mendiratta ⋅ Fangneng Zhan ⋅ Adam Kortylewski ⋅ Christian Theobalt ⋅ Peter Wonka
ExHall D Poster #183
Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields Poster Session 5
Runfeng Li ⋅ Mikhail Okunev ⋅ Zixuan Guo ⋅ Anh H Duong ⋅ Christian Richardt ⋅ Matthew O’Toole ⋅ James Tompkin
ExHall D Poster #85
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos Poster Session 5
Edward LOO ⋅ Tianyu HUANG ⋅ Peng Li ⋅ Zhiyang Dou ⋅ Cheng Lin ⋅ Zhiming Cui ⋅ Zhen Dong ⋅ Sai-Kit Yeung ⋅ Wenping Wang ⋅ Yuan Liu
ExHall D Poster #168
Compositional Caching for Training-free Open-vocabulary Attribute Detection Poster Session 3
Marco Garosi ⋅ Alessandro Conti ⋅ Gaowen Liu ⋅ Elisa Ricci ⋅ Massimiliano Mancini
ExHall D Poster #426
Seek Common Ground While Reserving Differences: Semi-Supervised Image-Text Sentiment Recognition Poster Session 6
Wuyou Xia ⋅ Guoli Jia ⋅ Sicheng Zhao ⋅ Jufeng Yang
ExHall D Poster #330
CoLLM: A Large Language Model for Composed Image Retrieval Poster Session 1
Chuong Huynh ⋅ Jinyu Yang ⋅ Ashish Tawari ⋅ Mubarak Shah ⋅ Son Dinh Tran ⋅ Raffay Hamid ⋅ Trishul Chilimbi ⋅ Abhinav Shrivastava
ExHall D Poster #364
Efficient Diffusion as Low Light Enhancer Poster Session 5
Guanzhou Lan ⋅ Qianli Ma ⋅ YUQI YANG ⋅ Zhigang Wang ⋅ Dong Wang ⋅ Xuelong Li ⋅ Bin Zhao
ExHall D Poster #22
Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis Poster Session 1
Tim Büchner ⋅ Christoph Anders ⋅ Orlando Guntinas-Lichius ⋅ Joachim Denzler
ExHall D Poster #5
VI³NR: Variance Informed Initialization for Implicit Neural Representations Poster Session 3
Chamin Hewa Koneputugodage ⋅ Yizhak Ben-Shabat ⋅ Sameera Ramasinghe ⋅ Stephen Gould
ExHall D Poster #270
M-LLM Based Video Frame Selection for Efficient Video Understanding Poster Session 3
Kai Hu ⋅ Feng Gao ⋅ Xiaohan Nie ⋅ Peng Zhou ⋅ Son Dinh Tran ⋅ Tal Neiman ⋅ Lingyun Wang ⋅ Mubarak Shah ⋅ Raffay Hamid ⋅ Bing Yin ⋅ Trishul Chilimbi
ExHall D Poster #292
Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Poster Session 3
Mankeerat Sidhu ⋅ Hetarth Chopra ⋅ Ansel Blume ⋅ Jeonghwan Kim ⋅ Revanth Gangi Reddy ⋅ Heng Ji
ExHall D Poster #429
EgoLM: Multi-Modal Language Model of Egocentric Motions Poster Session 2
Fangzhou Hong ⋅ Vladimir Guzov ⋅ Hyo Jin Kim ⋅ Yuting Ye ⋅ Richard Newcombe ⋅ Ziwei Liu ⋅ Lingni Ma
ExHall D Poster #69
Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding Poster Session 2
Atharv Mahesh Mane ⋅ Dulanga Weerakoon ⋅ Vigneshwaran Subbaraju ⋅ Sougata Sen ⋅ Sanjay Sarma ⋅ Archan Misra
ExHall D Poster #349
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation Poster Session 3
Zhuoman Liu ⋅ Weicai Ye ⋅ Yan Luximon ⋅ Pengfei Wan ⋅ Di ZHANG
ExHall D Poster #35
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation Poster Session 1
Xin Yan ⋅ Yuxuan Cai ⋅ Qiuyue Wang ⋅ Yuan Zhou ⋅ Wenhao Huang ⋅ Huan Yang
ExHall D Poster #289
HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks Poster Session 5
Maria Pilligua ⋅ Danna Xue ⋅ Javier Vazquez-Corral
ExHall D Poster #178
UnCommon Objects in 3D Poster Session 3
Xingchen Liu ⋅ Piyush Tayal ⋅ Jianyuan Wang ⋅ Jesus Zarzar ⋅ Tom Monnier ⋅ Konstantinos Tertikas ⋅ Jiali Duan ⋅ Antoine Toisoul ⋅ Jason Y. Zhang ⋅ Natalia Neverova ⋅ Andrea Vedaldi ⋅ Roman Shapovalov ⋅ David Novotny
ExHall D Poster #331
Disentangled Pose and Appearance Guidance for Multi-Pose Generation Poster Session 2
Tengfei Xiao ⋅ Yue Wu ⋅ Yuelong Li ⋅ Can Qin ⋅ Maoguo Gong ⋅ Qiguang Miao ⋅ Wenping Ma
ExHall D Poster #19
Instant Adversarial Purification with Adversarial Consistency Distillation Poster Session 5
Chun Tong Lei ⋅ Hon Ming Yam ⋅ Zhongliang Guo ⋅ Yifei Qian ⋅ Chun Pong Lau
ExHall D Poster #316
CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework Poster Session 2
Yanlong Xu ⋅ Haoxuan Qu ⋅ Jun Liu ⋅ Wenxiao Zhang ⋅ Xun Yang
ExHall D Poster #121
Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding Poster Session 3
Yan Wang ⋅ Baoxiong Jia ⋅ Ziyu Zhu ⋅ Siyuan Huang
ExHall D Poster #333
Decoupling Training-Free Guided Diffusion by ADMM Poster Session 5
Youyuan Zhang ⋅ Zehua Liu ⋅ Zenan Li ⋅ Zhaoyu Li ⋅ James Clark ⋅ Xujie Si
ExHall D Poster #213
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion Poster Session 5
Trong-Tung Nguyen ⋅ Quang Nguyen ⋅ Khoi Nguyen ⋅ Anh Tran ⋅ Cuong Pham
ExHall D Poster #42
Convex Combination Star Shape Prior for Data-driven Image Semantic Segmentation Poster Session 3
Xinyu Zhao ⋅ Jun Xie ⋅ Shengzhe Chen ⋅ Jun Liu
ExHall D Poster #327
Hyperbolic Safety-Aware Vision-Language Models Poster Session 1
Tobia Poppi ⋅ Tejaswi Kasarla ⋅ Pascal Mettes ⋅ Lorenzo Baraldi ⋅ Rita Cucchiara
ExHall D Poster #387
WISH: Weakly Supervised Instance Segmentation using Heterogeneous Labels Poster Session 5
Hyeokjun Kweon ⋅ Kuk-Jin Yoon
ExHall D Poster #414
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors Poster Session 4
Yifan Yu ⋅ Shaohui Liu ⋅ Rémi Pautrat ⋅ Marc Pollefeys ⋅ Viktor Larsson
ExHall D Poster #84
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition Poster Session 4
Khanh Nguyen ⋅ Ghulam Mubashar Hassan ⋅ Ajmal Mian
ExHall D Poster #110
V2X-R: Cooperative LiDAR-4D Radar Fusion with Denoising Diffusion for 3D Object Detection Poster Session 6
Xun Huang ⋅ Jinlong Wang ⋅ Qiming Xia ⋅ Siheng Chen ⋅ Bisheng Yang ⋅ Xin Li ⋅ Cheng Wang ⋅ Chenglu Wen
ExHall D Poster #118
Foundations of the Theory of Performance-Based Ranking Poster Session 3
Sébastien Piérard ⋅ Anaïs Halin ⋅ Anthony Cioppa ⋅ Adrien Deliege ⋅ Marc Van Droogenbroeck
ExHall D Poster #348
Community Forensics: Using Thousands of Generators to Train Fake Image Detectors Poster Session 2
Jeongsoo Park ⋅ Andrew Owens
ExHall D Poster #274
APT: Adaptive Personalized Training for Diffusion Models with Limited Data Poster Session 6
JungWoo Chae ⋅ Jiyoon Kim ⋅ Jaewoong Choi ⋅ Kyungyul Kim ⋅ Sangheum Hwang
ExHall D Poster #234
Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation Poster Session 1
Xiang Li ⋅ Zixuan Huang ⋅ Anh Thai ⋅ James Rehg
ExHall D Poster #54
Frequency-Biased Synergistic Design for Image Compression and Compensation Poster Session 3
Jiaming Liu ⋅ Qi Zheng ⋅ Zihao Liu ⋅ Yilian Zhong ⋅ Peiye Liu ⋅ Tao Liu ⋅ Shusong Xu ⋅ Yanheng Lu ⋅ Sicheng Li ⋅ Dimin Niu ⋅ Yibo Fan
ExHall D Poster #207
PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering Poster Session 2
Yifan Gao ⋅ Zihang Lin ⋅ Chuanbin Liu ⋅ Min Zhou ⋅ Tiezheng Ge ⋅ Bo Zheng ⋅ Hongtao Xie
ExHall D Poster #259
Hierarchical Gaussian Mixture Model Splatting for Efficient and Part Controllable 3D Generation Poster Session 3
Qitong Yang ⋅ Mingtao Feng ⋅ Zijie Wu ⋅ Weisheng Dong ⋅ Fangfang Wu ⋅ Yaonan Wang ⋅ Ajmal Mian
ExHall D Poster #43
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting Poster Session 1
Runsong Zhu ⋅ Shi Qiu ⋅ ZHENGZHE LIU ⋅ Ka-Hei Hui ⋅ Qianyi Wu ⋅ Pheng-Ann Heng ⋅ Chi-Wing Fu
ExHall D Poster #332
Quad-Pixel Image Defocus Deblurring: A New Benchmark and Model Poster Session 2
Hang Chen ⋅ Yin Xie ⋅ Xiaoxiu Peng ⋅ Lihu Sun ⋅ Wenkai Su ⋅ Xiaodong Yang ⋅ Chengming Liu
ExHall D Poster #25
DocVLM: Make Your VLM an Efficient Reader Poster Session 1
Mor Shpigel Nacson ⋅ Aviad Aberdam ⋅ Roy Ganz ⋅ Elad Ben Avraham ⋅ Alona Golts ⋅ Yair Kittenplon ⋅ Shai Mazor ⋅ Ron Litman
ExHall D Poster #486
Revisiting Source-Free Domain Adaptation: Insights into Representativeness, Generalization, and Variety Poster Session 5
Ronghang Zhu ⋅ Mengxuan Hu ⋅ Weiming Zhuang ⋅ Lingjuan Lyu ⋅ Xiang Yu ⋅ Sheng Li
ExHall D Poster #445
Improving Gaussian Splatting with Localized Points Management Poster Session 5
Haosen Yang ⋅ Chenhao Zhang ⋅ Wenqing Wang ⋅ Marco Volino ⋅ Adrian Hilton ⋅ Li Zhang ⋅ Xiatian Zhu
ExHall D Poster #61
GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency Poster Session 1
Dongyue Lu ⋅ Lingdong Kong ⋅ Tianxin Huang ⋅ Gim Hee Lee
ExHall D Poster #141
SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow Poster Session 5
Qingyuan Wang ⋅ Rui Song ⋅ Jiaojiao Li ⋅ Kerui Cheng ⋅ David Ferstl ⋅ Yinlin Hu
ExHall D Poster #95
D^3CTTA: Domain-Dependent Decorrelation for Continual Test-Time Adaption of 3D LiDAR Segmentation Poster Session 3
Jichun Zhao ⋅ Haiyong Jiang ⋅ Haoxuan Song ⋅ Jun Xiao ⋅ Dong Gong
ExHall D Poster #118
Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution Poster Session 1
Qihao Liu ⋅ Xi Yin ⋅ Alan L. Yuille ⋅ Andrew Brown ⋅ Mannat Singh
ExHall D Poster #249
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations Poster Session 6
Hmrishav Bandyopadhyay ⋅ Yi-Zhe Song
ExHall D Poster #212
Watermarking One for All: A Robust Watermarking Scheme Against Partial Image Theft Poster Session 2
Gaozhi Liu ⋅ Silu Cao ⋅ Zhenxing Qian ⋅ Xinpeng Zhang ⋅ Sheng Li ⋅ Wanli Peng
ExHall D Poster #272
ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On Poster Session 6
Ji Woo Hong ⋅ Tri Ton ⋅ Trung X. Pham ⋅ Gwanhyeong Koo ⋅ Sunjae Yoon ⋅ Chang D. Yoo
ExHall D Poster #202
VolFormer: Explore More Comprehensive Cube Interaction for Hyperspectral Image Restoration and Beyond Poster Session 6
Dabing Yu ⋅ Zheng Gao
ExHall D Poster #183
Recovering Dynamic 3D Sketches from Videos Poster Session 3
Jaeah Lee ⋅ Changwoon Choi ⋅ Young Min Kim ⋅ Jaesik Park
ExHall D Poster #169
Improving Transferable Targeted Attacks with Feature Tuning Mixup Poster Session 5
Kaisheng Liang ⋅ Xuelong Dai ⋅ Yanjie Li ⋅ Dong Wang ⋅ Bin Xiao
ExHall D Poster #457
OmniStereo: Real-time Omnidirectional Depth Estimation with Multiview Fisheye Cameras Poster Session 1
Jiaxi Deng ⋅ Yushen Wang ⋅ Haitao Meng ⋅ Zuoxun Hou ⋅ Yi Chang ⋅ Gang Chen
ExHall D Poster #78
DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery Poster Session 1
Jiadong Tang ⋅ Yu Gao ⋅ Dianyi Yang ⋅ Liqi Yan ⋅ Yufeng Yue ⋅ Yi Yang
ExHall D Poster #62
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation Poster Session 6
Tianyi Yan ⋅ Dongming Wu ⋅ Wencheng Han ⋅ Junpeng Jiang ⋅ xia zhou ⋅ Kun Zhan ⋅ Cheng-Zhong Xu ⋅ Jianbing Shen
ExHall D Poster #131
Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency Poster Session 1
Yutong Wang ⋅ Jiajie Teng ⋅ Jiajiong Cao ⋅ Yuming Li ⋅ Chenguang Ma ⋅ Hongteng Xu ⋅ Dixin Luo
ExHall D Poster #189
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment Poster Session 4
Darshana Saravanan ⋅ Varun Gupta ⋅ Darshan Singh S ⋅ Zeeshan Khan ⋅ Vineet Gandhi ⋅ Makarand Tapaswi
ExHall D Poster #298
IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation Poster Session 1
Yiren Song ⋅ Pei Yang ⋅ Hai Ci ⋅ Mike Zheng Shou
ExHall D Poster #273
Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition Poster Session 3
Zheda Mai ⋅ Ping Zhang ⋅ Cheng-Hao Tu ⋅ Hong-You Chen ⋅ Quang-Huy Nguyen ⋅ Li Zhang ⋅ Wei-Lun Chao
ExHall D Poster #401
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification Poster Session 1
Jingwei Zhang ⋅ Anh Tien Nguyen ⋅ Xi Han ⋅ Vincent Quoc-Huy Trinh ⋅ Hong Qin ⋅ Dimitris Samaras ⋅ Mahdi Hosseini
ExHall D Poster #325
H2ST: Hierarchical Two-Sample Tests for Continual Out-of-Distribution Detection Poster Session 3
Yuhang Liu ⋅ Wenjie Zhao ⋅ Yunhui Guo
ExHall D Poster #456
MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning Poster Session 5
Wenhao Gu ⋅ Li Gu ⋅ Ching Suen ⋅ Yang Wang
ExHall D Poster #233
Subnet-Aware Dynamic Supernet Training for Neural Architecture Search Poster Session 6
Jeimin Jeon ⋅ Youngmin Oh ⋅ Junghyup Lee ⋅ Donghyeon Baek ⋅ Dohyung Kim ⋅ Chanho Eom ⋅ Bumsub Ham
ExHall D Poster #381
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders Poster Session 4
jiajun cao ⋅ Yuan Zhang ⋅ Tao Huang ⋅ Ming Lu ⋅ Qizhe Zhang ⋅ Ruichuan An ⋅ Ningning Ma ⋅ Shanghang Zhang
ExHall D Poster #385
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model Poster Session 4
Xiaoding Yuan ⋅ Shitao Tang ⋅ Kejie Li ⋅ Peng Wang
ExHall D Poster #54
Improving Visual and Downstream Performance of Low-Light Enhancer with Vision Foundation Models Collaboration Poster Session 4
yuxuan Gu ⋅ Huaian Chen ⋅ Yi Jin ⋅ Haoxuan Wang ⋅ Pengyang Ling ⋅ ZHIXIANG WEI ⋅ Enhong Chen
ExHall D Poster #20
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance Poster Session 5
Yang Yue ⋅ Yulin Wang ⋅ Haojun Jiang ⋅ Pan Liu ⋅ Shiji Song ⋅ Gao Huang
ExHall D Poster #476
FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs Poster Session 3
Mothilal Asokan ⋅ Kebin wu ⋅ Fatima Albreiki
ExHall D Poster #367
Neural Hierarchical Decomposition for Single Image Plant Modeling Poster Session 1
Zhihao Liu ⋅ Zhanglin Cheng ⋅ Naoto Yokoya
ExHall D Poster #53
Temporal Action Detection Model Compression by Progressive Block Drop Poster Session 6
Xiaoyong Chen ⋅ Yong Guo ⋅ Jiaming Liang ⋅ Sitong Zhuang ⋅ Runhao Zeng ⋅ Xiping Hu
ExHall D Poster #294
Face Forgery Video Detection via Temporal Forgery Cue Unraveling Poster Session 2
Zonghui Guo ⋅ YingJie Liu ⋅ Jie Zhang ⋅ Haiyong Zheng ⋅ Shiguang Shan
ExHall D Poster #194
Temporally Consistent Object-Centric Learning by Contrasting Slots Poster Session 2
Anna Manasyan ⋅ Maximilian Seitzer ⋅ Filip Radovic ⋅ Georg Martius ⋅ Andrii Zadaianchuk
ExHall D Poster #161
Hash3D: Training-free Acceleration for 3D Generation Poster Session 5
Xingyi Yang ⋅ Songhua Liu ⋅ Xinchao Wang
ExHall D Poster #41
SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance Poster Session 4
Peishan Cong ⋅ Ziyi Wang ⋅ Yuexin Ma ⋅ Xiangyu Yue
ExHall D Poster #168
Generative Photomontage Poster Session 2
Sean J. Liu ⋅ Nupur Kumari ⋅ Ariel Shamir ⋅ Jun-Yan Zhu
ExHall D Poster #245
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation Poster Session 2
Haoyu Guo ⋅ He Zhu ⋅ Sida Peng ⋅ Haotong Lin ⋅ Yunzhi Yan ⋅ Tao Xie ⋅ Wenguan Wang ⋅ Xiaowei Zhou ⋅ Hujun Bao
ExHall D Poster #80
A Unified Image-Dense Annotation Generation Model for Underwater Scenes Poster Session 1
Hongkai Lin ⋅ Dingkang Liang ⋅ Zhenghao Qi ⋅ Xiang Bai
ExHall D Poster #74
Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing Poster Session 5
Ruiyi Wang ⋅ Yushuo Zheng ⋅ Zicheng Zhang ⋅ Chunyi Li ⋅ Shuaicheng Liu ⋅ Guangtao Zhai ⋅ Xiaohong Liu
ExHall D Poster #193
HuMoCon: Concept Discovery for Human Motion Understanding Poster Session 2
Qihang Fang ⋅ Chengcheng Tang ⋅ Bugra Tekin ⋅ Shugao Ma ⋅ Yanchao Yang
ExHall D Poster #174
Curriculum Direct Preference Optimization for Diffusion and Consistency Models Poster Session 1
Florinel Croitoru ⋅ Vlad Hondru ⋅ Radu Tudor Ionescu ⋅ Nicu Sebe ⋅ Mubarak Shah
ExHall D Poster #255
OpenMIBOOD: Open Medical Imaging Benchmarks for Out-Of-Distribution Detection Poster Session 5
Max Gutbrod ⋅ David Rauber ⋅ Danilo Weber Nunes ⋅ Christoph Palm
ExHall D Poster #465
Detecting Open World Objects via Partial Attribute Assignment Poster Session 4
Muli Yang ⋅ Gabriel James Goenawan ⋅ Huaiyuan Qin ⋅ Kai Han ⋅ Xi Peng ⋅ Yanhua Yang ⋅ Hongyuan Zhu
ExHall D Poster #430
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models Poster Session 6
Alice Heiman ⋅ Xiaoman Zhang ⋅ Emma Chen ⋅ Sung Eun Kim ⋅ Pranav Rajpurkar
ExHall D Poster #444
Personalized Preference Fine-tuning of Diffusion Models Poster Session 2
Meihua Dang ⋅ Anikait Singh ⋅ Linqi Zhou ⋅ Stefano Ermon ⋅ Jiaming Song
ExHall D Poster #253
FSHNet: Fully Sparse Hybrid Network for 3D Object Detection Poster Session 2
Shuai Liu ⋅ Mingyue Cui ⋅ Boyang Li ⋅ Quanmin Liang ⋅ Tinghe Hong ⋅ Kai Huang ⋅ yunxiao shan ⋅ Kai Huang
ExHall D Poster #338
3D-SLNR: A Super Lightweight Neural Representation for Large-scale 3D Mapping Poster Session 6
Chenhui Shi ⋅ Fulin Tang ⋅ Ning An ⋅ Yihong Wu
ExHall D Poster #103
STINR: Deciphering Spatial Transcriptomics via Implicit Neural Representation Poster Session 5
Yisi Luo ⋅ Xile Zhao ⋅ Kai Ye ⋅ Deyu Meng
ExHall D Poster #470
Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios Poster Session 3
Hang Shao ⋅ lei luo ⋅ Jianjun Qian ⋅ Mengkai Yan ⋅ Shuo Chen ⋅ Jian Yang
ExHall D Poster #19
Unsupervised Discovery of Facial Landmarks and Head Pose Poster Session 5
Satyajit Tourani ⋅ Siddharth Tourani ⋅ Arif Mahmood ⋅ Muhammad Haris Khan
ExHall D Poster #14
Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning Poster Session 6
Sherry X. Chen ⋅ Misha Sra ⋅ Pradeep Sen
ExHall D Poster #224
Stabilizing and Accelerating Autofocus with Expert Trajectory Regularized Deep Reinforcement Learning Poster Session 6
Shouhang Zhu ⋅ Chenglin Li ⋅ Yuankun Jiang ⋅ Li Wei ⋅ Nuowen Kan ⋅ Ziyang Zheng ⋅ Wenrui Dai ⋅ Junni Zou ⋅ Hongkai Xiong
ExHall D Poster #24
Simulator HC: Regression-based Online Simulation of Starting Problem-Solution Pairs for Homotopy Continuation in Geometric Vision Poster Session 6
Xinyue Zhang ⋅ Zijia Dai ⋅ Wanting Xu ⋅ Laurent Kneip
ExHall D Poster #91
Dynamic Integration of Task-Specific Adapters for Class Incremental Learning Poster Session 6
Jiashuo Li ⋅ Shaokun Wang ⋅ Bo Qian ⋅ Yuhang He ⋅ Xing Wei ⋅ Qiang Wang ⋅ Yihong Gong
ExHall D Poster #421
MoFlow: One-Step Flow Matching for Human Trajectory Forecasting via Implicit Maximum Likelihood Estimation based Distillation Poster Session 4
Yuxiang Fu ⋅ Qi Yan ⋅ Ke Li ⋅ Lele Wang ⋅ Renjie Liao
ExHall D Poster #140
Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions Poster Session 5
Chan Hur ⋅ Jeong-hun Hong ⋅ Dong-hun Lee ⋅ Dabin Kang ⋅ Semin Myeong ⋅ Sang-hyo Park ⋅ Hyeyoung Park
ExHall D Poster #292
MaRI: Material Retrieval Integration across Domains Poster Session 2
Jianhui Wang ⋅ Zhifei Yang ⋅ Yangfan He ⋅ Huixiong Zhang ⋅ Yuxuan Chen ⋅ Jingwei Huang
ExHall D Poster #35
GeoMM: On Geodesic Perspective for Multi-modal Learning Poster Session 1
Shibin Mei ⋅ Hang Wang ⋅ Bingbing Ni
ExHall D Poster #441
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning Poster Session 2
Xueqing Wu ⋅ Yuheng Ding ⋅ Bingxuan Li ⋅ Pan Lu ⋅ Da Yin ⋅ Kai-Wei Chang ⋅ Nanyun Peng
ExHall D Poster #396
Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations Poster Session 6
Jeonghyeon Kim ⋅ Sangheum Hwang
ExHall D Poster #366
Breaking the Memory Barrier of Contrastive Loss via Tile-Based Strategy Poster Session 2
Zesen Cheng ⋅ Hang Zhang ⋅ Kehan Li ⋅ Sicong Leng ⋅ Zhiqiang Hu ⋅ Fei Wu ⋅ Deli Zhao ⋅ Xin Li ⋅ Lidong Bing
ExHall D Poster #444
RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training Poster Session 2
Raktim Gautam Goswami ⋅ Prashanth Krishnamurthy ⋅ Yann LeCun ⋅ Farshad Khorrami
ExHall D Poster #149
Universal Domain Adaptation for Semantic Segmentation Poster Session 1
Seun-An Choe ⋅ Keon Hee Park ⋅ Jinwoo Choi ⋅ Gyeong-Moon Park
ExHall D Poster #425
Distraction is All You Need for Multimodal Large Language Model Jailbreaking Poster Session 2
Zuopeng Yang ⋅ Jiluan Fan ⋅ Anli Yan ⋅ Erdun Gao ⋅ Xin Lin ⋅ Tao Li ⋅ Kanghua Mo ⋅ Changyu Dong
ExHall D Poster #390
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models Poster Session 2
Saeed Ranjbar Alvar ⋅ Gursimran Singh ⋅ Mohammad Akbari ⋅ Yong Zhang
ExHall D Poster #383
PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation Poster Session 4
Qihan Huang ⋅ Weilong Dai ⋅ Jinlong Liu ⋅ Wanggui He ⋅ Hao Jiang ⋅ Mingli Song ⋅ Jie Song
ExHall D Poster #245
Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry Poster Session 2
Rui Wang ⋅ Shaocheng Jin ⋅ Ziheng Chen ⋅ Xiaoqing Luo ⋅ Xiaojun Wu
ExHall D Poster #279
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation Poster Session 1
Claudia Cuttano ⋅ Gabriele Trivigno ⋅ Gabriele Rosi ⋅ Carlo Masone ⋅ Giuseppe Averta
ExHall D Poster #308
GeoAvatar: Geometrically-Consistent Multi-Person Avatar Reconstruction from Sparse Multi-View Videos Poster Session 5
Soohyun Lee ⋅ SeoYeon Kim ⋅ HeeKyung Lee ⋅ Won-Sik Cheong ⋅ Joo Ho Lee
ExHall D Poster #9
Robust-MVTON: Learning Cross-Pose Feature Alignment and Fusion for Robust Multi-View Virtual Try-On Poster Session 4
Nannan Zhang ⋅ Yijiang Li ⋅ Dong Du ⋅ Zheng Chong ⋅ Zhengwentai Sun ⋅ Jianhao Zeng ⋅ Yusheng Dai ⋅ Zhenyu Xie ⋅ Hairui Zhu ⋅ Xiaoguang Han
ExHall D Poster #16
FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity Poster Session 3
Jinxi Li ⋅ Ziyang Song ⋅ Siyuan Zhou ⋅ Bo Yang
ExHall D Poster #170
Zero-Shot Monocular Scene Flow Estimation in the Wild Poster Session 5
Yiqing Liang ⋅ Abhishek Badki ⋅ Hang Su ⋅ James Tompkin ⋅ Orazio Gallo
ExHall D Poster #165
MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities Poster Session 6
Bizhu Wu ⋅ Jinheng Xie ⋅ Keming Shen ⋅ Zhe Kong ⋅ Jianfeng Ren ⋅ Ruibin Bai ⋅ Rong Qu ⋅ Linlin Shen
ExHall D Poster #160
Retaining Knowledge and Enhancing Long-Text Representations in CLIP through Dual-Teacher Distillation Poster Session 5
Yuheng Feng ⋅ Changsong Wen ⋅ Zelin Peng ⋅ Li jiaye ⋅ Siyu Zhu
ExHall D Poster #369
Test-time Augmentation Improves Efficiency in Conformal Prediction Poster Session 4
Divya M Shanmugam ⋅ Helen Lu ⋅ Swami Sankaranarayanan ⋅ John Guttag
ExHall D Poster #458
Breaking the Low-Rank Dilemma of Linear Attention Poster Session 5
Qihang Fan ⋅ Huaibo Huang ⋅ Ran He
ExHall D Poster #404
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection Poster Session 2
Enshen Zhou ⋅ Qi Su ⋅ Cheng Chi ⋅ Zhizheng Zhang ⋅ Zhongyuan Wang ⋅ Tiejun Huang ⋅ Lu Sheng ⋅ He Wang
ExHall D Poster #148
Bridge Frame and Event: Common Spatiotemporal Fusion for High-Dynamic Scene Optical Flow Poster Session 6
Hanyu Zhou ⋅ Haonan Wang ⋅ Haoyue Liu ⋅ Yuxing Duan ⋅ Yi Chang ⋅ Luxin Yan
ExHall D Poster #165
EVPGS: Enhanced View Prior Guidance for Splatting-based Extrapolated View Synthesis Poster Session 4
Jiahe Li ⋅ Feiyu Wang ⋅ Xiaochao Qu ⋅ WU CHENGJING ⋅ Luoqi Liu ⋅ Ting Liu
ExHall D Poster #53
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding Poster Session 4
Yawen Shao ⋅ Wei Zhai ⋅ Yuhang Yang ⋅ Hongchen Luo ⋅ Yang Cao ⋅ Zheng-Jun Zha
ExHall D Poster #147
Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification Poster Session 5
Yanghao Wang ⋅ Long Chen
ExHall D Poster #432
MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations Poster Session 3
Kyungho Bae ⋅ Jinhyung Kim ⋅ Sihaeng Lee ⋅ Soonyoung Lee ⋅ Gunhee Lee ⋅ Jinwoo Choi
ExHall D Poster #296
Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks Poster Session 6
Nina Shvetsova ⋅ Arsha Nagrani ⋅ Bernt Schiele ⋅ Hilde Kuehne ⋅ Christian Rupprecht
ExHall D Poster #278
Layered Motion Fusion: Lifting Motion Segmentation to 3D in Egocentric Videos Poster Session 4
Vadim Tschernezki ⋅ Diane Larlus ⋅ Andrea Vedaldi ⋅ Iro Laina
ExHall D Poster #175
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing Poster Session 6
Yixuan Zhu ⋅ Haolin Wang ⋅ Shilin Ma ⋅ Wenliang Zhao ⋅ Yansong Tang ⋅ Lei Chen ⋅ Jie Zhou
ExHall D Poster #216
MotiF: Making Text Count in Image Animation with Motion Focal Loss Poster Session 2
Shijie Wang ⋅ Samaneh Azadi ⋅ Rohit Girdhar ⋅ Sai Saketh Rambhatla ⋅ Chen Sun ⋅ Xi Yin
ExHall D Poster #230
Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion Poster Session 5
Yuxi Mi ⋅ Zhizhou Zhong ⋅ Yuge Huang ⋅ Qiuyang Yuan ⋅ Xuan Zhao ⋅ Jianqing Xu ⋅ Shouhong Ding ⋅ ShaoMing Wang ⋅ Rizen Guo ⋅ Shuigeng Zhou
ExHall D Poster #15
Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model Poster Session 4
Leheng Zhang ⋅ Weiyi You ⋅ Kexuan Shi ⋅ Shuhang Gu
ExHall D Poster #207
Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval Poster Session 3
Yushuai Sun ⋅ Zikun Zhou ⋅ Dongmei Jiang ⋅ Yaowei Wang ⋅ Jun Yu ⋅ Guangming Lu ⋅ Wenjie Pei
ExHall D Poster #441
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation Poster Session 2
Reza Abbasi ⋅ Ali Nazari ⋅ Aminreza Sefid ⋅ Mohammadali Banayeeanzade ⋅ Mohammad Rohban ⋅ Mahdieh Baghshah
ExHall D Poster #375
MetricGrids: Arbitrary Nonlinear Approximation with Elementary Metric Grids based Implicit Neural Representation Poster Session 5
Shu Wang ⋅ Yanbo Gao ⋅ Shuai Li ⋅ Chong Lv ⋅ Xun Cai ⋅ chuankun Li ⋅ Hui Yuan ⋅ jinglin zhang
ExHall D Poster #32
FoundationStereo: Zero-Shot Stereo Matching Poster Session 2
Bowen Wen ⋅ Matthew Trepte ⋅ Oluwaseun Joseph Aribido ⋅ Jan Kautz ⋅ Orazio Gallo ⋅ Stan Birchfield
ExHall D Poster #81
Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability Poster Session 2
Yingdong Shi ⋅ Changming Li ⋅ Yifan Wang ⋅ Yongxiang Zhao ⋅ Anqi Pang ⋅ Sibei Yang ⋅ Jingyi Yu ⋅ Kan Ren
ExHall D Poster #269
Learning Physics-Based Full-Body Human Reaching and Grasping from Brief Walking References Poster Session 6
Yitang Li ⋅ Mingxian Lin ⋅ Zhuo Lin ⋅ Yipeng Deng ⋅ Yue Cao ⋅ Li Yi
ExHall D Poster #144
Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning Poster Session 1
Huabin Liu ⋅ Filip Ilievski ⋅ Cees G. M. Snoek
ExHall D Poster #296
QMambaBSR: Burst Image Super-Resolution with Query State Space Model Poster Session 5
Xin Di ⋅ Long Peng ⋅ Peizhe Xia ⋅ Wenbo Li ⋅ Renjing Pei ⋅ Yang Wang ⋅ Yang Cao ⋅ Zheng-Jun Zha
ExHall D Poster #192
ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points Poster Session 2
Qirui Huang ⋅ Runze Zhang ⋅ Kangjun Liu ⋅ Minglun Gong ⋅ Hao Zhang ⋅ Hui Huang
ExHall D Poster #114
Encapsulated Composition of Text-to-Image and Text-to-Video Models for High-Quality Video Synthesis Poster Session 4
Tongtong Su ⋅ Chengyu Wang ⋅ Bingyan Liu ⋅ Jun Huang ⋅ Dongming Lu
ExHall D Poster #229
Multi-Group Proportional Representations for Text-to-Image Models Poster Session 5
Sangwon Jung ⋅ Alex Oesterling ⋅ Claudio Mayrink Verdun ⋅ Sajani Vithana ⋅ Taesup Moon ⋅ Flavio Calmon
ExHall D Poster #261
COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts Poster Session 2
Jiansheng Li ⋅ Xingxuan Zhang ⋅ Hao Zou ⋅ Yige Guo ⋅ Renzhe Xu ⋅ Yilong Liu ⋅ Chuzhao Zhu ⋅ Yue He ⋅ Peng Cui
ExHall D Poster #364
Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis Poster Session 4
M. Hamza Mughal ⋅ Rishabh Dabral ⋅ Merel CJ Scholman ⋅ Vera Demberg ⋅ Christian Theobalt
ExHall D Poster #71
Towards a Universal Synthetic Video Detector: From Face or Background Manipulations to Fully AI-Generated Content Poster Session 6
Rohit Kundu ⋅ Hao Xiong ⋅ Vishal Mohanty ⋅ Athula Balachandran ⋅ Amit K. Roy-Chowdhury
ExHall D Poster #179
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation Poster Session 1
Liao Qu ⋅ Huichao Zhang ⋅ Yiheng Liu ⋅ Xu Wang ⋅ Yi Jiang ⋅ Yiming Gao ⋅ Hu Ye ⋅ Daniel Kang Du ⋅ Zehuan Yuan ⋅ Xinglong Wu
ExHall D Poster #228
Improving Personalized Search with Regularized Low-Rank Parameter Updates Poster Session 4
Fiona Ryan ⋅ Josef Sivic ⋅ Fabian Caba Heilbron ⋅ Judy Hoffman ⋅ James Rehg ⋅ Bryan Russell
ExHall D Poster #376
A Focused Human Body Model for Accurate Anthropometric Measurements Extraction Poster Session 5
Shuhang Chen ⋅ Xianliang Huang ⋅ Zhizhou Zhong ⋅ Jihong Guan ⋅ Shuigeng Zhou
ExHall D Poster #152
EchoMatch: Partial-to-Partial Shape Matching via Correspondence Reflection Poster Session 3
Yizheng Xie ⋅ Viktoria Ehm ⋅ Paul Roetzer ⋅ Nafie El Amrani ⋅ Maolin Gao ⋅ Florian Bernard ⋅ Daniel Cremers
ExHall D Poster #98
CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization Poster Session 2
Junhao Xu ⋅ Yanan Zhang ⋅ Zhi Cai ⋅ Di Huang
ExHall D Poster #140
Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping Poster Session 1
Guannan Lai ⋅ Yujie Li ⋅ Xiangkun Wang ⋅ Junbo Zhang ⋅ Tianrui Li ⋅ Xin Yang
ExHall D Poster #452
Low-Biased General Annotated Dataset Generation Poster Session 5
Dengyang Jiang ⋅ Haoyu Wang ⋅ Lei Zhang ⋅ Wei Wei ⋅ Guang Dai ⋅ Mengmeng Wang ⋅ Jingdong Wang ⋅ Yanning Zhang
ExHall D Poster #389
Deterministic Image-to-Image Translation via Denoising Brownian Bridge Models with Dual Approximators Poster Session 6
Bohan Xiao ⋅ PEIYONG WANG ⋅ Qisheng He ⋅ Ming Dong
ExHall D Poster #197
ADD: Attribution-Driven Data Augmentation Framework for Boosting Image Super-Resolution Poster Session 5
Zeyu Mi ⋅ Yu-Bin Yang
ExHall D Poster #194
BG-Triangle: Bézier Gaussian Triangle for 3D Vectorization and Rendering Poster Session 4
Minye Wu ⋅ Haizhao Dai ⋅ Kaixin Yao ⋅ Jingyi Yu ⋅ Tinne Tuytelaars
ExHall D Poster #33
Multi-subject Open-set Personalization in Video Generation Poster Session 2
Tsai-Shien Chen ⋅ Aliaksandr Siarohin ⋅ Willi Menapace ⋅ Yuwei Fang ⋅ Kwot Sin Lee ⋅ Ivan Skorokhodov ⋅ Kfir Aberman ⋅ Jun-Yan Zhu ⋅ Ming-Hsuan Yang ⋅ Sergey Tulyakov
ExHall D Poster #63
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model Poster Session 3
Ryugo Morita ⋅ Stanislav Frolov ⋅ Brian Bernhard Moser ⋅ Takahiro Shirakawa ⋅ Ko Watanabe ⋅ Andreas Dengel ⋅ Jinjia Zhou
ExHall D Poster #227
Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation Poster Session 4
Yueru Jia ⋅ Jiaming Liu ⋅ Sixiang Chen ⋅ Chenyang Gu ⋅ Zhilve Wang ⋅ Xiaoqi Li ⋅ Longzan Luo ⋅ Pengwei Wang ⋅ Renrui Zhang ⋅ Zhongyuan Wang ⋅ Shanghang Zhang
ExHall D Poster #149
On the Generalization of Handwritten Text Recognition Models Poster Session 3
Carlos Garrido-Munoz ⋅ Jorge Calvo-Zaragoza
ExHall D Poster #443
From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting Poster Session 6
Zhiwei Huang ⋅ Hailin Yu ⋅ Yichun Shentu ⋅ Jin Yuan ⋅ Guofeng Zhang
ExHall D Poster #87
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos Poster Session 2
Xinhao Liu ⋅ Jintong Li ⋅ Yicheng Jiang ⋅ Niranjan Sujay ⋅ Zhicheng Yang ⋅ Juexiao Zhang ⋅ John Abanes ⋅ Jing Zhang ⋅ Chen Feng
ExHall D Poster #144
A Simple yet Effective Layout Token in Large Language Models for Document Understanding Poster Session 3
Zhaoqing Zhu ⋅ Chuwei Luo ⋅ Zirui Shao ⋅ Feiyu Gao ⋅ Hangdi Xing ⋅ Qi Zheng ⋅ Ji Zhang
ExHall D Poster #365
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Poster Session 4
Jingfeng Yao ⋅ Bin Yang ⋅ Xinggang Wang
ExHall D Poster #371
Attention IoU: Examining Biases in CelebA using Attention Maps Poster Session 1
Aaron Serianni ⋅ Tyler Zhu ⋅ Olga Russakovsky ⋅ Vikram V. Ramaswamy
ExHall D Poster #405
Segment Any Motion in Videos Poster Session 1
Nan Huang ⋅ Wenzhao Zheng ⋅ Chenfeng Xu ⋅ Kurt Keutzer ⋅ Shanghang Zhang ⋅ Angjoo Kanazawa ⋅ Qianqian Wang
ExHall D Poster #309
HandOS: 3D Hand Reconstruction in One Stage Poster Session 4
Xingyu Chen ⋅ Zhuheng Song ⋅ Xiaoke Jiang ⋅ Yaoqing Hu ⋅ Junzhi Yu ⋅ Lei Zhang
ExHall D Poster #142
Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding Poster Session 1
Wenbo Chen ⋅ Zhen Xu ⋅ Ruotao Xu ⋅ Si Wu ⋅ Hau San Wong
ExHall D Poster #358
All-Day Multi-Camera Multi-Target Tracking Poster Session 4
Huijie Fan ⋅ Yu Qiao ⋅ Yihao Zhen ⋅ Tinghui Zhao ⋅ Baojie Fan ⋅ Qiang Wang
ExHall D Poster #103
Blurred LiDAR for Sharper 3D: Robust Handheld 3D Scanning with Diffuse LiDAR and RGB Poster Session 6
Nikhil Behari ⋅ Aaron Young ⋅ Siddharth Somasundaram ⋅ Tzofi Klinghoffer ⋅ Akshat Dave ⋅ Ramesh Raskar
ExHall D Poster #77
TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification Poster Session 5
Dongyoon Yang ⋅ Jihu Lee ⋅ Yongdai Kim
ExHall D Poster #454
GASP: Gaussian Avatars with Synthetic Priors Poster Session 1
Jack Saunders ⋅ Charlie Hewitt ⋅ Yanan Jian ⋅ Marek Kowalski ⋅ Tadas Baltrusaitis ⋅ Yiye Chen ⋅ Darren Cosker ⋅ Virginia Estellers ⋅ Nicholas Gydé ⋅ Vinay P. Namboodiri ⋅ Benjamin E Lundell
ExHall D Poster #10
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset Poster Session 6
Xiao Wang ⋅ Yu Jin ⋅ Wentao Wu ⋅ Wei Zhang ⋅ Lin Zhu ⋅ Bo Jiang ⋅ Yonghong Tian
ExHall D Poster #303
Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly Detection Poster Session 2
Wei Luo ⋅ Yunkang Cao ⋅ Haiming Yao ⋅ Xiaotian Zhang ⋅ Jianan Lou ⋅ Yuqi Cheng ⋅ Weiming Shen ⋅ Wenyong Yu
ExHall D Poster #438
Augmenting Perceptual Super-Resolution via Image Quality Predictors Poster Session 1
Fengjia Zhang ⋅ Samrudhdhi Rangrej ⋅ Tristan T Aumentado-Armstrong ⋅ Afsaneh Fazly ⋅ Alex Levinshtein
ExHall D Poster #202
TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting Poster Session 2
Liangbin Xie ⋅ Daniil Pakhomov ⋅ Zhonghao Wang ⋅ Zongze Wu ⋅ Ziyan Chen ⋅ Yuqian Zhou ⋅ Haitian Zheng ⋅ Zhifei Zhang ⋅ Zhe Lin ⋅ Jiantao Zhou ⋅ Chao Dong
ExHall D Poster #214
Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic Poster Session 1
Jianwei Tang ⋅ Hong Yang ⋅ Tengyue Chen ⋅ Jian-Fang Hu
ExHall D Poster #159
ViKIENet: Towards Efficient 3D Object Detection with Virtual Key Instance Enhanced Network Poster Session 3
Zhuochen Yu ⋅ Bijie Qiu ⋅ Andy W. H. Khong
ExHall D Poster #116
Feature Selection for Latent Factor Models Poster Session 6
Rittwika Kansabanik ⋅ Adrian Barbu
ExHall D Poster #440
MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration Poster Session 2
Boyun Li ⋅ Haiyu Zhao ⋅ Wenxin Wang ⋅ Peng Hu ⋅ Yuanbiao Gou ⋅ Xi Peng
ExHall D Poster #203
Few-shot Implicit Function Generation via Equivariance Poster Session 4
Suizhi Huang ⋅ Xingyi Yang ⋅ Hongtao Lu ⋅ Xinchao Wang
ExHall D Poster #39
Multi-Modal Synergistic Implicit Image Enhancement for Efficient Optical Flow Estimation Poster Session 1
Weichen Dai ⋅ wu hexing ⋅ xiaoyang weng ⋅ Yuxin Zheng ⋅ Yuhang Ming ⋅ Wanzeng Kong
ExHall D Poster #188
Test-Time Backdoor Detection for Object Detection Models Poster Session 5
Hangtao Zhang ⋅ Yichen Wang ⋅ Shihui Yan ⋅ Chenyu Zhu ⋅ Ziqi Zhou ⋅ Linshan Hou ⋅ Shengshan Hu ⋅ Minghui Li ⋅ Yanjun Zhang ⋅ Leo Yu Zhang
ExHall D Poster #320
Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval Poster Session 6
Boseung Jeong ⋅ Jicheol Park ⋅ Sungyeon Kim ⋅ Suha Kwak
ExHall D Poster #263
ProtoDepth: Unsupervised Continual Depth Completion with Prototypes Poster Session 2
Patrick Rim ⋅ Hyoungseob Park ⋅ Suchisrit Gangopadhyay ⋅ Ziyao Zeng ⋅ Younjoon Chung ⋅ Alex Wong
ExHall D Poster #85
Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation Poster Session 3
Hadi Alzayer ⋅ Philipp Henzler ⋅ Jonathan T. Barron ⋅ Jia-Bin Huang ⋅ Pratul P. Srinivasan ⋅ Dor Verbin
ExHall D Poster #26
TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation Poster Session 6
Hongxiang Zhao ⋅ Xingchen Liu ⋅ Mutian Xu ⋅ Yiming Hao ⋅ Weikai Chen ⋅ Xiaoguang Han
ExHall D Poster #145
NoT: Federated Unlearning via Weight Negation Poster Session 5
Yasser Khalil ⋅ Leo Maxime Brunswic ⋅ Soufiane Lamghari ⋅ Xu Li ⋅ Mahdi Beitollahi ⋅ Xi Chen
ExHall D Poster #452
ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions Poster Session 1
Jeonghwan Kim ⋅ Jisoo Kim ⋅ Jeonghyeon Na ⋅ Hanbyul Joo
ExHall D Poster #153
Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds Poster Session 1
Eitan Shaar ⋅ Ariel Shaulov ⋅ Gal Chechik ⋅ Lior Wolf
ExHall D Poster #285
DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction Poster Session 3
Miaowei Wang ⋅ Yibo Zhang ⋅ Rui Ma ⋅ Weiwei Xu ⋅ Changqing Zou ⋅ Daniel Morris
ExHall D Poster #67
OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation Poster Session 2
Hui Li ⋅ Mingwang Xu ⋅ Qingkun Su ⋅ Shan Mu ⋅ Li jiaye ⋅ Kaihui Cheng ⋅ Chen Yuxuan ⋅ Tan Chen ⋅ Mao Ye ⋅ Jingdong Wang ⋅ Siyu Zhu
ExHall D Poster #228
Classifier-Free Guidance Inside the Attraction Basin May Cause Memorization Poster Session 3
Anubhav Jain ⋅ Yuya Kobayashi ⋅ Takashi Shibuya ⋅ Yuhta Takida ⋅ Nasir Memon ⋅ Julian Togelius ⋅ Yuki Mitsufuji
ExHall D Poster #212
Vid2Avatar-Pro: Authentic Avatar from Videos in the Wild via Universal Prior Poster Session 2
Chen Guo ⋅ Junxuan Li ⋅ Yash Kant ⋅ Yaser Sheikh ⋅ Shunsuke Saito ⋅ Chen Cao
ExHall D Poster #11
RANGE: Retrieval Augmented Neural Fields for Multi-Resolution Geo-Embeddings Poster Session 5
Aayush Dhakal ⋅ Srikumar Sastry ⋅ Subash Khanal ⋅ Adeel Ahmad ⋅ Eric Xing ⋅ Nathan Jacobs
ExHall D Poster #349
SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction Poster Session 6
Zhengyuan Li ⋅ Kai Cheng ⋅ Anindita Ghosh ⋅ Uttaran Bhattacharya ⋅ Liangyan Gui ⋅ Aniket Bera
ExHall D Poster #158
SerialGen: Personalized Image Generation by First Standardization Then Personalization Poster Session 1
Cong Xie ⋅ Han Zou ⋅ Ruiqi Yu ⋅ Yan Zhang ⋅ Zhan Zhenpeng
ExHall D Poster #257
SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing Poster Session 6
Xueting Li ⋅ Ye Yuan ⋅ Shalini De Mello ⋅ Miles Macklin ⋅ Jonathan Leaf ⋅ Gilles Daviet ⋅ Jan Kautz ⋅ Umar Iqbal
ExHall D Poster #13
From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning Poster Session 6
Ziang Li ⋅ Hongguang Zhang ⋅ Juan Wang ⋅ Meihui Chen ⋅ Hongxin Hu ⋅ Wenzhe Yi ⋅ Xiaoyang Xu ⋅ Mengda Yang ⋅ Chenjun Ma
ExHall D Poster #300
Matrix3D: Large Photogrammetry Model All-in-One Poster Session 3
Yuanxun Lu ⋅ Jingyang Zhang ⋅ Tian Fang ⋅ Jean-Daniel Nahmias ⋅ Yanghai Tsin ⋅ Long Quan ⋅ Xun Cao ⋅ Yao Yao ⋅ Shiwei Li
ExHall D Poster #57
Proximal Algorithm Unrolling: Flexible and Efficient Reconstruction Networks for Single-Pixel Imaging Poster Session 1
Ping Wang ⋅ Lishun Wang ⋅ Gang Qu ⋅ Xiaodong Wang ⋅ Yulun Zhang ⋅ Xin Yuan
ExHall D Poster #23
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera Poster Session 6
Yuliang Guo ⋅ Sparsh Garg ⋅ S. Mahdi H. Miangoleh ⋅ Xinyu Huang ⋅ Liu Ren
ExHall D Poster #81
Image Quality Assessment: From Human to Machine Preference Poster Session 2
Chunyi Li ⋅ Yuan Tian ⋅ Xiaoyue Ling ⋅ Zicheng Zhang ⋅ Haodong Duan ⋅ Haoning Wu ⋅ Ziheng Jia ⋅ Xiaohong Liu ⋅ Xiongkuo Min ⋅ Guo Lu ⋅ Weisi Lin ⋅ Guangtao Zhai
ExHall D Poster #210
Panorama Generation From NFoV Image Done Right Poster Session 5
Dian Zheng ⋅ Cheng Zhang ⋅ Xiao-Ming Wu ⋅ Cao Li ⋅ Chengfei Lv ⋅ Jian-Fang Hu ⋅ Wei-Shi Zheng
ExHall D Poster #53
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition Poster Session 4
Fei Xie ⋅ Jiahao Nie ⋅ Yujin Tang ⋅ Wenkang Zhang ⋅ Hongshen Zhao
ExHall D Poster #412
Robust Message Embedding via Attention Flow-Based Steganography Poster Session 3
Huayuan Ye ⋅ Shenzhuo Zhang ⋅ Shiqi Jiang ⋅ Jing Liao ⋅ Shuhang Gu ⋅ Dejun Zheng ⋅ Changbo Wang ⋅ Chenhui Li
ExHall D Poster #209
Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models Poster Session 4
Zhenguang Liu ⋅ Chao Shuai ⋅ Shaojing Fan ⋅ Ziping Dong ⋅ Jinwu Hu ⋅ Zhongjie Ba ⋅ Kui Ren
ExHall D Poster #274
Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels Poster Session 1
Qiming Xia ⋅ Wenkai Lin ⋅ Haoen Xiang ⋅ Xun Huang ⋅ Siheng Chen ⋅ Zhen Dong ⋅ Cheng Wang ⋅ Chenglu Wen
ExHall D Poster #116
DeepLA-Net: Very Deep Local Aggregation Networks for Point Cloud Analysis Poster Session 1
Ziyin Zeng ⋅ Mingyue Dong ⋅ Jian Zhou ⋅ Huan Qiu ⋅ Zhen Dong ⋅ Man Luo ⋅ Bijun Li
ExHall D Poster #108
AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering Poster Session 2
Jing Wang ⋅ Songhe Feng ⋅ Kristoffer Knutsen Wickstrøm ⋅ Michael C. Kampffmeyer
ExHall D Poster #468
UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References Poster Session 1
Ming-Feng Li ⋅ Xin Yang ⋅ Fu-En Wang ⋅ Hritam Basak ⋅ Yuyin Sun ⋅ Shreekant Gayaka ⋅ Min Sun ⋅ Cheng-Hao Kuo
ExHall D Poster #94
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval Poster Session 5
Yuanmin Tang ⋅ Jing Yu ⋅ Keke Gai ⋅ Jiamin Zhuang ⋅ Gang Xiong ⋅ Gaopeng Gou ⋅ Qi Wu
ExHall D Poster #359
Binarized Mamba-Transformer for Lightweight Quad Bayer HybridEVS Demosaicing Poster Session 2
Shiyang Zhou ⋅ Haijin Zeng ⋅ Yunfan Lu ⋅ Tong Shao ⋅ Ke Tang ⋅ Yongyong Chen ⋅ Jie Liu ⋅ Jingyong Su
ExHall D Poster #329
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Poster Session 6
Jianzong Wu ⋅ Chao Tang ⋅ Jingbo Wang ⋅ Yanhong Zeng ⋅ Xiangtai Li ⋅ Yunhai Tong
ExHall D Poster #240
PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset Poster Session 4
Jiazhen Liu ⋅ Yuhan Fu ⋅ Ruobing Xie ⋅ Runquan Xie ⋅ Xingwu Sun ⋅ Fengzong Lian ⋅ Zhanhui Kang ⋅ Xirong Li
ExHall D Poster #386
ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate Poster Session 3
Ming Yan ⋅ Xincheng Lin ⋅ Yuhua Luo ⋅ Shuqi Fan ⋅ Yudi Dai ⋅ Qixin Zhong ⋅ Lincai Zhong ⋅ Yuexin Ma ⋅ Lan Xu ⋅ Chenglu Wen ⋅ Siqi Shen ⋅ Cheng Wang
ExHall D Poster #159
Estimating Body and Hand Motion in an Ego‑sensed World Poster Session 2
Brent Yi ⋅ Vickie Ye ⋅ Maya Zheng ⋅ Yunqi Li ⋅ Lea Müller ⋅ Georgios Pavlakos ⋅ Yi Ma ⋅ Jitendra Malik ⋅ Angjoo Kanazawa
ExHall D Poster #164
A Bias-Free Training Paradigm for More General AI-generated Image Detection Poster Session 4
Fabrizio Guillaro ⋅ Giada Zingarini ⋅ Ben Usman ⋅ Avneesh Sud ⋅ Davide Cozzolino ⋅ Luisa Verdoliva
ExHall D Poster #277
Evaluating Vision-Language Models as Evaluators in Path Planning Poster Session 2
Mohamed Aghzal ⋅ Xiang Yue ⋅ Erion Plaku ⋅ Ziyu Yao
ExHall D Poster #145
Transformers without Normalization Poster Session 3
Jiachen Zhu ⋅ Xinlei Chen ⋅ Kaiming He ⋅ Yann LeCun ⋅ Zhuang Liu
ExHall D Poster #406
SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection Poster Session 1
Xin Lin ⋅ Chong Shi ⋅ Zuopeng Yang ⋅ Haojin Tang ⋅ Zhili Zhou
ExHall D Poster #419
Certified Human Trajectory Prediction Poster Session 3
Mohammadhossein Bahari ⋅ Saeed Saadatnejad ⋅ Amirhossein Askari Farsangi ⋅ Seyed-Mohsen Moosavi-Dezfooli ⋅ Alex Alahi
ExHall D Poster #158
Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding Poster Session 1
Tianyu Chen ⋅ Xingcheng Fu ⋅ Yisen Gao ⋅ Haodong Qian ⋅ Yuecen Wei ⋅ Kun Yan ⋅ Haoyi Zhou ⋅ Jianxin Li
ExHall D Poster #376
DFM: Differentiable Feature Matching for Anomaly Detection Poster Session 3
Wu Sheng ⋅ Yimi Wang ⋅ Xudong Liu ⋅ Yuguang Yang ⋅ Runqi Wang ⋅ Guodong Guo ⋅ David Doermann ⋅ Baochang Zhang
ExHall D Poster #438
PointSR: Self-Regularized Point Supervision for Drone-View Object Detection Poster Session 3
Weizhuo Li ⋅ Yue Xi ⋅ Wenjing Jia ⋅ Zehao Zhang ⋅ Fei Li ⋅ Xiangzeng Liu ⋅ Qiguang Miao
ExHall D Poster #102
MVDoppler-Pose: Multi-Modal Multi-View mmWave Sensing for Long-Distance Self-Occluded Human Walking Pose Estimation Poster Session 6
Jae-Ho Choi ⋅ Soheil Hor ⋅ Shubo Yang ⋅ Amin Arbabian
ExHall D Poster #151
Gain from Neighbors: Boosting Model Robustness in the Wild via Adversarial Perturbations Toward Neighboring Classes Poster Session 5
Zhou Yang ⋅ Mingtao Feng ⋅ Tao Huang ⋅ Fangfang Wu ⋅ Weisheng Dong ⋅ Xin Li ⋅ Guangming Shi
ExHall D Poster #426
De^2Gaze: Deformable and Decoupled Representation Learning for 3D Gaze Estimation Poster Session 1
Yunfeng Xiao ⋅ Xiaowei Bai ⋅ Baojun Chen ⋅ Hao Su ⋅ Hao He ⋅ Liang Xie ⋅ Erwei Yin
ExHall D Poster #280
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation Poster Session 6
Zixuan Chen ⋅ Jiaxin Li ⋅ Junxuan Liang ⋅ Liming Tan ⋅ Yejie Guo ⋅ Cewu Lu ⋅ Yonglu Li
ExHall D Poster #291
Language-Guided Image Tokenization for Generation Poster Session 4
Kaiwen Zha ⋅ Lijun Yu ⋅ Alireza Fathi ⋅ David A. Ross ⋅ Cordelia Schmid ⋅ Dina Katabi ⋅ Xiuye Gu
ExHall D Poster #252
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models Poster Session 2
Qirui Jiao ⋅ Daoyuan Chen ⋅ Yilun Huang ⋅ Bolin Ding ⋅ Yaliang Li ⋅ Ying Shen
ExHall D Poster #374
CocoER: Aligning Multi-Level Feature by Competition and Coordination for Emotion Recognition Poster Session 6
Xuli Shen ⋅ Hua Cai ⋅ Weilin Shen ⋅ Qing Xu ⋅ Dingding Yu ⋅ Weifeng Ge ⋅ Xiangyang Xue
ExHall D Poster #328
DnLUT: Ultra-Efficient Color Image Denoising via Channel-Aware Lookup Tables Poster Session 2
Sidi Yang ⋅ Binxiao Huang ⋅ Yulun Zhang ⋅ Dahai Yu ⋅ Yujiu Yang ⋅ Ngai Wong
ExHall D Poster #211
EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis Poster Session 3
Sheng Miao ⋅ Jiaxin Huang ⋅ Dongfeng Bai ⋅ Xu Yan ⋅ Hongyu Zhou ⋅ Yue Wang ⋅ Bingbing Liu ⋅ Andreas Geiger ⋅ Yiyi Liao
ExHall D Poster #60
Incomplete Multi-View Multi-label Learning via Disentangled Representation and Label Semantic Embedding Poster Session 6
Xu Yan ⋅ Jun Yin ⋅ Jie Wen
ExHall D Poster #438
EquiPose: Exploiting Permutation Equivariance for Relative Camera Pose Estimation Poster Session 1
Yuzhen Liu ⋅ Qiulei Dong
ExHall D Poster #89
PDFactor: Learning Tri-Perspective View Policy Diffusion Field for Multi-Task Robotic Manipulation Poster Session 4
Jingyi Tian ⋅ Le Wang ⋅ Sanping Zhou ⋅ Sen Wang ⋅ lijiayi ⋅ Haowen Sun ⋅ Wei Tang
ExHall D Poster #144
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild Poster Session 3
Rolandos Alexandros Potamias ⋅ Jinglei Zhang ⋅ Jiankang Deng ⋅ Stefanos Zafeiriou
ExHall D Poster #153
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos Poster Session 1
Ziyang Wang ⋅ Shoubin Yu ⋅ Elias Stengel-Eskin ⋅ Jaehong Yoon ⋅ Feng Cheng ⋅ Gedas Bertasius ⋅ Mohit Bansal
ExHall D Poster #297
Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation Poster Session 1
Qiao Yu ⋅ Xianzhi Li ⋅ Yuan Tang ⋅ Xu Han ⋅ Long Hu ⋅ Yixue Hao ⋅ Min Chen
ExHall D Poster #40
Dense Dispersed Structured Light for Hyperspectral 3D Imaging of Dynamic Scenes Poster Session 4
Suhyun Shin ⋅ Seungwoo Yoon ⋅ Ryota Maeda ⋅ Seung-Hwan Baek
ExHall D Poster #72
Language-Guided Audio-Visual Learning for Long-Term Sports Assessment Poster Session 5
Huangbiao Xu ⋅ Xiao Ke ⋅ Huanqi Wu ⋅ Rui Xu ⋅ Yuezhou Li ⋅ Wenzhong Guo
ExHall D Poster #282
PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes Poster Session 1
Bin Tan ⋅ Rui Yu ⋅ Yujun Shen ⋅ Nan Xue
ExHall D Poster #95
Boltzmann Attention Sampling for Image Analysis with Small Objects Poster Session 5
Theodore Zhao ⋅ Sid Kiblawi ⋅ Mu Wei ⋅ Ho Hin Lee ⋅ J. Samuel Preston ⋅ Naoto Usuyama ⋅ Hoifung Poon
ExHall D Poster #472
Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise Poster Session 6
Brayan Monroy ⋅ Jorge Bacca ⋅ Julián Tachella
ExHall D Poster #190
Dynamic Motion Blending for Versatile Motion Editing Poster Session 5
Nan Jiang ⋅ Hongjie Li ⋅ Ziye Yuan ⋅ Zimo He ⋅ Yixin Chen ⋅ Tengyu Liu ⋅ Yixin Zhu ⋅ Siyuan Huang
ExHall D Poster #159
Open Set Label Shift with Test Time Out-of-Distribution Reference Poster Session 6
Changkun Ye ⋅ Russell Tsuchida ⋅ Lars Petersson ⋅ Nick Barnes
ExHall D Poster #428
Action Detail Matters: Refining Video Recognition with Local Action Queries Poster Session 4
Mengmeng Wang ⋅ Zeyi Huang ⋅ Xiangjie Kong ⋅ Guojiang Shen ⋅ Guang Dai ⋅ Jingdong Wang ⋅ Yong Liu
ExHall D Poster #318
StdGEN: Semantic-Decomposed 3D Character Generation from Single Images Poster Session 6
Yuze He ⋅ Yanning Zhou ⋅ Wang Zhao ⋅ Zhongkai Wu ⋅ Kaiwen Xiao ⋅ Yang Wei ⋅ Yong-Jin Liu ⋅ Xiao Han
ExHall D Poster #15
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving Poster Session 3
Bencheng Liao ⋅ Shaoyu Chen ⋅ Haoran Yin ⋅ Bo Jiang ⋅ Cheng Wang ⋅ Sixu Yan ⋅ Xinbang Zhang ⋅ Xiangyu Li ⋅ Ying Zhang ⋅ Qian Zhang ⋅ Xinggang Wang
ExHall D Poster #134
Maintaining Consistent Inter-Class Topology in Continual Test-Time Adaptation Poster Session 3
Chenggong Ni ⋅ Fan Lyu ⋅ Jiayao Tan ⋅ Fuyuan Hu ⋅ Rui Yao ⋅ Tao Zhou
ExHall D Poster #447
UniVAD: A Training-free Unified Model for Few-shot Visual Anomaly Detection Poster Session 3
Zhaopeng Gu ⋅ Bingke Zhu ⋅ Guibo Zhu ⋅ Yingying Chen ⋅ Ming Tang ⋅ Jinqiao Wang
ExHall D Poster #435
FFR: Frequency Feature Rectification for Weakly Supervised Semantic Segmentation Poster Session 6
Ziqian Yang ⋅ Xinqiao Zhao ⋅ Xiaolei Wang ⋅ Quan Zhang ⋅ Jimin Xiao
ExHall D Poster #394
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding Poster Session 6
Yan Shu ⋅ Zheng Liu ⋅ Peitian Zhang ⋅ Minghao Qin ⋅ Junjie Zhou ⋅ Zhengyang Liang ⋅ Tiejun Huang ⋅ Bo Zhao
ExHall D Poster #339
Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions Poster Session 4
Boran Wen ⋅ Dingbang Huang ⋅ Zichen Zhang ⋅ Jiahong Zhou ⋅ Jianbin Deng ⋅ Jingyu Gong ⋅ Yulong Chen ⋅ Lizhuang Ma ⋅ Yonglu Li
ExHall D Poster #156
Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy Poster Session 6
Joonhyun Jeong ⋅ Seyun Bae ⋅ Yeonsung Jung ⋅ Jaeryong Hwang ⋅ Eunho Yang
ExHall D Poster #362
Sonata: Self-Supervised Learning of Reliable Point Representations Poster Session 5
Xiaoyang Wu ⋅ Daniel DeTone ⋅ Duncan Frost ⋅ TIANWEI SHEN ⋅ Chris Xie ⋅ Nan Yang ⋅ Jakob Engel ⋅ Richard Newcombe ⋅ Hengshuang Zhao ⋅ Julian Straub
ExHall D Poster #109
COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation Poster Session 2
Fanding Huang ⋅ Jingyan Jiang ⋅ Qinting Jiang ⋅ Li Hebei ⋅ Faisal Nadeem Khan ⋅ Zhi Wang
ExHall D Poster #419
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation Poster Session 6
Hongbin Lin ⋅ Zilu Guo ⋅ Yifan Zhang ⋅ Shuaicheng Niu ⋅ Yafeng Li ⋅ Ruimao Zhang ⋅ Shuguang Cui ⋅ Zhen Li
ExHall D Poster #128
Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis Poster Session 3
Zixuan Wang ⋅ DUO PENG ⋅ Feng Chen ⋅ Yuwei Yang ⋅ Yinjie Lei
ExHall D Poster #237
h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform Poster Session 6
Toan Nguyen ⋅ Kien Do ⋅ Duc Kieu ⋅ Thin Nguyen
ExHall D Poster #222
VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing Poster Session 5
Juan Luis Gonzalez Bello ⋅ Xu Yao ⋅ Alex Whelan ⋅ Kyle Olszewski ⋅ Hyeongwoo Kim ⋅ Pablo Garrido
ExHall D Poster #175
Online Video Understanding: OVBench and VideoChat-Online Poster Session 1
Zhenpeng Huang ⋅ Xinhao Li ⋅ Jiaqi Li ⋅ Jing Wang ⋅ Xiangyu Zeng ⋅ Cheng Liang ⋅ Tao Wu ⋅ Xi Chen ⋅ Liang Li ⋅ Limin Wang
ExHall D Poster #302
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting Poster Session 5
Yifei Qian ⋅ Zhongliang Guo ⋅ Bowen Deng ⋅ Chun Tong Lei ⋅ Shuai Zhao ⋅ Chun Pong Lau ⋅ Xiaopeng Hong ⋅ Michael Pound
ExHall D Poster #410
TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning Poster Session 3
Seungmin Baek ⋅ Soyul Lee ⋅ Hayeon Jo ⋅ Hyesong Choi ⋅ Dongbo Min
ExHall D Poster #402
A Unified Approach to Interpreting Self-supervised Pre-training Methods for 3D Point Clouds via Interactions Poster Session 6
Qiang Li ⋅ Jian Ruan ⋅ Fanghao Wu ⋅ Yuchi Chen ⋅ Zhihua Wei ⋅ Wen Shen
ExHall D Poster #111
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation Poster Session 5
Aishik Konwer ⋅ Zhijian Yang ⋅ Erhan Bas ⋅ Cao Xiao ⋅ Prateek Prasanna ⋅ Parminder Bhatia ⋅ Taha Kass-Hout
ExHall D Poster #456
MERGE: Multi-faceted Hierarchical Graph-based GNN for Gene Expression Prediction from Whole Slide Histopathology Images Poster Session 3
Aniruddha Ganguly ⋅ Debolina Chatterjee ⋅ Wentao Huang ⋅ Jie Zhang ⋅ Alisa Yurovsky ⋅ Travis Steele Johnson ⋅ Chao Chen
ExHall D Poster #475
D^3-Human: Dynamic Disentangled Digital Human from Monocular Video Poster Session 3
Honghu Chen ⋅ Bo Peng ⋅ Yunfan Tao ⋅ Juyong Zhang
ExHall D Poster #17
Open Ad-hoc Categorization with Contextualized Feature Learning Poster Session 3
Zilin Wang ⋅ Sangwoo Mo ⋅ Stella X. Yu ⋅ Sima Behpour ⋅ Liu Ren
ExHall D Poster #427
Accurate Differential Operators for Hybrid Neural Fields Poster Session 1
Aditya Chetan ⋅ Guandao Yang ⋅ Zichen Wang ⋅ Steve Marschner ⋅ Bharath Hariharan
ExHall D Poster #34
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution Poster Session 1
Huan Zheng ⋅ Wencheng Han ⋅ Jianbing Shen
ExHall D Poster #73
Simpler Diffusion: 1.5 FID on ImageNet512 with Pixel-space Diffusion Poster Session 4
Emiel Hoogeboom ⋅ Thomas Mensink ⋅ Jonathan Heek ⋅ Kay Lamerigts ⋅ Ruiqi Gao ⋅ Tim Salimans
ExHall D Poster #215
Few-shot Personalized Scanpath Prediction Poster Session 3
Ruoyu Xue ⋅ Jingyi Xu ⋅ Sounak Mondal ⋅ Hieu Le ⋅ Gregory Zelinsky ⋅ Minh Hoai ⋅ Dimitris Samaras
ExHall D Poster #272
EZSR: Event-based Zero-Shot Recognition Poster Session 1
Yan Yang ⋅ Liyuan Pan ⋅ Dongxu Li ⋅ Liu Liu
ExHall D Poster #427
FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation Poster Session 3
Sen Wang ⋅ Le Wang ⋅ Sanping Zhou ⋅ Jingyi Tian ⋅ lijiayi ⋅ Haowen Sun ⋅ Wei Tang
ExHall D Poster #147
MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation Poster Session 2
Zilong Chen ⋅ Yikai Wang ⋅ Wenqiang Sun ⋅ Feng Wang ⋅ Yiwen Chen ⋅ Huaping Liu
ExHall D Poster #37
GIFStream: 4D Gaussian-based Immersive Video with Feature Stream Poster Session 5
Hao Li ⋅ Sicheng Li ⋅ Xiang Gao ⋅ Abudouaihati Batuer ⋅ Lu Yu ⋅ Yiyi Liao
ExHall D Poster #67
Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds Poster Session 5
Mohamed Abdelsamad ⋅ Michael Ulrich ⋅ Claudius Glaeser ⋅ Abhinav Valada
ExHall D Poster #113
Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models Poster Session 5
Zhaoyi Liu ⋅ Huan Zhang
ExHall D Poster #384
Pose-Guided Temporal Enhancement for Robust Low-Resolution Hand Reconstruction Poster Session 5
Kaixin Fan ⋅ Pengfei Ren ⋅ Jingyu Wang ⋅ Haifeng Sun ⋅ Qi Qi ⋅ Zirui Zhuang ⋅ Jianxin Liao
ExHall D Poster #149
ReDiffDet: Rotation-equivariant Diffusion Model for Oriented Object Detection Poster Session 5
Jiaqi Zhao ⋅ Zeyu Ding ⋅ Yong Zhou ⋅ Hancheng Zhu ⋅ Wen-Liang Du ⋅ Rui Yao
ExHall D Poster #325
PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation Poster Session 2
HsiaoYuan Hsu ⋅ Yuxin Peng
ExHall D Poster #262
Rethinking Correspondence-based Category-Level Object Pose Estimation Poster Session 1
Huan Ren ⋅ Wenfei Yang ⋅ Shifeng Zhang ⋅ Tianzhu Zhang
ExHall D Poster #93
Adaptive Part Learning for Fine-Grained Generalized Category Discovery: A Plug-and-Play Enhancement Poster Session 5
Qiyuan Dai ⋅ Hanzhuo Huang ⋅ Yu Wu ⋅ Sibei Yang
ExHall D Poster #420
Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset Poster Session 2
Yiqun Mei ⋅ Mingming He ⋅ Li Ma ⋅ Julien Philip ⋅ Wenqi Xian ⋅ David M George ⋅ Xueming Yu ⋅ Gabriel Dedic ⋅ Ahmet Levent Taşel ⋅ Ning Yu ⋅ Vishal M. Patel ⋅ Paul Debevec
ExHall D Poster #6
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention Poster Session 2
Lianghui Zhu ⋅ Zilong Huang ⋅ Bencheng Liao ⋅ Jun Hao Liew ⋅ Hanshu Yan ⋅ Jiashi Feng ⋅ Xinggang Wang
ExHall D Poster #219
Locally Orderless Images for Optimization in Differentiable Rendering Poster Session 2
Ishit Mehta ⋅ Manmohan Chandraker ⋅ Ravi Ramamoorthi
ExHall D Poster #30
Rethinking Training for De-biasing Text-to-Image Generation: Unlocking the Potential of Stable Diffusion Poster Session 3
Eunji Kim ⋅ Siwon Kim ⋅ Minjun Park ⋅ Rahim Entezari ⋅ Sungroh Yoon
ExHall D Poster #258
FLAIR: VLM with Fine-grained Language-informed Image Representations Poster Session 5
Rui Xiao ⋅ Sanghwan Kim ⋅ Iuliana Georgescu ⋅ Zeynep Akata ⋅ Stephan Alaniz
ExHall D Poster #368
GG-SSMs: Graph-Generating State Space Models Poster Session 6
Nikola Zubic ⋅ Davide Scaramuzza
ExHall D Poster #257
STDD: Spatio-Temporal Dual Diffusion for Video Generation Poster Session 3
Shuaizhen Yao ⋅ Xiaoya Zhang ⋅ Xin Liu ⋅ Mengyi Liu ⋅ Zhen Cui
ExHall D Poster #183
Continuous Adverse Weather Removal via Degradation-Aware Distillation Poster Session 6
Xin Lu ⋅ Jie Xiao ⋅ Yurui Zhu ⋅ Xueyang Fu
ExHall D Poster #185
Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models Poster Session 2
Kartik Thakral ⋅ Tamar Glaser ⋅ Tal Hassner ⋅ Mayank Vatsa ⋅ Richa Singh
ExHall D Poster #358
CryptoFace: End-to-End Encrypted Face Recognition Poster Session 4
Wei Ao ⋅ Vishnu Naresh Boddeti
ExHall D Poster #324
Exploiting Temporal State Space Sharing for Video Semantic Segmentation Poster Session 5
Hesham Syed ⋅ Yun Liu ⋅ Guolei Sun ⋅ Henghui Ding ⋅ Jing Yang ⋅ Ender Konukoglu ⋅ Xue Geng ⋅ Xudong Jiang
ExHall D Poster #305
DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models Poster Session 2
Yongqi Huang ⋅ Peng Ye ⋅ Chenyu Huang ⋅ Jianjian Cao ⋅ Lin Zhang ⋅ Baopu Li ⋅ Gang Yu ⋅ Tao Chen
ExHall D Poster #446
GeoDepth: From Point-to-Depth to Plane-to-Depth Modeling for Self-Supervised Monocular Depth Estimation Poster Session 3
Haifeng Wu ⋅ Shuhang Gu ⋅ Lixin Duan ⋅ Wen Li
ExHall D Poster #84
SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization Poster Session 4
Junchen Yu ⋅ Siyuan Cao ⋅ Runmin Zhang ⋅ Chenghao Zhang ⋅ Zhu Yu ⋅ Shujie Chen ⋅ Bailin Yang ⋅ Hui-Liang Shen
ExHall D Poster #82
High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model Poster Session 5
Yiyang Shen ⋅ Kun Zhou ⋅ He Wang ⋅ Yin Yang ⋅ Tianjia Shao
ExHall D Poster #48
Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild Poster Session 5
Junhyeong Cho ⋅ Kim Youwang ⋅ Hunmin Yang ⋅ Tae-Hyun Oh
ExHall D Poster #164
Adventurer: Optimizing Vision Mamba Architecture Designs for Efficiency Poster Session 6
Feng Wang ⋅ Timing Yang ⋅ Yaodong Yu ⋅ Sucheng Ren ⋅ Guoyizhe Wei ⋅ Angtian Wang ⋅ Wei Shao ⋅ Yuyin Zhou ⋅ Alan L. Yuille ⋅ Cihang Xie
ExHall D Poster #384
DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation Poster Session 4
Xiaoliang Ju ⋅ Hongsheng Li
ExHall D Poster #36
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient Poster Session 5
Zigeng Chen ⋅ Xinyin Ma ⋅ Gongfan Fang ⋅ Xinchao Wang
ExHall D Poster #218
Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration Poster Session 6
Zilong Huang ⋅ Jun He ⋅ Junyan Ye ⋅ Lihan Jiang ⋅ Weijia Li ⋅ Yiping Chen ⋅ Ting Han
ExHall D Poster #55
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters Poster Session 3
Zhiyang Guo ⋅ Jinxu Xiang ⋅ Kai Ma ⋅ Wengang Zhou ⋅ Houqiang Li ⋅ Ran Zhang
ExHall D Poster #12
IterIS: Iterative Inference-Solving Alignment for LoRA Merging Poster Session 1
Hongxu Chen ⋅ Zhen Wang ⋅ Runshi Li ⋅ Bowei Zhu ⋅ Long Chen
ExHall D Poster #446
ACAttack: Adaptive Cross Attacking RGB-T Tracker via Multi-Modal Response Decoupling Poster Session 5
Xinyu Xiang ⋅ Qinglong Yan ⋅ HAO ZHANG ⋅ Jiayi Ma
ExHall D Poster #100
DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos Poster Session 5
Zijia Lu ⋅ ASM Iftekhar ⋅ Gaurav Mittal ⋅ Tianjian Meng ⋅ Xiawei Wang ⋅ Cheng Zhao ⋅ Rohith Kukkala ⋅ Ehsan Elhamifar ⋅ Mei Chen
ExHall D Poster #291
Efficient ANN-Guided Distillation: Aligning Rate-based Features of Spiking Neural Networks through Hybrid Block-wise Replacement Poster Session 2
Shu Yang ⋅ Chengting Yu ⋅ Lei Liu ⋅ Hanzhi Ma ⋅ Aili Wang ⋅ Erping Li
ExHall D Poster #443
Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning Poster Session 4
Xiangtao Zhang ⋅ Sheng Li ⋅ Ao Li ⋅ Yipeng Liu ⋅ Fan Zhang ⋅ Ce Zhu ⋅ Le Zhang
ExHall D Poster #459
SmartEraser: Remove Anything from Images using Masked-Region Guidance Poster Session 5
Longtao Jiang ⋅ Zhendong Wang ⋅ Jianmin Bao ⋅ Wengang Zhou ⋅ Dongdong Chen ⋅ Lei Shi ⋅ Dong Chen ⋅ Houqiang Li
ExHall D Poster #327
Sample- and Parameter-Efficient Auto-Regressive Image Models Poster Session 6
Elad Amrani ⋅ Leonid Karlinsky ⋅ Alex M. Bronstein
ExHall D Poster #380
LOCORE: Image Re-ranking with Long-Context Sequence Modeling Poster Session 2
Zilin Xiao ⋅ Pavel Suma ⋅ Ayush Sachdeva ⋅ Hao-Jen Wang ⋅ Giorgos Kordopatis-Zilos ⋅ Giorgos Tolias ⋅ Vicente Ordonez
ExHall D Poster #401
NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction Poster Session 3
Wenyuan Zhang ⋅ Emily Yue-ting Jia ⋅ Junsheng Zhou ⋅ Baorui Ma ⋅ Kanle Shi ⋅ Yu-Shen Liu ⋅ Zhizhong Han
ExHall D Poster #63
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios Poster Session 6
Kai Wang ⋅ Zekai Li ⋅ Zhi-Qi Cheng ⋅ Samir Khaki ⋅ Ahmad Sajedi ⋅ Ramakrishna Vedantam ⋅ Konstantinos N. Plataniotis ⋅ Alexander G. Hauptmann ⋅ Yang You
ExHall D Poster #412
Visual Representation Learning through Causal Intervention for Controllable Image Editing Poster Session 5
Shanshan Huang ⋅ Haoxuan Li ⋅ Chunyuan Zheng ⋅ Lei Wang ⋅ Guorui Liao ⋅ Zhili Gong ⋅ Huayi Yang ⋅ Li Liu
ExHall D Poster #232
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis Poster Session 6
Bingda Tang ⋅ Sayak Paul ⋅ Boyang Zheng ⋅ Saining Xie
ExHall D Poster #231
Deformable Radial Kernel Splatting Poster Session 5
Yihua Huang ⋅ Mingxian Lin ⋅ Yangtian Sun ⋅ Ziyi Yang ⋅ Xiaoyang Lyu ⋅ Yan-Pei Cao ⋅ Xiaojuan Qi
ExHall D Poster #44
Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection Poster Session 6
Zhen Qu ⋅ Xian Tao ⋅ Xinyi Gong ⋅ ShiChen Qu ⋅ Qiyu Chen ⋅ Zhengtao Zhang ⋅ Xingang Wang ⋅ Guiguang Ding
ExHall D Poster #407
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation Poster Session 3
Chanyoung Kim ⋅ Dayun Ju ⋅ Woojung Han ⋅ Ming-Hsuan Yang ⋅ Seong Jae Hwang
ExHall D Poster #420
Geometry Field Splatting with Gaussian Surfels Poster Session 2
Kaiwen Jiang ⋅ Venkataram Sivaram ⋅ Cheng Peng ⋅ Ravi Ramamoorthi
ExHall D Poster #29
Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos Poster Session 3
Linyi Jin ⋅ Richard Tucker ⋅ Zhengqi Li ⋅ David Fouhey ⋅ Noah Snavely ⋅ Aleksander Holynski
ExHall D Poster #88
PS-EIP: Robust Photometric Stereo Based on Event Interval Profile Poster Session 2
Kazuma Kitazawa ⋅ Takahito Aoto ⋅ Satoshi Ikehata ⋅ Tsuyoshi Takatani
ExHall D Poster #77
GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors Poster Session 1
An Li ⋅ Zhe Zhu ⋅ Mingqiang Wei
ExHall D Poster #106
FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation Poster Session 4
Kefan Chen ⋅ Chaerin Min ⋅ Linguang Zhang ⋅ Shreyas Hampali ⋅ Cem Keskin ⋅ Srinath Sridhar
ExHall D Poster #158
Unified Uncertainty-Aware Diffusion for Multi-Agent Trajectory Modeling Poster Session 5
Guillem Capellera ⋅ Antonio Rubio ⋅ Luis Ferraz ⋅ Antonio Agudo
ExHall D Poster #135
WeGen: A Unified Model for Interactive Multimodal Generation as We Chat Poster Session 5
Zhipeng Huang ⋅ Shaobin Zhuang ⋅ Canmiao Fu ⋅ Binxin Yang ⋅ Ying Zhang ⋅ Chong Sun ⋅ Chen Li ⋅ Yali Wang ⋅ Zhizheng Zhang ⋅ Zheng-Jun Zha
ExHall D Poster #253
HRAvatar: High-Quality and Relightable Gaussian Head Avatar Poster Session 6
Dongbin Zhang ⋅ Yunfei Liu ⋅ Lijian Lin ⋅ Ye Zhu ⋅ Kangjie Chen ⋅ Minghan Qin ⋅ Yu Li ⋅ Haoqian Wang
ExHall D Poster #8
Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis Poster Session 2
Yousef Yeganeh ⋅ Ioannis Charisiadis ⋅ Marta Hasny ⋅ Martin Hartenberger ⋅ Björn Ommer ⋅ Nassir Navab ⋅ Azade Farshad ⋅ Ehsan Adeli
ExHall D Poster #222
Rethinking Spiking Self-Attention Mechanism: Implementing α-XNOR Similarity Calculation in Spiking Transformers Poster Session 2
Yichen Xiao ⋅ Shuai Wang ⋅ Dehao Zhang ⋅ Wenjie Wei ⋅ Yimeng Shan ⋅ Xiaoli Liu ⋅ Yulin Jiang ⋅ Malu Zhang
ExHall D Poster #310
MagicQuill: An Intelligent Interactive Image Editing System Poster Session 3
Zichen Liu ⋅ Yue Yu ⋅ Hao Ouyang ⋅ Qiuyu Wang ⋅ Ka Leong Cheng ⋅ Wen Wang ⋅ Zhiheng Liu ⋅ Qifeng Chen ⋅ Yujun Shen
ExHall D Poster #231
Hunyuan-Portrait: Implicit Condition Control for Enhanced Portrait Animation Poster Session 4
Zunnan Xu ⋅ Zhentao Yu ⋅ Zixiang Zhou ⋅ Jun Zhou ⋅ Xiaoyu Jin ⋅ Fa-Ting Hong ⋅ Xiaozhong Ji ⋅ Junwei Zhu ⋅ Chengfei Cai ⋅ Shiyu Tang ⋅ Qin Lin ⋅ Xiu Li ⋅ Qinglin Lu
ExHall D Poster #5
HeMoRa: Unsupervised Heuristic Consensus Sampling for Robust Point Cloud Registration Poster Session 1
Shaocheng Yan ⋅ Yiming Wang ⋅ Kaiyan Zhao ⋅ Pengcheng Shi ⋅ Zhenjun Zhao ⋅ Yongjun Zhang ⋅ Jiayuan Li
ExHall D Poster #111
Reducing Class-wise Confusion for Incremental Learning with Disentangled Manifolds Poster Session 2
Huitong Chen ⋅ Yu Wang ⋅ Yan Fan ⋅ Guosong Jiang ⋅ Qinghua Hu
ExHall D Poster #452
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces Poster Session 4
Chenyangguang Zhang ⋅ Alexandros Delitzas ⋅ Fangjinhua Wang ⋅ Ruida Zhang ⋅ Xiangyang Ji ⋅ Marc Pollefeys ⋅ Francis Engelmann
ExHall D Poster #343
Boosting Adversarial Transferability through Augmentation in Hypothesis Space Poster Session 4
Yu Guo ⋅ Weiquan Liu ⋅ Qingshan Xu ⋅ Shijun Zheng ⋅ Shujun Huang ⋅ Yu Zang ⋅ Siqi Shen ⋅ Chenglu Wen ⋅ Cheng Wang
ExHall D Poster #322
AniMo: Species-Aware Model for Text-Driven Animal Motion Generation Poster Session 1
Xuan Wang ⋅ Kai Ruan ⋅ Xing Zhang ⋅ Gaoang Wang
ExHall D Poster #163
EditAR: Unified Conditional Generation with Autoregressive Models Poster Session 2
Jiteng Mu ⋅ Nuno Vasconcelos ⋅ Xiaolong Wang
ExHall D Poster #242
Instance-wise Supervision-level Optimization in Active Learning Poster Session 1
Shinnosuke Matsuo ⋅ Riku Togashi ⋅ Ryoma Bise ⋅ Seiichi Uchida ⋅ Masahiro Nomura
ExHall D Poster #456
ViiNeuS: Volumetric Initialization for Implicit Neural Surface Reconstruction of Urban Scenes with Limited Image Overlap Poster Session 3
Hala Djeghim ⋅ Nathan Piasco ⋅ Moussab Bennehar ⋅ Luis Guillermo Roldao Jimenez ⋅ Dzmitry Tsishkou ⋅ Désiré Sidibé
ExHall D Poster #117
Model Diagnosis and Correction via Linguistic and Implicit Attribute Editing Poster Session 3
Xuanbai Chen ⋅ Xiang Xu ⋅ Zhihua Li ⋅ Tianchen Zhao ⋅ Pietro Perona ⋅ Qin ZHANG ⋅ Yifan Xing
ExHall D Poster #347
STAA-SNN: Spatial-Temporal Attention Aggregator for Spiking Neural Networks Poster Session 3
Tianqing Zhang ⋅ Kairong Yu ⋅ Xian Zhong ⋅ Hongwei Wang ⋅ Qi Xu ⋅ Qiang Zhang
ExHall D Poster #316
Knowledge Memorization and Rumination for Pre-trained Model-based Class-Incremental Learning Poster Session 4
Zijian Gao ⋅ Wangwang Jia ⋅ Xingxing Zhang ⋅ Dulan Zhou ⋅ Kele Xu ⋅ Feng Dawei ⋅ Yong Dou ⋅ Xinjun Mao ⋅ Huaimin Wang
ExHall D Poster #449
A Distractor-Aware Memory for Visual Object Tracking with SAM2 Poster Session 5
Alan Lukezic ⋅ Jovana Videnović ⋅ Matej Kristan
ExHall D Poster #309
Activating Sparse Part Concepts for 3D Class Incremental Learning Poster Session 6
Zhenya Tian ⋅ Jun Xiao ⋅ Liu lupeng ⋅ Haiyong Jiang
ExHall D Poster #402
ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding Poster Session 5
Qihang Peng ⋅ Henry Zheng ⋅ Gao Huang
ExHall D Poster #340
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis Poster Session 6
Weiguang Zhao ⋅ Rui Zhang ⋅ Qiufeng Wang ⋅ Guangliang Cheng ⋅ Kaizhu Huang
ExHall D Poster #310
Stable Flow: Vital Layers for Training-Free Image Editing Poster Session 2
Omri Avrahami ⋅ Or Patashnik ⋅ Ohad Fried ⋅ Egor Nemchinov ⋅ Kfir Aberman ⋅ Dani Lischinski ⋅ Daniel Cohen-Or
ExHall D Poster #240
Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval Poster Session 4
Arun Reddy ⋅ Alexander Martin ⋅ Eugene Yang ⋅ Andrew Yates ⋅ Kate Sanders ⋅ Kenton Murray ⋅ Reno Kriz ⋅ Celso M. de Melo ⋅ Benjamin Van Durme ⋅ Rama Chellappa
ExHall D Poster #370
Flexible Group Count Enables Hassle-Free Structured Pruning Poster Session 1
Jiamu Zhang ⋅ Shaochen (Henry) Zhong ⋅ Andrew Ye ⋅ Zirui Liu ⋅ Sebastian Zhao ⋅ Kaixiong Zhou ⋅ Li Li ⋅ Soo-Hyun Choi ⋅ Rui Chen ⋅ Xia Hu ⋅ Shuai Xu ⋅ Vipin Chaudhary
ExHall D Poster #444
Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation Poster Session 1
Nadav Z. Cohen ⋅ Oron Nir ⋅ Ariel Shamir
ExHall D Poster #237
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation Poster Session 2
Antoni Bigata Casademunt ⋅ Michał Stypułkowski ⋅ Rodrigo Mira ⋅ Stella Bounareli ⋅ Konstantinos Vougioukas ⋅ Zoe Landgraf ⋅ Nikita Drobyshev ⋅ Maciej Zieba ⋅ Stavros Petridis ⋅ Maja Pantic
ExHall D Poster #3
Mitigating Ambiguities in 3D Classification with Gaussian Splatting Poster Session 6
Ruiqi Zhang ⋅ Hao Zhu ⋅ Jingyi Zhao ⋅ Qi Zhang ⋅ Xun Cao ⋅ Zhan Ma
ExHall D Poster #107
Exposure-slot: Exposure-centric Representations Learning with Slot-in-Slot Attention for Region-aware Exposure Correction Poster Session 4
Donggoo Jung ⋅ DAEHYUN KIM ⋅ Guanghui Wang ⋅ Tae Hyun Kim
ExHall D Poster #199
CL-LoRA: Continual Low-Rank Adaptation for Rehearsal-Free Class-Incremental Learning Poster Session 6
Jiangpeng He ⋅ Zhihao Duan ⋅ Fengqing Zhu
ExHall D Poster #420
Generative Modeling of Class Probability for Multi-Modal Representation Learning Poster Session 4
JungKyoo Shin ⋅ Bumsoo Kim ⋅ Eunwoo Kim
ExHall D Poster #469
VisionZip: Longer is Better but Not Necessary in Vision Language Models Poster Session 4
Senqiao Yang ⋅ Yukang Chen ⋅ Zhuotao Tian ⋅ Chengyao Wang ⋅ Jingyao Li ⋅ Bei Yu ⋅ Jiaya Jia
ExHall D Poster #380
Simplification Is All You Need against Out-of-Distribution Overconfidence Poster Session 1
Keke Tang ⋅ Chao Hou ⋅ Weilong Peng ⋅ Xiang Fang ⋅ Zhize Wu ⋅ Yongwei Nie ⋅ Wenping Wang ⋅ Zhihong Tian
ExHall D Poster #465
LOD-GS: Achieving Levels of Detail using Scalable Gaussian Soup Poster Session 1
Jianxiong Shen ⋅ Yue Qian ⋅ Xiaohang Zhan
ExHall D Poster #47
VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow Poster Session 4
Yancong Lin ⋅ Shiming Wang ⋅ Liangliang Nan ⋅ Julian F. P. Kooij ⋅ Holger Caesar
ExHall D Poster #128
When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach Poster Session 1
Vaibhav Rathore ⋅ Shubhranil B ⋅ Saikat Dutta ⋅ Sarthak Mehrotra ⋅ Zsolt Kira ⋅ Biplab Banerjee
ExHall D Poster #453
Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis Poster Session 5
Feng Zhou ⋅ Ruiyang Liu ⋅ Chen Liu ⋅ Gaofeng He ⋅ Yonglu Li ⋅ Xiaogang Jin ⋅ Huamin Wang
ExHall D Poster #257
Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation Poster Session 6
Joohyun Kwon ⋅ Hanbyel Cho ⋅ Junmo Kim
ExHall D Poster #68
SocialMOIF: Multi-Order Intention Fusion for Pedestrian Trajectory Prediction Poster Session 5
Kai Chen ⋅ Xiaodong Zhao ⋅ Yujie Huang ⋅ Guoyu Fang ⋅ Xiao Song ⋅ Ruiping Wang ⋅ Ziyuan Wang
ExHall D Poster #134
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields Poster Session 3
Kwan Yun ⋅ Chaelin Kim ⋅ Hangyeul Shin ⋅ Junyong Noh
ExHall D Poster #16
Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observations Poster Session 1
Shengeng Tang ⋅ Jiayi He ⋅ Lechao Cheng ⋅ Jingjing Wu ⋅ Dan Guo ⋅ Richang Hong
ExHall D Poster #316
HistoFS: Non-IID Histopathologic Whole Slide Image Classification via Federated Style Transfer with RoI-Preserving Poster Session 6
Farchan Hakim Raswa ⋅ Chun-Shien Lu ⋅ Jia-Ching Wang
ExHall D Poster #393
Unified Medical Lesion Segmentation via Self-referring Indicator Poster Session 2
Shijie Chang ⋅ Xiaoqi Zhao ⋅ Lihe Zhang ⋅ Tiancheng Wang
ExHall D Poster #480
VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving Poster Session 4
Haiming Zhang ⋅ Wending Zhou ⋅ Shenzhen The Chinese University of Hongkong ⋅ Hong Kong University of Science and Technology ⋅ Huawei Technologies Ltd. ⋅ Huawei Technologies Ltd. ⋅ Huawei Technologies Ltd. ⋅ Huawei Technologies Ltd. ⋅ Huawei Technologies Ltd. ⋅ Shenzhen The Chinese University of Hong Kong
ExHall D Poster #129
SGSST: Scaling Gaussian Splatting Style Transfer Poster Session 6
Bruno Galerne ⋅ Jianling Wang ⋅ Lara Raad ⋅ Jean-Michel Morel
ExHall D Poster #36
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation Poster Session 3
Saksham Singh Kushwaha ⋅ Yapeng Tian
ExHall D Poster #275
Learning Bijective Surface Parameterization for Inferring Signed Distance Functions from Sparse Point Clouds with Grid Deformation Poster Session 5
Takeshi Noda ⋅ Chao Chen ⋅ Junsheng Zhou ⋅ Weiqi Zhang ⋅ Yu-Shen Liu ⋅ Zhizhong Han
ExHall D Poster #104
Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers Poster Session 4
Haoran You ⋅ Connelly Barnes ⋅ Yuqian Zhou ⋅ Yan Kang ⋅ Zhenbang Du ⋅ Wei Zhou ⋅ Lingzhi Zhang ⋅ Yotam Nitzan ⋅ Xiaoyang Liu ⋅ Zhe Lin ⋅ Eli Shechtman ⋅ Sohrab Amirghodsi ⋅ Yingyan (Celine) Lin
ExHall D Poster #216
Zero-shot RGB-D Point Cloud Registration with Pre-trained Large Vision Model Poster Session 4
Haobo Jiang ⋅ Jin Xie ⋅ Jian Yang ⋅ Liang Yu ⋅ Jianmin Zheng
ExHall D Poster #108
Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration Poster Session 6
Jiani Ni ⋅ He Zhao ⋅ Jintong Gao ⋅ Dandan Guo ⋅ Hongyuan Zha
ExHall D Poster #437
DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction Poster Session 6
Junjie Zhou ⋅ Shouju Wang ⋅ Yuxia Tang ⋅ Qi Zhu ⋅ Daoqiang Zhang ⋅ Wei Shao
ExHall D Poster #453
VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models Poster Session 5
Lei Li ⋅ Wei Yuancheng ⋅ Zhihui Xie ⋅ Xuqing Yang ⋅ Yifan Song ⋅ Peiyi Wang ⋅ Chenxin An ⋅ Tianyu Liu ⋅ Sujian Li ⋅ Bill Yuchen Lin ⋅ Lingpeng Kong ⋅ Qi Liu
ExHall D Poster #347
Unveiling Differences in Generative Models: A Scalable Differential Clustering Approach Poster Session 2
Jingwei Zhang ⋅ Mohammad Jalali ⋅ Cheuk Ting Li ⋅ Farzan Farnia
ExHall D Poster #276
U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening Poster Session 5
Sungpyo Kim ⋅ Jeonghyeok Do ⋅ Jaehyup Lee ⋅ Munchurl Kim
ExHall D Poster #191
HERA: Hybrid Explicit Representation for Ultra-Realistic Head Avatars Poster Session 1
Hongrui Cai ⋅ Yuting Xiao ⋅ Xuan Wang ⋅ Jiafei Li ⋅ Yudong Guo ⋅ Yanbo Fan ⋅ Shenghua Gao ⋅ Juyong Zhang
ExHall D Poster #9
Circumventing Shortcuts in Audio-visual Deepfake Detection Datasets with Unsupervised Learning Poster Session 4
Stefan Smeu ⋅ Dragos-Alexandru Boldisor ⋅ Dan Oneata ⋅ Elisabeta Oneata
ExHall D Poster #289
CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification Poster Session 1
Wenlong Yu ⋅ Qilong Wang ⋅ Chuang Liu ⋅ Dong Li ⋅ Qinghua Hu
ExHall D Poster #403
Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input Poster Session 5
Jian Wang ⋅ Rishabh Dabral ⋅ Diogo Luvizon ⋅ Zhe Cao ⋅ Lingjie Liu ⋅ Thabo Beeler ⋅ Christian Theobalt
ExHall D Poster #153
Learning Extremely High Density Crowds as Active Matters Poster Session 1
Feixiang He ⋅ Jiangbei Yue ⋅ Jialin Zhu ⋅ Armin Seyfried ⋅ Dan Casas ⋅ Julien Pettré ⋅ He Wang
ExHall D Poster #35
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning Poster Session 4
Yuncong Yang ⋅ Han Yang ⋅ Jiachen Zhou ⋅ Peihao Chen ⋅ Hongxin Zhang ⋅ Yilun Du ⋅ Chuang Gan
ExHall D Poster #141
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation Poster Session 2
Rang Meng ⋅ Xingyu Zhang ⋅ Yuming Li ⋅ Chenguang Ma
ExHall D Poster #4
Navigation World Models Poster Session 4
Amir Bar ⋅ Gaoyue Zhou ⋅ Danny Tran ⋅ Trevor Darrell ⋅ Yann LeCun
ExHall D Poster #396
Video Motion Transfer with Diffusion Transformers Poster Session 5
Alexander Pondaven ⋅ Aliaksandr Siarohin ⋅ Sergey Tulyakov ⋅ Philip H.S. Torr ⋅ Fabio Pizzati
ExHall D Poster #176
Unified Reconstruction of Static and Dynamic Scenes from Events Poster Session 6
Qiyao Gao ⋅ Peiqi Duan ⋅ Hanyue Lou ⋅ Minggui Teng ⋅ Ziqi Cai ⋅ Xu Chen ⋅ Boxin Shi
ExHall D Poster #166
Automatic Spectral Calibration of Hyperspectral Images: Method, Dataset and Benchmark Poster Session 6
Zhuoran Du ⋅ Shaodi You ⋅ Cheng Cheng ⋅ Shikui Wei
ExHall D Poster #182
Generative Zero-Shot Composed Image Retrieval Poster Session 6
Lan Wang ⋅ Wei Ao ⋅ Vishnu Naresh Boddeti ⋅ Ser-Nam Lim
ExHall D Poster #340
Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation Poster Session 3
Sayak Nag ⋅ Udita Ghosh ⋅ Calvin-Khang Ta ⋅ Sarosij Bose ⋅ Jiachen Li ⋅ Amit K. Roy-Chowdhury
ExHall D Poster #99
Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting Poster Session 6
Wei Lin ⋅ Chenyang Zhao ⋅ Antoni B. Chan
ExHall D Poster #307
Parallel Sequence Modeling via Generalized Spatial Propagation Network Poster Session 1
Hongjun Wang ⋅ Wonmin Byeon ⋅ Jiarui Xu ⋅ Jinwei Gu ⋅ Ka Chun Cheung ⋅ Jan Kautz ⋅ Xiaolong Wang ⋅ Kai Han ⋅ Sifei Liu
ExHall D Poster #413
Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments Poster Session 4
Luke Rowe ⋅ Roger Girgis ⋅ Anthony Gosselin ⋅ Liam Paull ⋅ Christopher Pal ⋅ Felix Heide
ExHall D Poster #133
EMOE: Modality-Specific Enhanced Dynamic Emotion Experts Poster Session 3
Yiyang Fang ⋅ Wenke Huang ⋅ Guancheng Wan ⋅ Kehua Su ⋅ Mang Ye
ExHall D Poster #350
JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration Poster Session 5
Yunlong Lin ⋅ Zixu Lin ⋅ Haoyu Chen ⋅ Panwang Pan ⋅ Chenxin Li ⋅ Sixiang Chen ⋅ Kairun Wen ⋅ Yeying Jin ⋅ Wenbo Li ⋅ Xinghao Ding
ExHall D Poster #126
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting Poster Session 1
Ziyi Wang ⋅ Yanran Zhang ⋅ Jie Zhou ⋅ Jiwen Lu
ExHall D Poster #107
EntityErasure: Erasing Entity Cleanly via Amodal Entity Segmentation and Completion Poster Session 6
Yixing Zhu ⋅ Qing Zhang ⋅ Yitong Wang ⋅ Yongwei Nie ⋅ Wei-Shi Zheng
ExHall D Poster #201
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation Poster Session 3
Junha Lee ⋅ Chunghyun Park ⋅ Jaesung Choe ⋅ Yu-Chiang Frank Wang ⋅ Jan Kautz ⋅ Minsu Cho ⋅ Chris Choy
ExHall D Poster #330
T-CIL: Temperature Scaling using Adversarial Perturbation for Calibration in Class-Incremental Learning Poster Session 3
Seong-Hyeon Hwang ⋅ Minsu Kim ⋅ Steven Euijong Whang
ExHall D Poster #449
Joint Out-of-Distribution Filtering and Data Discovery Active Learning Poster Session 5
Sebastian Schmidt ⋅ Leonard Schenk ⋅ Leo Schwinn ⋅ Stephan Günnemann
ExHall D Poster #444
Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network Poster Session 5
Xingyu Qiu ⋅ Mengying Yang ⋅ Xinghua Ma ⋅ Fanding Li ⋅ Dong Liang ⋅ Gongning Luo ⋅ Wei Wang ⋅ Kuanquan Wang ⋅ Shuo Li
ExHall D Poster #207
CorrBEV: Multi-View 3D Object Detection by Correlation Learning with Multi-modal Prototypes Poster Session 6
Ziteng Xue ⋅ Mingzhe Guo ⋅ Heng Fan ⋅ Shihui Zhang ⋅ Zhipeng Zhang
ExHall D Poster #120
SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding Poster Session 1
Rong Li ⋅ Shijie Li ⋅ Lingdong Kong ⋅ Xulei Yang ⋅ Junwei Liang
ExHall D Poster #337
Linear Attention Modeling for Learned Image Compression Poster Session 2
Donghui Feng ⋅ Zhengxue Cheng ⋅ Shen Wang ⋅ Ronghua Wu ⋅ Hongwei Hu ⋅ Guo Lu ⋅ Li Song
ExHall D Poster #215
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation Poster Session 5
Nicolas Dufour ⋅ Vicky Kalogeiton ⋅ David Picard ⋅ Loic Landrieu
ExHall D Poster #186
Asynchronous Collaborative Graph Representation for Frames and Events Poster Session 1
Dianze Li ⋅ Jianing Li ⋅ Xu Liu ⋅ Xiaopeng Fan ⋅ Yonghong Tian
ExHall D Poster #139
Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs Poster Session 6
Youyi Zhan ⋅ Tianjia Shao ⋅ Yin Yang ⋅ Kun Zhou
ExHall D Poster #9
Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration Poster Session 3
Aocheng Li ⋅ James R. Zimmer-Dauphinee ⋅ Rajesh Kalyanam ⋅ Ian Lindsay ⋅ Parker VanValkenburgh ⋅ Steven Wernke ⋅ Daniel Aliaga
ExHall D Poster #107
Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks Poster Session 3
Peng Xie ⋅ Yequan Bie ⋅ Jianda Mao ⋅ Yangqiu Song ⋅ Yang Wang ⋅ Hao Chen ⋅ Kani Chen
ExHall D Poster #386
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Poster Session 3
Junjie Wang ⋅ Bin Chen ⋅ Yulin Li ⋅ Bin Kang ⋅ Yichi Chen ⋅ Zhuotao Tian
ExHall D Poster #399
SocialGesture: Delving into Multi-person Gesture Understanding Poster Session 4
Xu Cao ⋅ Pranav Virupaksha ⋅ Wenqi Jia ⋅ Bolin Lai ⋅ Fiona Ryan ⋅ Sangmin Lee ⋅ James Rehg
ExHall D Poster #353
The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition Poster Session 2
Otto Brookes ⋅ Maksim Kukushkin ⋅ Majid Mirmehdi ⋅ Colleen Stephens ⋅ Paula Dieguez ⋅ Thurston Cleveland Hicks ⋅ Sorrel CZ Jones ⋅ Kevin C. Lee ⋅ Maureen S. McCarthy ⋅ Amelia C. Meier ⋅ NORMAND E. ⋅ Erin G. Wessling ⋅ Roman M. Wittig ⋅ Kevin Langergraber ⋅ Klaus Zuberbühler ⋅ Lukas Boesch ⋅ Thomas Schmid ⋅ Mimi Arandjelovic ⋅ Hjalmar S. Kühl ⋅ Tilo Burghardt
ExHall D Poster #277
Multi-modal Topology-embedded Graph Learning for Spatially Resolved Genes Prediction from Pathology Images with Prior Gene Similarity Information Poster Session 4
Hang Shi ⋅ Chi Changxi ⋅ Peng Wan ⋅ Daoqiang Zhang ⋅ Wei Shao
ExHall D Poster #476
Towards Autonomous Micromobility through Scalable Urban Simulation Poster Session 6
Wayne Wu ⋅ Honglin He ⋅ Chaoyuan Zhang ⋅ Jack He ⋅ Seth Z. Zhao ⋅ Ran Gong ⋅ Quanyi Li ⋅ Bolei Zhou
ExHall D Poster #133
FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation Poster Session 3
Dong Zhao ⋅ Jinlong Li ⋅ Shuang Wang ⋅ Mengyao Wu ⋅ Qi Zang ⋅ Nicu Sebe ⋅ Zhun Zhong
ExHall D Poster #421
Language-Assisted Debiasing and Smoothing for Foundation Model-Based Semi-Supervised Learning Poster Session 5
Na Zheng ⋅ Xuemeng Song ⋅ Xue Dong ⋅ Aashish Nikhil Ghosh ⋅ Liqiang Nie ⋅ Roger Zimmermann
ExHall D Poster #447
DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection Poster Session 2
Li Li ⋅ Huixian Gong ⋅ Hao Dong ⋅ Tiankai Yang ⋅ Zhengzhong Tu ⋅ Yue Zhao
ExHall D Poster #459
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation Poster Session 2
Rohith Peddi ⋅ Saurabh . ⋅ Ayush Abhay Shrivastava ⋅ Parag Singla ⋅ Vibhav Giridhar Gogate
ExHall D Poster #313
Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synthesis Poster Session 4
Woojung Han ⋅ Yeonkyung Lee ⋅ Chanyoung Kim ⋅ Kwanghyun Park ⋅ Seong Jae Hwang
ExHall D Poster #249
Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attack on Breast Ultrasound Images Poster Session 6
Yasamin Medghalchi ⋅ Moein Heidari ⋅ Clayton Allard ⋅ Leonid Sigal ⋅ Ilker Hacihaliloglu
ExHall D Poster #229
Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need Poster Session 1
Qiang Wang ⋅ Xiang Song ⋅ Yuhang He ⋅ Jizhou Han ⋅ Chenhao Ding ⋅ Xinyuan Gao ⋅ Yihong Gong
ExHall D Poster #447
COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection Poster Session 6
Jinqi Xiao ⋅ Shen Sang ⋅ Tiancheng Zhi ⋅ Jing Liu ⋅ Qing Yan ⋅ Linjie Luo ⋅ Bo Yuan
ExHall D Poster #379
EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image Segmentation Poster Session 2
Md Mostafijur Rahman ⋅ Radu Marculescu
ExHall D Poster #482
ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary Poster Session 1
Zeqi Gu ⋅ Yin Cui ⋅ Max Li ⋅ Fangyin Wei ⋅ Yunhao Ge ⋅ Jinwei Gu ⋅ Ming-Yu Liu ⋅ Abe Davis ⋅ Yifan Ding
ExHall D Poster #261
MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data Poster Session 4
Zifan Wang ⋅ Ziqing Chen ⋅ Junyu Chen ⋅ Jilong Wang ⋅ Yuxin Yang ⋅ Yunze Liu ⋅ Xueyi Liu ⋅ He Wang ⋅ Li Yi
ExHall D Poster #143
Improving Sound Source Localization with Joint Slot Attention on Image and Audio Poster Session 1
Inho Kim ⋅ Youngkil Song ⋅ Jicheol Park ⋅ Won Hwa Kim ⋅ Suha Kwak
ExHall D Poster #283
Feature-Preserving Mesh Decimation for Normal Integration Poster Session 2
Moritz Heep ⋅ Sven Behnke ⋅ Eduard Zell
ExHall D Poster #32
Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body Poster Session 5
Zeqing Wang ⋅ Qingyang Ma ⋅ Wentao Wan ⋅ Haojie Li ⋅ Keze Wang ⋅ Yonghong Tian
ExHall D Poster #17
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation Poster Session 6
Yuhui Zhang ⋅ Yuchang Su ⋅ Yiming Liu ⋅ Xiaohan Wang ⋅ James Burgess ⋅ Elaine Sui ⋅ Chenyu Wang ⋅ Josiah Aklilu ⋅ Alejandro Lozano ⋅ Anjiang Wei ⋅ Ludwig Schmidt ⋅ Serena Yeung
ExHall D Poster #327
DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation Poster Session 1
Zhixuan Liang ⋅ Yao Mu ⋅ Yixiao Wang ⋅ Fei Ni ⋅ Tianxing Chen ⋅ Wenqi Shao ⋅ Wei Zhan ⋅ Masayoshi Tomizuka ⋅ Ping Luo ⋅ Mingyu Ding
ExHall D Poster #147
VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness Poster Session 2
SeungJu Cha ⋅ Kwanyoung Lee ⋅ Ye-Chan Kim ⋅ Hyunwoo Oh ⋅ Dong-Jin Kim
ExHall D Poster #255
ROLL: Robust Noisy Pseudo-label Learning for Multi-View Clustering with Noisy Correspondence Poster Session 6
Yuan Sun ⋅ Yongxiang Li ⋅ Zhenwen Ren ⋅ Guiduo Duan ⋅ Dezhong Peng ⋅ Peng Hu
ExHall D Poster #439
Calibrated Multi-Preference Optimization for Aligning Diffusion Models Poster Session 4
Kyungmin Lee ⋅ Xiaohang Li ⋅ Qifei Wang ⋅ Junfeng He ⋅ Junjie Ke ⋅ Ming-Hsuan Yang ⋅ Irfan Essa ⋅ Jinwoo Shin ⋅ Feng Yang ⋅ Yinxiao Li
ExHall D Poster #257
Learning from Neighbors: Category Extrapolation for Long-Tail Learning Poster Session 6
Shizhen Zhao ⋅ Xin Wen ⋅ Jiahui Liu ⋅ Chuofan Ma ⋅ Chunfeng Yuan ⋅ Xiaojuan Qi
ExHall D Poster #415
Material Anything: Generating Materials for Any 3D Object via Diffusion Poster Session 6
Xin Huang ⋅ Tengfei Wang ⋅ Ziwei Liu ⋅ Qing Wang
ExHall D Poster #38
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization Poster Session 2
Liang Pan ⋅ Zeshi Yang ⋅ Zhiyang Dou ⋅ Wenjia Wang ⋅ Buzhen Huang ⋅ Bo Dai ⋅ Taku Komura ⋅ Jingbo Wang
ExHall D Poster #159
Reversing Flow for Image Restoration Poster Session 2
Haina Qin ⋅ Wenyang Luo ⋅ Bing Li ⋅ Weiming Hu ⋅ Libin Wang ⋅ DanDan Zheng ⋅ Jingdong Chen ⋅ Ming Yang
ExHall D Poster #208
Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking Poster Session 1
Phuc Nguyen ⋅ Minh Luu ⋅ Anh Tran ⋅ Cuong Pham ⋅ Khoi Nguyen
ExHall D Poster #330
FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing Poster Session 1
Yufan Ren ⋅ Zicong Jiang ⋅ Tong Zhang ⋅ Søren Forchhammer ⋅ Sabine Süsstrunk
ExHall D Poster #238
ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object Poster Session 1
Zhe Shan ⋅ Yang Liu ⋅ Lei Zhou ⋅ Cheng Yan ⋅ Heng Wang ⋅ Xia Xie
ExHall D Poster #329
InsightEdit: Towards Better Instruction Following for Image Editing Poster Session 1
Yingjing Xu ⋅ Jie Kong ⋅ Jiazhi Wang ⋅ Xiao Pan ⋅ Bo Lin ⋅ Qiang Liu
ExHall D Poster #242
Open-Canopy: Towards Very High Resolution Forest Monitoring Poster Session 1
Fajwel Fogel ⋅ Yohann Perron ⋅ Nikola Besic ⋅ Laurent Saint-André ⋅ Agnès Pellissier-Tanon ⋅ Thomas Boudras ⋅ Martin Schwartz ⋅ Ibrahim Fayad ⋅ Alexandre d'Aspremont ⋅ Loic Landrieu ⋅ Philippe Ciais
ExHall D Poster #114
The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion Poster Session 2
Changan Chen ⋅ Juze Zhang ⋅ Shrinidhi Kowshika Lakshmikanth ⋅ Yusu Fang ⋅ Ruizhi Shao ⋅ Gordon Wetzstein ⋅ Li Fei-Fei ⋅ Ehsan Adeli
ExHall D Poster #73
SINR: Sparsity Driven Compressed Implicit Neural Representations Poster Session 1
Dhananjaya Jayasundara ⋅ Sudarshan Rajagopalan ⋅ Yasiru Ranasinghe ⋅ Trac Tran ⋅ Vishal M. Patel
ExHall D Poster #277
Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models Poster Session 5
Ronghuan Wu ⋅ Wanchao Su ⋅ Jing Liao
ExHall D Poster #254
Spiking Transformer: Introducing Accurate Addition-Only Spiking Self-Attention for Transformer Poster Session 5
Yufei Guo ⋅ Xiaode Liu ⋅ Yuanpei Chen ⋅ Weihang Peng ⋅ Yuhan Zhang ⋅ Zhe Ma
ExHall D Poster #322
S2D-LFE: Sparse-to-Dense Light Field Event Generation Poster Session 3
Yutong Liu ⋅ Wenming Weng ⋅ Yueyi Zhang ⋅ Zhiwei Xiong
ExHall D Poster #53
Beyond Generation: A Diffusion-based Low-level Feature Extractor for Detecting AI-generated Images Poster Session 2
Nan Zhong ⋅ Haoyu Chen ⋅ Yiran Xu ⋅ Zhenxing Qian ⋅ Xinpeng Zhang
ExHall D Poster #275
Towards Stable and Storage-efficient Dataset Distillation: Matching Convexified Trajectory Poster Session 5
Wenliang Zhong ⋅ Haoyu Tang ⋅ Qinghai Zheng ⋅ Mingzhu Xu ⋅ Yupeng Hu ⋅ Weili Guan
ExHall D Poster #434
FiRe: Fixed-points of Restoration Priors for Solving Inverse Problems Poster Session 5
Matthieu Terris ⋅ Ulugbek Kamilov ⋅ Thomas Moreau
ExHall D Poster #203
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes Poster Session 6
Ruijie Lu ⋅ Yixin Chen ⋅ Junfeng Ni ⋅ Baoxiong Jia ⋅ Yu Liu ⋅ Diwen Wan ⋅ Gang Zeng ⋅ Siyuan Huang
ExHall D Poster #60
Dynamic Stereotype Theory Induced Micro-expression Recognition with Oriented Deformation Poster Session 3
Bohao Zhang ⋅ Xuejiao Wang ⋅ Changbo Wang ⋅ Gaoqi He
ExHall D Poster #5
Charm: The Missing Piece in ViT Fine-Tuning for Image Aesthetic Assessment Poster Session 2
Fatemeh Behrad ⋅ Tinne Tuytelaars ⋅ Johan Wagemans
ExHall D Poster #234
Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations Poster Session 5
Haitong Liu ⋅ Kuofeng Gao ⋅ Yang Bai ⋅ Jinmin Li ⋅ Jinxiao Shan ⋅ Tao Dai ⋅ Shu-Tao Xia
ExHall D Poster #290
SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures Poster Session 6
Hui Liu ⋅ Chen Jia ⋅ Fan Shi ⋅ Xu Cheng ⋅ Shengyong Chen
ExHall D Poster #311
MMRL: Multi-Modal Representation Learning for Vision-Language Models Poster Session 5
Yuncheng Guo ⋅ Xiaodong Gu
ExHall D Poster #380
Parallelized Autoregressive Visual Generation Poster Session 3
Yuqing Wang ⋅ Shuhuai Ren ⋅ Zhijie Lin ⋅ Yujin Han ⋅ Haoyuan Guo ⋅ Zhenheng Yang ⋅ Difan Zou ⋅ Jiashi Feng ⋅ Xihui Liu
ExHall D Poster #220
PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Mobius Spatial Augmentation Poster Session 1
Zidong Cao ⋅ Jinjing Zhu ⋅ Weiming Zhang ⋅ Hao Ai ⋅ Haotian Bai ⋅ Hengshuang Zhao ⋅ Lin Wang
ExHall D Poster #76
JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data Poster Session 2
Runjian Chen ⋅ Wenqi Shao ⋅ Bo Zhang ⋅ Shaoshuai Shi ⋅ Li Jiang ⋅ Ping Luo
ExHall D Poster #136
One Model for ALL: Low-Level Task Interaction Is a Key to Task-Agnostic Image Fusion Poster Session 6
Chunyang Cheng ⋅ Tianyang Xu ⋅ Zhenhua Feng ⋅ Xiaojun Wu ⋅ Zhangyong Tang ⋅ Hui Li ⋅ Zhang Zeyang ⋅ Sara Atito ⋅ Muhammad Awais ⋅ Josef Kittler
ExHall D Poster #184
Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation Poster Session 4
Pu Cao ⋅ Feng Zhou ⋅ Lu Yang ⋅ Tianrui Huang ⋅ Qing Song
ExHall D Poster #244
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations Poster Session 6
Ziyang Zhang ⋅ Yang Yu ⋅ Yucheng Chen ⋅ Xulei Yang ⋅ Si Yong Yeo
ExHall D Poster #345
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders Poster Session 1
Ziqi Pang ⋅ Tianyuan Zhang ⋅ Fujun Luan ⋅ Yunze Man ⋅ Hao Tan ⋅ Kai Zhang ⋅ William Freeman ⋅ Yu-Xiong Wang
ExHall D Poster #222
Learning to Highlight Audio by Watching Movies Poster Session 5
Chao Huang ⋅ Ruohan Gao ⋅ J. M. F. Tsang ⋅ Jan Kurcius ⋅ Cagdas Bilen ⋅ Chenliang Xu ⋅ Anurag Kumar ⋅ Sanjeel Parekh
ExHall D Poster #278
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos Poster Session 2
Prithviraj Banerjee ⋅ Sindi Shkodrani ⋅ Pierre Moulon ⋅ Shreyas Hampali ⋅ Shangchen Han ⋅ Fan Zhang ⋅ Linguang Zhang ⋅ Jade Fountain ⋅ Edward Miller ⋅ Selen Basol ⋅ Richard Newcombe ⋅ Robert Wang ⋅ Jakob Engel ⋅ Tomas Hodan
ExHall D Poster #163
Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding Poster Session 2
Duo Zheng ⋅ Shijia Huang ⋅ Liwei Wang
ExHall D Poster #347
Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation Poster Session 2
Yudi Shi ⋅ Shangzhe Di ⋅ Qirui Chen ⋅ Weidi Xie
ExHall D Poster #300
RobSense: A Robust Multi-modal Foundation Model for Remote Sensing with Static, Temporal, and Incomplete Data Adaptability Poster Session 2
Minh Kha Do ⋅ Kang Han ⋅ Phu Lai ⋅ Khoa T. Phan ⋅ Wei Xiang
ExHall D Poster #197
Joint Scheduling of Causal Prompts and Tasks for Multi-Task Learning Poster Session 5
Chaoyang Li ⋅ Jianyang Qin ⋅ Jinhao Cui ⋅ Zeyu Liu ⋅ Ning Hu ⋅ Qing Liao
ExHall D Poster #390
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models Poster Session 2
Zhihang Liu ⋅ Chen-Wei Xie ⋅ Pandeng Li ⋅ Liming Zhao ⋅ Longxiang Tang ⋅ Yun Zheng ⋅ Chuanbin Liu ⋅ Hongtao Xie
ExHall D Poster #305
Unified Dense Prediction of Video Diffusion Poster Session 6
Lehan Yang ⋅ Lu Qi ⋅ Xiangtai Li ⋅ Sheng Li ⋅ Varun Jampani ⋅ Ming-Hsuan Yang
ExHall D Poster #269
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models Poster Session 1
Siyuan Bian ⋅ Chenghao Xu ⋅ Yuliang Xiu ⋅ Artur Grigorev ⋅ Zhen Liu ⋅ Cewu Lu ⋅ Michael J. Black ⋅ Yao Feng
ExHall D Poster #264
Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking Poster Session 2
Chaocan Xue ⋅ Bineng Zhong ⋅ Qihua Liang ⋅ Yaozong Zheng ⋅ Ning Li ⋅ Yuanliang Xue ⋅ Shuxiang Song
ExHall D Poster #130
PhyS-EdiT: Physics-aware Semantic Image Editing with Text Description Poster Session 2
Ziqi Cai ⋅ Shuchen Weng ⋅ Yifei Xia ⋅ Boxin Shi
ExHall D Poster #239
Single Domain Generalization for Few-Shot Counting via Universal Representation Matching Poster Session 1
Xianing Chen ⋅ Si Huo ⋅ Borui Jiang ⋅ Hailin Hu ⋅ Xinghao Chen
ExHall D Poster #428
FedAWA: Adaptive Optimization of Aggregation Weights in Federated Learning Using Client Vectors Poster Session 6
Changlong Shi ⋅ He Zhao ⋅ Bingjie Zhang ⋅ Mingyuan Zhou ⋅ Dandan Guo ⋅ Yi Chang
ExHall D Poster #431
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training Poster Session 5
Luo ⋅ Xue Yang ⋅ Wenhan Dou ⋅ Zhaokai Wang ⋅ Jiawen Liu ⋅ Jifeng Dai ⋅ Yu Qiao ⋅ Xizhou Zhu
ExHall D Poster #375
Instruction-based Image Manipulation by Watching How Things Move Poster Session 1
Mingdeng Cao ⋅ Xuaner Zhang ⋅ Yinqiang Zheng ⋅ Zhihao Xia
ExHall D Poster #243
Embodied Scene Understanding for Vision Language Models via MetaVQA Poster Session 5
Weizhen Wang ⋅ Chenda Duan ⋅ Zhenghao Peng ⋅ Yuxin Liu ⋅ Bolei Zhou
ExHall D Poster #133
PEER Pressure: Model-to-Model Regularization for Single Source Domain Generalization Poster Session 3
Dongkyu Cho ⋅ Inwoo Hwang ⋅ Sanghack Lee
ExHall D Poster #451
Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks Poster Session 6
Han Wang ⋅ Gang Wang ⋅ Huan Zhang
ExHall D Poster #363
Functionality Understanding and Segmentation in 3D Scenes Poster Session 5
Jaime Corsetti ⋅ Francesco Giuliari ⋅ Alice Fasoli ⋅ Davide Boscaini ⋅ Fabio Poiesi
ExHall D Poster #336
Less is More: Efficient Image Vectorization with Adaptive Parameterization Poster Session 4
Kaibo Zhao ⋅ Liang Bao ⋅ Yufei Li ⋅ Xu Su ⋅ Ke Zhang ⋅ Xiaotian Qiao
ExHall D Poster #225
MESC-3D: Mining Effective Semantic Cues for 3D Reconstruction from a Single Image Poster Session 4
Shaoming Li ⋅ Qing Cai ⋅ Songqi Kong ⋅ Runqing Tan ⋅ Heng Tong ⋅ Shiji Qiu ⋅ Yongguo Jiang ⋅ Zhi Liu
ExHall D Poster #105
Probing the Mid-level Vision Capabilities of Self-Supervised Learning Poster Session 6
Xuweiyi Chen ⋅ Markus Marks ⋅ Zezhou Cheng
ExHall D Poster #377
Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection Poster Session 3
Zihao Zhang ⋅ Aming Wu ⋅ Yahong Han
ExHall D Poster #342
Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation Poster Session 6
Chuandong Liu ⋅ Xingxing Weng ⋅ Shuguo Jiang ⋅ Pengcheng Li ⋅ Lei Yu ⋅ Gui-Song Xia
ExHall D Poster #117
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Poster Session 2
Minghong Cai ⋅ Xiaodong Cun ⋅ Xiaoyu Li ⋅ Wenze Liu ⋅ Zhaoyang Zhang ⋅ Yong Zhang ⋅ Ying Shan ⋅ Xiangyu Yue
ExHall D Poster #229
Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation Poster Session 5
Shahad Albastaki ⋅ Anabia Sohail ⋅ Iyyakutti Iyappan Ganapathi ⋅ Basit Alawode ⋅ Asim Khan ⋅ Sajid Javed ⋅ Naoufel Werghi ⋅ Mohammed Bennamoun ⋅ Arif Mahmood
ExHall D Poster #468
SpecTRe-GS: Modeling Highly Specular Surfaces with Reflected Nearby Objects by Tracing Rays in 3D Gaussian Splatting Poster Session 4
Jiajun Tang ⋅ Fan Fei ⋅ Zhihao Li ⋅ Xiao Tang ⋅ Shiyong Liu ⋅ Youyu Chen ⋅ Binxiao Huang ⋅ Dave Zhenyu Chen ⋅ Xiaofei Wu ⋅ Boxin Shi
ExHall D Poster #27
T2SG: Traffic Topology Scene Graph for Topology Reasoning in Autonomous Driving Poster Session 4
Changsheng Lv ⋅ Mengshi Qi ⋅ Liang Liu ⋅ Huadong Ma
ExHall D Poster #132
ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning Poster Session 6
Quanxing Zha ⋅ Xin Liu ⋅ Shu-Juan Peng ⋅ Yiu-ming Cheung ⋅ Xing Xu ⋅ Nannan Wang
ExHall D Poster #338
ConceptGuard: Continual Personalized Text-to-Image Generation with Forgetting and Confusion Mitigation Poster Session 1
Zirun Guo ⋅ Tao Jin
ExHall D Poster #266
AKiRa: Augmentation Kit on Rays for Optical Video Generation Poster Session 1
Xi Wang ⋅ Robin Courant ⋅ Marc Christie ⋅ Vicky Kalogeiton
ExHall D Poster #234
RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects Poster Session 2
Soumyaratna Debnath ⋅ Ashish Tiwari ⋅ Kaustubh Sadekar ⋅ Shanmuganathan Raman
ExHall D Poster #38
Exploring Historical Information for RGBE Visual Tracking with Mamba Poster Session 2
Chuanyu Sun ⋅ Jiqing Zhang ⋅ Yang Wang ⋅ Huilin Ge ⋅ Qianchen Xia ⋅ Baocai Yin ⋅ Xin Yang
ExHall D Poster #107
Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation Poster Session 4
Tal Zeevi ⋅ Ravid Shwartz-Ziv ⋅ Yann LeCun ⋅ Lawrence Staib ⋅ John A Onofrey
ExHall D Poster #471
MP-GUI: Modality Perception with MLLMs for GUI Understanding Poster Session 6
Ziwei Wang ⋅ Weizhi Chen ⋅ Leyang Yang ⋅ Sheng Zhou ⋅ Shengchu Zhao ⋅ Hanbei Zhan ⋅ Jiongchao Jin ⋅ Liangcheng Li ⋅ Zirui Shao ⋅ Jiajun Bu
ExHall D Poster #342
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding Poster Session 6
Han Xiao ⋅ Yina Xie ⋅ Guanxin Tan ⋅ Yinghao Chen ⋅ Rui Hu ⋅ Ke Wang ⋅ Aojun Zhou ⋅ Hao Li ⋅ Hao Shao ⋅ Xudong Lu ⋅ Peng Gao ⋅ Yafei Wen ⋅ Xiaoxin Chen ⋅ Shuai Ren ⋅ Hongsheng Li
ExHall D Poster #325
Identifying and Mitigating Spurious Correlation in Multi-Task Learning Poster Session 5
Junyi Chai ⋅ Shenyu Lu ⋅ Xiaoqian Wang
ExHall D Poster #446
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention Poster Session 6
Wenbin An ⋅ Feng Tian ⋅ Sicong Leng ⋅ Jiahao Nie ⋅ Haonan Lin ⋅ QianYing Wang ⋅ Ping Chen ⋅ Xiaoqin Zhang ⋅ Shijian Lu
ExHall D Poster #360
Sketchy Bounding-box Supervision for 3D Instance Segmentation Poster Session 2
Qian Deng ⋅ Le Hui ⋅ Jin Xie ⋅ Jian Yang
ExHall D Poster #336
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions Poster Session 3
Stefan Andreas Baumann ⋅ Felix Krause ⋅ Michael Neumayr ⋅ Nick Stracke ⋅ Melvin Sevi ⋅ Vincent Tao Hu ⋅ Björn Ommer
ExHall D Poster #246
Towards Explainable and Unprecedented Accuracy in Matching Challenging Finger Crease Patterns Poster Session 2
Zhenyu Zhou ⋅ Chengdong Dong ⋅ Ajay Kumar
ExHall D Poster #74
MODfinity: Unsupervised Domain Adaptation with Multimodal Information Flow Intertwining Poster Session 1
Shanglin Liu ⋅ Jianming Lv ⋅ Jingdan Kang ⋅ Huaidong Zhang ⋅ Zequan Liang ⋅ Shengfeng He
ExHall D Poster #471
Towards Universal Soccer Video Understanding Poster Session 2
Jiayuan Rao ⋅ Haoning Wu ⋅ Hao Jiang ⋅ Ya Zhang ⋅ Yanfeng Wang ⋅ Weidi Xie
ExHall D Poster #288
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment Poster Session 3
Katrin Renz ⋅ Long Chen ⋅ Elahe Arani ⋅ Oleg Sinavski
ExHall D Poster #130
Prior-free 3D Object Tracking Poster Session 1
Xiuqiang Song ⋅ Li Jin ⋅ Zhengxian Zhang ⋅ Jiachen Li ⋅ Fan Zhong ⋅ Guofeng Zhang ⋅ Xueying Qin
ExHall D Poster #96
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation Poster Session 1
Bingjie Gao ⋅ Xinyu Gao ⋅ Xiaoxue Wu ⋅ Yujie Zhou ⋅ Yu Qiao ⋅ Li Niu ⋅ Xinyuan Chen ⋅ Yaohui Wang
ExHall D Poster #288
Enhancing Few-Shot Class-Incremental Learning via Training-Free Bi-Level Modality Calibration Poster Session 2
Yiyang Chen ⋅ Tianyu Ding ⋅ Lei Wang ⋅ Jing Huo ⋅ Yang Gao ⋅ Wenbin Li
ExHall D Poster #429
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs Poster Session 4
Lucas Ventura ⋅ Antoine Yang ⋅ Cordelia Schmid ⋅ Gul Varol
ExHall D Poster #301
SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting Poster Session 3
Jiahui Zhang ⋅ Fangneng Zhan ⋅ Ling Shao ⋅ Shijian Lu
ExHall D Poster #49
VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction Poster Session 2
Ziyue Zhu ⋅ Shenlong Wang ⋅ Jin Xie ⋅ Jiang-Jiang Liu ⋅ Jingdong Wang ⋅ Jian Yang
ExHall D Poster #133
Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution Poster Session 2
Shijun Shi ⋅ Jing Xu ⋅ Lijing Lu ⋅ Zhihang Li ⋅ Kai Hu
ExHall D Poster #193
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins Poster Session 6
Yao Mu ⋅ Tianxing Chen ⋅ Zanxin Chen ⋅ Shijia Peng ⋅ Zhiqian Lan ⋅ Zeyu Gao ⋅ Zhixuan Liang ⋅ Qiaojun Yu ⋅ Yude Zou ⋅ Mingkun Xu ⋅ Lunkai Lin ⋅ Zhiqiang Xie ⋅ Mingyu Ding ⋅ Ping Luo
ExHall D Poster #142
LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant Poster Session 1
Wei Li ⋅ Bing Hu ⋅ Rui Shao ⋅ Leyang Shen ⋅ Liqiang Nie
ExHall D Poster #294
Preconditioners for the Stochastic Training of Neural Fields Poster Session 6
Shin-Fang Chng ⋅ Hemanth Saratchandran ⋅ Simon Lucey
ExHall D Poster #102
Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning Poster Session 1
Cheng Chen ⋅ Yunpeng Zhai ⋅ Yifan Zhao ⋅ Jinyang Gao ⋅ Bolin Ding ⋅ Jia Li
ExHall D Poster #348
VinaBench: Benchmark for Faithful and Consistent Visual Narratives Poster Session 1
Silin Gao ⋅ Sheryl Mathew ⋅ Li Mi ⋅ Sepideh Mamooler ⋅ Mengjie Zhao ⋅ Hiromi Wakaki ⋅ Yuki Mitsufuji ⋅ Syrielle Montariol ⋅ Antoine Bosselut
ExHall D Poster #259
FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolution Poster Session 6
Junyang Chen ⋅ Jinshan Pan ⋅ Jiangxin Dong
ExHall D Poster #193
Learned Image Compression with Dictionary-based Entropy Model Poster Session 3
Jingbo Lu ⋅ Leheng Zhang ⋅ Xingyu Zhou ⋅ Mu Li ⋅ Wen Li ⋅ Shuhang Gu
ExHall D Poster #210
ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation Poster Session 4
Tao Tan ⋅ Qiulei Dong
ExHall D Poster #96
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance Poster Session 1
Xin Ye ⋅ Burhan Yaman ⋅ Sheng Cheng ⋅ Feng Tao ⋅ Abhirup Mallik ⋅ Liu Ren
ExHall D Poster #124
VoCo-LLaMA: Towards Vision Compression with Large Language Models Poster Session 6
Xubing Ye ⋅ Yukang Gan ⋅ Xiaoke Huang ⋅ Yixiao Ge ⋅ Yansong Tang
ExHall D Poster #353
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Poster Session 2
Jierun Chen ⋅ Dongting Hu ⋅ Xijie Huang ⋅ Huseyin Coskun ⋅ Arpit Sahni ⋅ Aarush Gupta ⋅ Anujraaj Goyal "argo" ⋅ Dishani Lahiri ⋅ Rajesh Singh ⋅ Yerlan Idelbayev ⋅ Junli Cao ⋅ Yanyu Li ⋅ Kwang-Ting Cheng ⋅ Mingming Gong ⋅ S.-H. Gary Chan ⋅ Sergey Tulyakov ⋅ Anil Kag ⋅ Yanwu Xu ⋅ Jian Ren
ExHall D Poster #251
Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation Poster Session 5
Yuanqi Yao ⋅ Siao Liu ⋅ Haoming Song ⋅ Delin Qu ⋅ Qizhi Chen ⋅ Yan Ding ⋅ Bin Zhao ⋅ Zhigang Wang ⋅ Dong Wang ⋅ Xuelong Li
ExHall D Poster #144
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Large Model Enhancement Poster Session 1
Qianhan Feng ⋅ Wenshuo Li ⋅ Tong Lin ⋅ Xinghao Chen
ExHall D Poster #383
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Poster Session 6
Joya Chen ⋅ Yiqi Lin ⋅ Ziyun Zeng ⋅ Wei Li ⋅ Zejun Ma ⋅ Mike Zheng Shou
ExHall D Poster #281
ETAP: Event-based Tracking of Any Point Poster Session 6
Friedhelm Hamann ⋅ Daniel Gehrig ⋅ Filbert Febryanto ⋅ Kostas Daniilidis ⋅ Guillermo Gallego
ExHall D Poster #99
Feature Spectrum Learning for Remote Sensing Change Detection Poster Session 3
Qi Zang ⋅ Dong Zhao ⋅ Shuang Wang ⋅ Dou Quan ⋅ Licheng Jiao ⋅ Zhun Zhong
ExHall D Poster #191
Free Lunch Enhancements for Multi-modal Crowd Counting Poster Session 3
Haoliang Meng ⋅ Xiaopeng Hong ⋅ Zhengqin Lai ⋅ Miao Shang
ExHall D Poster #322
LiSu: A Dataset and Method for LiDAR Surface Normal Estimation Poster Session 4
Dušan Malić ⋅ Christian Fruhwirth-Reisinger ⋅ Samuel Schulter ⋅ Horst Possegger
ExHall D Poster #117
Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation Poster Session 1
Xingguang Zhang ⋅ Nicholas M Chimitt ⋅ Xijun Wang ⋅ Yu Yuan ⋅ Stanley H. Chan
ExHall D Poster #184
Adapting to Observation Length of Trajectory Prediction via Contrastive Learning Poster Session 1
Ruiqi Qiu ⋅ Jun Gong ⋅ Xinyu Zhang ⋅ Siqi Luo ⋅ Bowen Zhang ⋅ Yi Cen
ExHall D Poster #138
GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping Poster Session 2
Jinfeng Liu ⋅ Lingtong Kong ⋅ Bo Li ⋅ Dan Xu
ExHall D Poster #52
TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing Poster Session 6
Stefan Lionar ⋅ Jiabin Liang ⋅ Gim Hee Lee
ExHall D Poster #43
Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training? Poster Session 5
Yuechen Xie ⋅ Jie Song ⋅ Huiqiong Wang ⋅ Mingli Song
ExHall D Poster #268
vesselFM: A Foundation Model for Universal 3D Blood Vessel Segmentation Poster Session 4
Bastian Wittmann ⋅ Yannick Wattenberg ⋅ Tamaz Amiranashvili ⋅ Suprosanna Shit ⋅ Bjoern Menze
ExHall D Poster #482
DistinctAD: Distinctive Audio Description Generation in Contexts Poster Session 3
Bo Fang ⋅ Wenhao Wu ⋅ Qiangqiang Wu ⋅ YuXin Song ⋅ Antoni B. Chan
ExHall D Poster #279
HOT: Hadamard-based Optimized Training Poster Session 1
Seonggon Kim ⋅ Juncheol Shin ⋅ Seung-taek Woo ⋅ Eunhyeok Park
ExHall D Poster #442
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input Poster Session 1
Zhen Lv ⋅ Yangqi Long ⋅ Congzhentao Huang ⋅ Cao Li ⋅ Chengfei Lv ⋅ Hao Ren ⋅ Dian Zheng
ExHall D Poster #60
From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models Poster Session 1
German Barquero ⋅ Nadine Bertsch ⋅ Manojkumar Marramreddy ⋅ Carlos Chacón ⋅ Filippo Arcadu ⋅ Ferran Rigual ⋅ Nicky Sijia He ⋅ Cristina Palmero ⋅ Sergio Escalera ⋅ Yuting Ye ⋅ Robin Kips
ExHall D Poster #156
Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement Poster Session 5
Yuchen Ren ⋅ Zhengyu Zhao ⋅ Chenhao Lin ⋅ Bo Yang ⋅ Lu Zhou ⋅ Zhe Liu ⋅ Chao Shen
ExHall D Poster #385
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Poster Session 5
Sherwin Bahmani ⋅ Ivan Skorokhodov ⋅ Guocheng Qian ⋅ Aliaksandr Siarohin ⋅ Willi Menapace ⋅ Andrea Tagliasacchi ⋅ David B. Lindell ⋅ Sergey Tulyakov
ExHall D Poster #173
Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks Poster Session 5
Junying Wang ⋅ Hongyuan Zhang ⋅ Yuan Yuan
ExHall D Poster #259
Shadow Generation Using Diffusion Model with Geometry Prior Poster Session 2
Haonan Zhao ⋅ Qingyang Liu ⋅ Xinhao Tao ⋅ Li Niu ⋅ Guangtao Zhai
ExHall D Poster #213
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think Poster Session 3
Zhenyi Lu ⋅ Xiaoye Qu ⋅ Zhenyi Lu ⋅ Wei Wei ⋅ Sichen Liu ⋅ Yu Cheng
ExHall D Poster #177
Visual Agentic AI for Spatial Reasoning with a Dynamic API Poster Session 4
Damiano Marsili ⋅ Rohun Agrawal ⋅ Yisong Yue ⋅ Georgia Gkioxari
ExHall D Poster #347
RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges Poster Session 6
Thibaut Loiseau ⋅ Guillaume Bourmaud
ExHall D Poster #88
Star with Bilinear Mapping Poster Session 5
Zelin Peng ⋅ Yu Huang ⋅ Zhengqin Xu ⋅ Feilong Tang ⋅ Ming Hu ⋅ Xiaokang Yang ⋅ Wei Shen
ExHall D Poster #406
Efficient Personalization of Quantized Diffusion Model without Backpropagation Poster Session 2
Hoigi Seo ⋅ Wongi Jeong ⋅ Kyungryeol Lee ⋅ Se Young Chun
ExHall D Poster #225
M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings Poster Session 3
Qingzheng Xu ⋅ Ru Cao ⋅ Xin Shen ⋅ Heming Du ⋅ Sen Wang ⋅ Xin Yu
ExHall D Poster #157
The Illusion of Unlearning: The Unstable Nature of Machine Unlearning in Text-to-Image Diffusion Models Poster Session 3
Naveen George ⋅ Karthik Nandan Dasaraju ⋅ Rutheesh Reddy Chittepu ⋅ Konda Reddy Mopuri
ExHall D Poster #261
SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer Poster Session 6
Hongda Liu ⋅ Longguang Wang ⋅ Ye Zhang ⋅ Ziru Yu ⋅ Yulan Guo
ExHall D Poster #220
Auto-Encoded Supervision for Perceptual Image Super-Resolution Poster Session 4
MinKyu Lee ⋅ Sangeek Hyun ⋅ Woojin Jun ⋅ Jae-Pil Heo
ExHall D Poster #205
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model Poster Session 6
Zhenglin Huang ⋅ Jinwei Hu ⋅ Yiwei He ⋅ Xiangtai Li ⋅ Xiaowei Huang ⋅ Bei Peng ⋅ Xingyu Zhao ⋅ Baoyuan Wu ⋅ Guangliang Cheng
ExHall D Poster #254
One2Any: One-Reference 6D Pose Estimation for Any Object Poster Session 2
Mengya Liu ⋅ Siyuan Li ⋅ Ajad Chhatkuli ⋅ Prune Truong ⋅ Luc Van Gool ⋅ Federico Tombari
ExHall D Poster #103
A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation Poster Session 6
Andrew Z Wang ⋅ Songwei Ge ⋅ Tero Karras ⋅ Ming-Yu Liu ⋅ Yogesh Balaji
ExHall D Poster #230
PreciseCam: Precise Camera Control for Text-to-Image Generation Poster Session 1
Edurne Bernal-Berdun ⋅ Ana Serrano ⋅ Belen Masia ⋅ Matheus Gadelha ⋅ Yannick Hold-Geoffroy ⋅ Xin Sun ⋅ Diego Gutierrez
ExHall D Poster #246
EventGPT: Event Stream Understanding with Multimodal Large Language Models Poster Session 6
Shaoyu Liu ⋅ Jianing Li ⋅ Guanghui Zhao ⋅ Yunjian Zhang ⋅ Xin Meng ⋅ Fei Richard Yu ⋅ Xiangyang Ji ⋅ Ming Li
ExHall D Poster #286
Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors Poster Session 5
Weilong Yan ⋅ Ming Li ⋅ Li Haipeng ⋅ Shuwei Shao ⋅ Robby T. Tan
ExHall D Poster #79
From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective Poster Session 4
Chen Zhao ⋅ Zhizhou Chen ⋅ Yunzhe Xu ⋅ Enxuan Gu ⋅ Jian Li ⋅ Zili Yi ⋅ Qian Wang ⋅ Jian Yang ⋅ Ying Tai
ExHall D Poster #203
Rectification-specific Supervision and Constrained Estimator for Online Stereo Rectification Poster Session 5
Rui Gong ⋅ Kim-Hui Yap ⋅ Weide Liu ⋅ Xulei Yang ⋅ Jun Cheng
ExHall D Poster #124
Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization Poster Session 2
Jamie Wynn ⋅ Zawar Qureshi ⋅ Jakub Powierza ⋅ Jamie Watson ⋅ Mohamed Sayed
ExHall D Poster #235
Leveraging Global Stereo Consistency for Category-Level Shape and 6D Pose Estimation from Stereo Images Poster Session 4
Junning Qiu ⋅ Minglei Lu ⋅ Fei Wang ⋅ Yu Guo ⋅ Yonggen Ling
ExHall D Poster #97
EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models Poster Session 2
Yinan Liang ⋅ Ziwei Wang ⋅ Xiuwei Xu ⋅ Jie Zhou ⋅ Jiwen Lu
ExHall D Poster #388
Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAV Target Detection Poster Session 3
Houzhang Fang ⋅ Xiaolin Wang ⋅ Zengyang Li ⋅ Lu Wang ⋅ Qingshan Li ⋅ Yi Chang ⋅ Luxin Yan
ExHall D Poster #121
Articulated Kinematics Distillation from Video Diffusion Models Poster Session 4
Xuan Li ⋅ Qianli Ma ⋅ Tsung-Yi Lin ⋅ Yongxin Chen ⋅ Chenfanfu Jiang ⋅ Ming-Yu Liu ⋅ Donglai Xiang
ExHall D Poster #169
MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion Poster Session 5
Zador Pataki ⋅ Paul-Edouard Sarlin ⋅ Johannes Schönberger ⋅ Marc Pollefeys
ExHall D Poster #80
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging Poster Session 1
Yijie Tang ⋅ Jiazhao Zhang ⋅ Yuqing Lan ⋅ Yulan Guo ⋅ Dezun Dong ⋅ Chenyang Zhu ⋅ Kai Xu
ExHall D Poster #334
Tora: Trajectory-oriented Diffusion Transformer for Video Generation Poster Session 1
Zhenghao Zhang ⋅ Junchao Liao ⋅ Menghao Li ⋅ Zuozhuo Dai ⋅ Bingxue Qiu ⋅ Siyu Zhu ⋅ Long Qin ⋅ Weizhi Wang
ExHall D Poster #178
Volumetrically Consistent 3D Gaussian Rasterization Poster Session 3
Chinmay Talegaonkar ⋅ Yash Belhe ⋅ Ravi Ramamoorthi ⋅ Nicholas Antipa
ExHall D Poster #28
DiffVsgg: Diffusion-Driven Online Video Scene Graph Generation Poster Session 6
Mu Chen ⋅ Liulei Li ⋅ Wenguan Wang ⋅ Yi Yang
ExHall D Poster #288
Large-scale Multi-view Tensor Clustering with Implicit Linear Kernels Poster Session 4
Jiyuan Liu ⋅ Xinwang Liu ⋅ Chuankun Li ⋅ Xinhang Wan ⋅ Hao Tan ⋅ Yi Zhang ⋅ Weixuan Liang ⋅ Qian Qu ⋅ Yu Feng ⋅ Renxiang Guan ⋅ Ke Liang
ExHall D Poster #468
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection Poster Session 2
Boyong He ⋅ Yuxiang Ji ⋅ Qianwen Ye ⋅ Zhuoyue Tan ⋅ Liaoni Wu
ExHall D Poster #433
Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content Poster Session 3
Zicheng Zhang ⋅ Tengchuan Kou ⋅ Chunyi Li ⋅ Shushi Wang ⋅ Wei Sun ⋅ Wei Wang ⋅ Xiaoyu Li ⋅ ZongYu Wang ⋅ Xuezhi Cao ⋅ Xiongkuo Min ⋅ Xiaohong Liu ⋅ Guangtao Zhai
ExHall D Poster #358
Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation Poster Session 3
Zhuoran Zhao ⋅ Linlin Yang ⋅ Pengzhan Sun ⋅ Pan Hui ⋅ Angela Yao
ExHall D Poster #154
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization Poster Session 1
Junying Wang ⋅ Jingyuan Liu ⋅ Xin Sun ⋅ Krishna Kumar Singh ⋅ Zhixin Shu ⋅ He Zhang ⋅ Jimei Yang ⋅ Nanxuan Zhao ⋅ Tuanfeng Y. Wang ⋅ Simon Su Chen ⋅ Ulrich Neumann ⋅ Jae Shin Yoon
ExHall D Poster #20
Hyperspectral Pansharpening via Diffusion Models with Iteratively Zero-Shot Guidance Poster Session 3
Jin-Liang Xiao ⋅ Ting-Zhu Huang ⋅ Liang-Jian Deng ⋅ Guang Lin ⋅ Zihan Cao ⋅ Chao Li ⋅ Qibin Zhao
ExHall D Poster #193
Efficient Motion-Aware Video MLLM Poster Session 5
Zijia Zhao ⋅ Yuqi Huo ⋅ Tongtian Yue ⋅ Longteng Guo ⋅ Haoyu Lu ⋅ Bingning Wang ⋅ Weipeng Chen ⋅ Jing Liu
ExHall D Poster #300
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering Poster Session 3
Jingzhou Luo ⋅ Yang Liu ⋅ Weixing Chen ⋅ Zhen Li ⋅ Yaowei Wang ⋅ Guanbin Li ⋅ Liang Lin
ExHall D Poster #337
Zero-Shot 4D Lidar Panoptic Segmentation Poster Session 5
Yushan Zhang ⋅ Aljoša Ošep ⋅ Laura Leal-Taixe ⋅ Tim Meinhardt
ExHall D Poster #332
MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views Poster Session 2
Antoine Guédon ⋅ Tomoki Ichikawa ⋅ Kohei Yamashita ⋅ Ko Nishino
ExHall D Poster #53
Extreme Rotation Estimation in the Wild Poster Session 1
Hana Bezalel ⋅ Dotan Ankri ⋅ Ruojin Cai ⋅ Hadar Averbuch-Elor
ExHall D Poster #83
ADU: Adaptive Detection of Unknown Categories in Black-Box Domain Adaptation Poster Session 6
Yushan Lai ⋅ Guowen Li ⋅ Haoyuan Liang ⋅ Juepeng Zheng ⋅ Zhiyu Ye
ExHall D Poster #425
Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model Poster Session 1
Yuhan Wang ⋅ Suzhi Bi ⋅ Ying-Jun Angela Zhang ⋅ Xiaojun Yuan
ExHall D Poster #208
IceDiff: High Resolution and High-Quality Arctic Sea Ice Forecasting with Generative Diffusion Prior Poster Session 3
Jingyi Xu ⋅ Siwei Tu ⋅ Weidong Yang ⋅ Ben Fei ⋅ Shuhao Li ⋅ Keyi Liu ⋅ Yeqi Luo ⋅ Lipeng Ma ⋅ Lei Bai
ExHall D Poster #184
DTOS: Dynamic Time Object Sensing with Large Multimodal Model Poster Session 3
Jirui Tian ⋅ Jinrong Zhang ⋅ Shenglan Liu ⋅ Luhao Xu ⋅ Zhixiong Huang ⋅ Gao Huang
ExHall D Poster #302
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation Poster Session 4
Zhiwei Yang ⋅ Yucong Meng ⋅ Kexue Fu ⋅ Feilong Tang ⋅ Shuo Wang ⋅ Zhijian Song
ExHall D Poster #421
UNIALIGN: Scaling Multimodal Alignment within One Unified Model Poster Session 6
Bo Zhou ⋅ Liulei Li ⋅ Yujia Wang ⋅ Huafeng Liu ⋅ Yazhou Yao ⋅ Wenguan Wang
ExHall D Poster #335
ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions Poster Session 6
Tomas Soucek ⋅ Prajwal Gatti ⋅ Michael Wray ⋅ Ivan Laptev ⋅ Dima Damen ⋅ Josef Sivic
ExHall D Poster #122
Exploration-Driven Generative Interactive Environments Poster Session 6
Nedko Savov ⋅ Naser Kazemi ⋅ Mohammad Mahdi ⋅ Danda Paudel ⋅ Xi Wang ⋅ Luc Van Gool
ExHall D Poster #137
MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection Poster Session 2
Rishubh Parihar ⋅ Srinjay Sarkar ⋅ Sarthak Vora ⋅ Jogendra Kundu ⋅ R. Venkatesh Babu
ExHall D Poster #110
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation Poster Session 3
Boyuan Wang ⋅ Xiaofeng Wang ⋅ Chaojun Ni ⋅ Guosheng Zhao ⋅ Zhiqin Yang ⋅ Zheng Zhu ⋅ Muyang Zhang ⋅ YuKun Zhou ⋅ Xinze Chen ⋅ Guan Huang ⋅ Lihong Liu ⋅ Xingang Wang
ExHall D Poster #166
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos Poster Session 4
Tanveer Hannan ⋅ Md Mohaiminul Islam ⋅ Jindong Gu ⋅ Thomas Seidl ⋅ Gedas Bertasius
ExHall D Poster #307
ArtiFade: Learning to Generate High-quality Subject from Blemished Images Poster Session 3
Shuya Yang ⋅ Shaozhe Hao ⋅ Yukang Cao ⋅ Kwan-Yee K. Wong
ExHall D Poster #240
Relation-Rich Visual Document Generator for Visual Information Extraction Poster Session 3
Zi-Han Jiang ⋅ Chien-Wei Lin ⋅ WeiHua Li ⋅ Hsuan-Tung Liu ⋅ Yi-Ren Yeh ⋅ Chu-Song Chen
ExHall D Poster #363
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation Poster Session 4
Haotong Lin ⋅ Sida Peng ⋅ Jingxiao Chen ⋅ Songyou Peng ⋅ Jiaming Sun ⋅ Minghuan Liu ⋅ Hujun Bao ⋅ Jiashi Feng ⋅ Xiaowei Zhou ⋅ Bingyi Kang
ExHall D Poster #120
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers Poster Session 1
Efstathios Karypidis ⋅ Ioannis Kakogeorgiou ⋅ Spyros Gidaris ⋅ Nikos Komodakis
ExHall D Poster #345
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery Poster Session 4
Enguang Wang ⋅ Zhimao Peng ⋅ Zhengyuan Xie ⋅ Fei Yang ⋅ Xialei Liu ⋅ Ming-Ming Cheng
ExHall D Poster #428
On the Out-Of-Distribution Generalization of Large Multimodal Models Poster Session 2
Xingxuan Zhang ⋅ Jiansheng Li ⋅ Wenjing Chu ⋅ Junjia Hai ⋅ Renzhe Xu ⋅ Yuqing Yang ⋅ Shikai Guan ⋅ Jiazheng Xu ⋅ Liping Jing ⋅ Peng Cui
ExHall D Poster #471
Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation Poster Session 3
Xingguo Lv ⋅ Xingbo Dong ⋅ Liwen Wang ⋅ Jiewen Yang ⋅ Lei Zhao ⋅ Bin Pu ⋅ Zhe Jin ⋅ Xuejun Li
ExHall D Poster #476
Easy-editable Image Vectorization with Multi-layer Multi-scale Distributed Visual Feature Embedding Poster Session 5
Ye Chen ⋅ Zhangli Hu ⋅ Zhongyin Zhao ⋅ Yupeng Zhu ⋅ Yue Shi ⋅ Yuxuan Xiong ⋅ Bingbing Ni
ExHall D Poster #219
Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration Poster Session 5
Junyuan Deng ⋅ Xinyi Wu ⋅ Yongxing Yang ⋅ Congchao Zhu ⋅ Song Wang ⋅ Zhenyao Wu
ExHall D Poster #204
Towards Scalable Human-aligned Benchmark for Text-guided Image Editing Poster Session 4
Suho Ryu ⋅ Kihyun Kim ⋅ Eugene Baek ⋅ Dongsoo Shin ⋅ Joonseok Lee
ExHall D Poster #238
Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens Poster Session 5
Zhangqi Jiang ⋅ Junkai Chen ⋅ Beier Zhu ⋅ Tingjin Luo ⋅ Yankun Shen ⋅ Xu Yang
ExHall D Poster #379
SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes Poster Session 5
Cheng-De Fan ⋅ Chen-Wei Chang ⋅ Yi-Ruei Liu ⋅ Jie-Ying Lee ⋅ Jiun-Long Huang ⋅ Yu-Chee Tseng ⋅ Yu-Lun Liu
ExHall D Poster #27
Scaling Inference Time Compute for Diffusion Models Poster Session 1
Nanye Ma ⋅ Shangyuan Tong ⋅ Haolin Jia ⋅ Hexiang Hu ⋅ Yu-Chuan Su ⋅ Mingda Zhang ⋅ Xuan Yang ⋅ Yandong Li ⋅ Tommi Jaakkola ⋅ Xuhui Jia ⋅ Saining Xie
ExHall D Poster #226
Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models Poster Session 4
Zichen Miao ⋅ Wei Chen ⋅ Qiang Qiu
ExHall D Poster #414
iG-6DoF: Model-free 6DoF Pose Estimation for Unseen Object via Iterative 3D Gaussian Splatting Poster Session 2
Tuo Cao ⋅ Fei Luo ⋅ Jiongming Qin ⋅ Yu Jiang ⋅ Yusen Wang ⋅ Chunxia Xiao
ExHall D Poster #101
HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories Poster Session 5
Eric Hedlin ⋅ Munawar Hayat ⋅ Fatih Porikli ⋅ Kwang Moo Yi ⋅ Shweta Mahajan
ExHall D Poster #103
RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection Poster Session 5
Yunfei Long ⋅ Abhinav Kumar ⋅ Xiaoming Liu ⋅ Daniel Morris
ExHall D Poster #117
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization Poster Session 2
Yiyang Du ⋅ Xiaochen Wang ⋅ Chi Chen ⋅ Jiabo Ye ⋅ Yiru Wang ⋅ Peng Li ⋅ Ming Yan ⋅ Ji Zhang ⋅ Fei Huang ⋅ Zhifang Sui ⋅ Maosong Sun ⋅ Yang Liu
ExHall D Poster #385
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation Poster Session 4
Qi Lv ⋅ Hao Li ⋅ Xiang Deng ⋅ Rui Shao ⋅ Yinchuan Li ⋅ Jianye Hao ⋅ Longxiang Gao ⋅ Michael Yu Wang ⋅ Liqiang Nie
ExHall D Poster #153
UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion Poster Session 4
Zixuan Chen ⋅ Yujin Wang ⋅ Xin Cai ⋅ Zhiyuan You ⋅ Zhe-Ming Lu ⋅ Fan Zhang ⋅ Shi Guo ⋅ Tianfan Xue
ExHall D Poster #25
Automated Proof of Polynomial Inequalities via Reinforcement Learning Poster Session 1
Banglong Liu ⋅ Niuniu Qi ⋅ Xia Zeng ⋅ Lydia Dehbi ⋅ Zhengfeng Yang
ExHall D Poster #467
Frequency Dynamic Convolution for Dense Image Prediction Poster Session 6
Linwei Chen ⋅ Lin Gu ⋅ Liang Li ⋅ Chenggang Yan ⋅ Ying Fu
ExHall D Poster #386
IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification Poster Session 6
Yuhao Wang ⋅ Yongfeng Lv ⋅ Pingping Zhang ⋅ Huchuan Lu
ExHall D Poster #341
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost Poster Session 1
Haiyang Mei ⋅ Pengyu Zhang ⋅ Mike Zheng Shou
ExHall D Poster #310
GroupMamba: Efficient Group-Based Visual State Space Model Poster Session 3
Abdelrahman Shaker ⋅ Syed Talal Wasim ⋅ Salman Khan ⋅ Jürgen Gall ⋅ Fahad Shahbaz Khan
ExHall D Poster #407
How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions Poster Session 2
Aditya Prakash ⋅ Benjamin E Lundell ⋅ Dmitry Andreychuk ⋅ David Forsyth ⋅ Saurabh Gupta ⋅ Harpreet S. Sawhney
ExHall D Poster #158
Escaping Plato's Cave: Towards the Alignment of 3D and Text Latent Spaces Poster Session 4
Souhail Hadgi ⋅ Luca Moschella ⋅ Andrea Santilli ⋅ Diego Gomez ⋅ Qixing Huang ⋅ Emanuele Rodolà ⋅ Simone Melzi ⋅ Maks Ovsjanikov
ExHall D Poster #383
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds Poster Session 2
Zhenggang Tang ⋅ Yuchen Fan ⋅ Dilin Wang ⋅ Hongyu Xu ⋅ Rakesh Ranjan ⋅ Alexander G. Schwing ⋅ Zhicheng Yan
ExHall D Poster #57
Explicit Depth-Aware Blurry Video Frame Interpolation Guided by Differential Curves Poster Session 1
Yan Zaoming ⋅ Pengcheng Lei ⋅ Tingting Wang ⋅ Faming Fang ⋅ Junkang Zhang ⋅ Yaomin Huang ⋅ Haichuan Song
ExHall D Poster #170
OFER: Occluded Face Expression Reconstruction Poster Session 6
Pratheba Selvaraju ⋅ Victoria Abrevaya ⋅ Timo Bolkart ⋅ Rick Akkerman ⋅ Tianyu Ding ⋅ Faezeh Amjadi ⋅ Ilya Zharkov
ExHall D Poster #80
SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation Poster Session 3
Dekai Zhu ⋅ Yan Di ⋅ Stefan Gavranovic ⋅ Slobodan Ilic
ExHall D Poster #111
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method Poster Session 3
Xinshuai Song ⋅ Weixing Chen ⋅ Yang Liu ⋅ Weikai Chen ⋅ Guanbin Li ⋅ Liang Lin
ExHall D Poster #138
BimArt: A Unified Approach for the Synthesis of 3D Bimanual Interaction with Articulated Objects Poster Session 6
Wanyue Zhang ⋅ Rishabh Dabral ⋅ Vladislav Golyanik ⋅ Vasileios Choutas ⋅ Eduardo Alvarado ⋅ Thabo Beeler ⋅ Marc Habermann ⋅ Christian Theobalt
ExHall D Poster #146
Open-World Amodal Appearance Completion Poster Session 2
Jiayang Ao ⋅ Yanbei Jiang ⋅ Qiuhong Ke ⋅ Krista A. Ehinger
ExHall D Poster #106
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models Poster Session 6
Yiqi Zhu ⋅ Ziyue Wang ⋅ Can Zhang ⋅ Peng Li ⋅ Yang Liu
ExHall D Poster #326
Detecting Adversarial Data Using Perturbation Forgery Poster Session 3
Qian Wang ⋅ Chen Li ⋅ Yuchen Luo ⋅ Hefei Ling ⋅ Shijuan Huang ⋅ Ruoxi Jia ⋅ Ning Yu
ExHall D Poster #312
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration Poster Session 3
Haina Qin ⋅ Wenyang Luo ⋅ Zewen Chen ⋅ Yufan Liu ⋅ Bing Li ⋅ Weiming Hu ⋅ Libin Wang ⋅ DanDan Zheng ⋅ Yuming Li
ExHall D Poster #202
Insightful Instance Features for 3D Instance Segmentation Poster Session 3
Wonseok Roh ⋅ Hwanhee Jung ⋅ Giljoo Nam ⋅ Dong In Lee ⋅ Hyeongcheol Park ⋅ Sang Ho Yoon ⋅ Jungseock Joo ⋅ Sangpil Kim
ExHall D Poster #326
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene Poster Session 5
Shengqiong Wu ⋅ Hao Fei ⋅ Jingkang Yang ⋅ Xiangtai Li ⋅ Juncheng Li ⋅ Hanwang Zhang ⋅ Tat-seng Chua
ExHall D Poster #335
Knowledge Bridger: Towards Training-Free Missing Modality Completion Poster Session 5
Guanzhou Ke ⋅ Shengfeng He ⋅ Xiao-Li Wang ⋅ Bo Wang ⋅ Guoqing Chao ⋅ Yuanyang Zhang ⋅ Yi Xie ⋅ HeXing Su
ExHall D Poster #464
EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing Poster Session 4
Gaoxiang Cong ⋅ Jiadong Pan ⋅ Liang Li ⋅ Yuankai Qi ⋅ Yuxin Peng ⋅ Anton van den Hengel ⋅ Jian Yang ⋅ Qingming Huang
ExHall D Poster #1
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos Poster Session 1
Wenbo Hu ⋅ Xiangjun Gao ⋅ Xiaoyu Li ⋅ Sijie Zhao ⋅ Xiaodong Cun ⋅ Yong Zhang ⋅ Long Quan ⋅ Ying Shan
ExHall D Poster #171
TexGarment: Consistent Garment UV Texture Generation via Efficient 3D Structure-Guided Diffusion Transformer Poster Session 6
Jialun Liu ⋅ Jinbo Wu ⋅ Xiaobo Gao ⋅ JiaKui Hu ⋅ Bojun Xiong ⋅ Xing Liu ⋅ Chen Zhao ⋅ Hongbin Pei ⋅ Haocheng Feng ⋅ Yingying Li ⋅ Errui Ding ⋅ Jingdong Wang
ExHall D Poster #39
A Hubness Perspective on Representation Learning for Graph-Based Multi-View Clustering Poster Session 3
Zheming Xu ⋅ He Liu ⋅ Congyan Lang ⋅ Tao Wang ⋅ Yidong Li ⋅ Michael C. Kampffmeyer
ExHall D Poster #467
Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining Poster Session 6
Shangquan Sun ⋅ Wenqi Ren ⋅ Juxiang Zhou ⋅ Shu Wang ⋅ Jianhou Gan ⋅ Xiaochun Cao
ExHall D Poster #188
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector Poster Session 1
Xiao Guo ⋅ Xiufeng Song ⋅ Yue Zhang ⋅ Xiaohong Liu ⋅ Xiaoming Liu
ExHall D Poster #381
VSNet: Focusing on the Linguistic Characteristics of Sign Language Poster Session 5
Yuhao Li ⋅ Xinyue Chen ⋅ Hongkai Li ⋅ Xiaorong Pu ⋅ Peng Jin ⋅ Yazhou Ren
ExHall D Poster #315
Active Hyperspectral Imaging Using an Event Camera Poster Session 1
Bohan Yu ⋅ Jinxiu Liang ⋅ Zhuofeng Wang ⋅ Bin Fan ⋅ Art Subpaasa ⋅ Boxin Shi ⋅ Imari Sato
ExHall D Poster #71
Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression Poster Session 1
Lucas Relic ⋅ Roberto Azevedo ⋅ Yang Zhang ⋅ Markus Gross ⋅ Christopher Schroers
ExHall D Poster #217
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Poster Session 4
Yuqian Yuan ⋅ Hang Zhang ⋅ Wentong Li ⋅ Zesen Cheng ⋅ Boqiang Zhang ⋅ Long Li ⋅ Xin Li ⋅ Deli Zhao ⋅ Wenqiao Zhang ⋅ Yueting Zhuang ⋅ Jianke Zhu ⋅ Lidong Bing
ExHall D Poster #303
Multi-modal Medical Diagnosis via Large-small Model Collaboration Poster Session 6
Wanyi Chen ⋅ Zihua Zhao ⋅ Jiangchao Yao ⋅ Ya Zhang ⋅ Jiajun Bu ⋅ Haishuai Wang
ExHall D Poster #442
SAMBLE: Shape-Specific Point Cloud Sampling for an Optimal Trade-Off Between Local Detail and Global Uniformity Poster Session 1
Chengzhi Wu ⋅ Yuxin Wan ⋅ Hao Fu ⋅ Julius Pfrommer ⋅ Zeyun Zhong ⋅ Junwei Zheng ⋅ Jiaming Zhang ⋅ Jürgen Beyerer
ExHall D Poster #109
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling Poster Session 3
Qi Zhu ⋅ Jiangwei Lao ⋅ Deyi Ji ⋅ Junwei Luo ⋅ Kang Wu ⋅ Yingying Zhang ⋅ Lixiang Ru ⋅ Jian Wang ⋅ Jingdong Chen ⋅ Ming Yang ⋅ Dong Liu ⋅ Feng Zhao
ExHall D Poster #391
AdaDARE-gamma: Balancing Stability and Plasticity in Multi-modal LLMs through Efficient Adaptation Poster Session 4
Jingyi Xie ⋅ Jintao Yang ⋅ Zhunchen Luo ⋅ Yunbo Cao ⋅ Qiang Gao ⋅ Mengyuan Zhang ⋅ Wenpeng Hu
ExHall D Poster #377
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis Poster Session 3
Hanlin Wang ⋅ Hao Ouyang ⋅ Qiuyu Wang ⋅ Wen Wang ⋅ Ka Leong Cheng ⋅ Qifeng Chen ⋅ Yujun Shen ⋅ Limin Wang
ExHall D Poster #175
ABC-Former: Auxiliary Bimodal Cross-domain Transformer with Interactive Channel Attention for White Balance Poster Session 5
Yu-Cheng Chiu ⋅ Guan-Rong Chen ⋅ Zihao Chen ⋅ Yan-Tsung Peng
ExHall D Poster #20
Fingerprinting Denoising Diffusion Probabilistic Models Poster Session 6
Huan Teng ⋅ Yuhui Quan ⋅ Chengyu Wang ⋅ Jun Huang ⋅ Hui Ji
ExHall D Poster #252
InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception Poster Session 3
Haijie Li ⋅ Yanmin Wu ⋅ Jiarui Meng ⋅ Qiankun Gao ⋅ Zhiyao Zhang ⋅ Ronggang Wang ⋅ Jian Zhang
ExHall D Poster #328
CSC-PA: Cross-image Semantic Correlation via Prototype Attentions for Single-network Semi-supervised Breast Tumor Segmentation Poster Session 3
Zhenhui Ding ⋅ Guilian Chen ⋅ Qin Zhang ⋅ Huisi Wu ⋅ Jing Qin
ExHall D Poster #477
BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence Poster Session 2
Xuewu Lin ⋅ Tianwei Lin ⋅ Alan Huang ⋅ Hongyu Xie ⋅ Zhizhong Su
ExHall D Poster #348
Query Efficient Black-Box Visual Prompting with Subspace Learning Poster Session 1
Haozhen Zhang ⋅ Zhaogeng Liu ⋅ Hualin Zhang ⋅ Xingchen Li ⋅ Wanli Shi ⋅ Bin Gu ⋅ Yi Chang
ExHall D Poster #399
Directional Label Diffusion Model for Learning from Noisy Labels Poster Session 5
Senyu Hou ⋅ Gaoxia Jiang ⋅ Jia Zhang ⋅ Shangrong Yang ⋅ Husheng Guo ⋅ Yaqing Guo ⋅ Wenjian Wang
ExHall D Poster #450
AA-CLIP: Enhancing Zero-Shot Anomaly Detection via Anomaly-Aware CLIP Poster Session 1
Wenxin Ma ⋅ Xu Zhang ⋅ Qingsong Yao ⋅ Fenghe Tang ⋅ Chenxu Wu ⋅ Yingtai Li ⋅ Rui Yan ⋅ Zihang Jiang ⋅ S Kevin Zhou
ExHall D Poster #438
HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting Poster Session 1
Jingyu Lin ⋅ Jiaqi Gu ⋅ Lubin Fan ⋅ Bojian Wu ⋅ Yujing Lou ⋅ Renjie Chen ⋅ Ligang Liu ⋅ Jieping Ye
ExHall D Poster #58
Keyframe-Guided Creative Video Inpainting Poster Session 3
Yuwei Guo ⋅ Ceyuan Yang ⋅ Anyi Rao ⋅ Chenlin Meng ⋅ Omer Bar-Tal ⋅ Shuangrui Ding ⋅ Maneesh Agrawala ⋅ Dahua Lin ⋅ Bo Dai
ExHall D Poster #225
Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention Poster Session 5
Kyungmin Jo ⋅ Jooyeol Yun ⋅ Jaegul Choo
ExHall D Poster #244
Noise Modeling in One Hour: Minimizing Preparation Efforts for Self-supervised Low-Light RAW Image Denoising Poster Session 2
Feiran Li ⋅ Haiyang Jiang ⋅ Daisuke Iso
ExHall D Poster #24
SfM-Free 3D Gaussian Splatting via Hierarchical Training Poster Session 5
Bo Ji ⋅ Angela Yao
ExHall D Poster #57
Heterogeneous Skeleton-Based Action Representation Learning Poster Session 4
Xiaoyan Ma ⋅ Jidong Kuang ⋅ Hongsong Wang ⋅ Jie Gui
ExHall D Poster #320
Point Cloud Upsampling Using Conditional Diffusion Module with Adaptive Noise Suppression Poster Session 4
Boqian Zhang ⋅ Shen Yang ⋅ Hao Chen ⋅ Chao Yang ⋅ Jing Jia ⋅ Guang Jiang
ExHall D Poster #112
Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation Poster Session 6
Ting Liu ⋅ Siyuan Li
ExHall D Poster #333
MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining Poster Session 2
Yunze Liu ⋅ Li Yi
ExHall D Poster #410
Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation Poster Session 2
Edward Loo ⋅ Jiacheng Deng
ExHall D Poster #337
Style Quantization for Data-Efficient GAN Training Poster Session 2
Jian Wang ⋅ Xin Lan ⋅ Ji-Zhe Zhou ⋅ Yuxin Tian ⋅ Jiancheng Lv
ExHall D Poster #223
Learning Physics From Video: Unsupervised Physical Parameter Estimation for Continuous Dynamical Systems Poster Session 6
Alejandro Castañeda Garcia ⋅ Jan Warchocki ⋅ Jan van Gemert ⋅ Daan Brinks ⋅ Nergis Tomen
ExHall D Poster #167
DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image Poster Session 2
Hyeongjin Nam ⋅ Donghwan Kim ⋅ Jeongtaek Oh ⋅ Kyoung Mu Lee
ExHall D Poster #18
DepthCues: Evaluating Monocular Depth Perception in Large Vision Models Poster Session 4
Duolikun Danier ⋅ Mehmet Aygun ⋅ Changjian Li ⋅ Hakan Bilen ⋅ Oisin Mac Aodha
ExHall D Poster #405
TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection Poster Session 5
Yoon Gyo Jung ⋅ Jaewoo Park ⋅ Jaeho Yoon ⋅ Kuan-Chuan Peng ⋅ Wonchul Kim ⋅ Andrew Beng Jin Teoh ⋅ Octavia Camps
ExHall D Poster #430
Beyond Local Sharpness: Communication-Efficient Global Sharpness-aware Minimization for Federated Learning Poster Session 5
Debora Caldarola ⋅ Pietro Cagnasso ⋅ Barbara Caputo ⋅ Marco Ciccone
ExHall D Poster #396
F^3OCUS - Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics Poster Session 4
Pramit Saha ⋅ Felix Wagner ⋅ Divyanshu Mishra ⋅ Can Peng ⋅ Anshul Thakur ⋅ David A. Clifton ⋅ Konstantinos Kamnitsas ⋅ Alison Noble
ExHall D Poster #401
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval Poster Session 4
Eric Xing ⋅ Pranavi Kolouju ⋅ Robert Pless ⋅ Abby Stylianou ⋅ Nathan Jacobs
ExHall D Poster #365
Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps Poster Session 2
Jeeyung Kim ⋅ Erfan Esmaeili Fakhabi ⋅ Qiang Qiu
ExHall D Poster #254
Navigating Image Restoration with VAR’s Distribution Alignment Prior Poster Session 2
Siyang Wang ⋅ Naishan Zheng ⋅ Jie Huang ⋅ Feng Zhao
ExHall D Poster #209
ReRAW: RGB-to-RAW Image Reconstruction via Stratified Sampling for Efficient Object Detection on the Edge Poster Session 3
Radu Berdan ⋅ Beril Besbinar ⋅ Christoph Reinders ⋅ Junji Otsuka ⋅ Daisuke Iso
ExHall D Poster #115
Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent Poster Session 2
Philip Doldo ⋅ Derek Everett ⋅ Amol Khanna ⋅ Andre T Nguyen ⋅ Edward Raff
ExHall D Poster #95
LoKi: Low-dimensional KAN for Efficient Fine-tuning Image Models Poster Session 3
Xuan Cai ⋅ Renjie Pan ⋅ Hua Yang
ExHall D Poster #403
DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric Videos Poster Session 1
Lorenzo Mur-Labadia ⋅ Jose J. Guerrero ⋅ Ruben Martinez-Cantin
ExHall D Poster #315
Fortifying Federated Learning Towards Trustworthiness via Auditable Data Valuation and Verifiable Client Contribution Poster Session 1
Naveen Kumar Kummari ⋅ Ranjeet Ranjan Jha ⋅ Krishna Mohan Chalavadi ⋅ Ravindra Babu Tallamraju
ExHall D Poster #462
Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices Poster Session 1
Junyan Lin ⋅ Haoran Chen ⋅ Yue Fan ⋅ Yingqi Fan ⋅ Xin Jin ⋅ Hui Su ⋅ Jinlan Fu ⋅ Xiaoyu Shen
ExHall D Poster #380
AdMiT: Adaptive Multi-Source Tuning in Dynamic Environments Poster Session 4
Xiangyu Chang ⋅ Fahim Faisal Niloy ⋅ Sk Miraj Ahmed ⋅ Srikanth Krishnamurthy ⋅ Basak Guler ⋅ Ananthram Swami ⋅ Samet Oymak ⋅ Amit K. Roy-Chowdhury
ExHall D Poster #453
Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering Poster Session 4
Wenlong Fang ⋅ Qiaofeng Wu ⋅ Jing Chen ⋅ Yun Xue
ExHall D Poster #361
PLeaS - Merging Models with Permutations and Least Squares Poster Session 6
Anshul Nasery ⋅ Jonathan Hayase ⋅ Pang Wei Koh ⋅ Sewoong Oh
ExHall D Poster #416
Context-Enhanced Memory-Refined Transformer for Online Action Detection Poster Session 2
Zhanzhong Pang ⋅ Fadime Sener ⋅ Angela Yao
ExHall D Poster #318
Multi-modal Knowledge Distillation-based Human Trajectory Forecasting Poster Session 5
Jaewoo Jeong ⋅ Seohee Lee ⋅ Daehee Park ⋅ Giwon Lee ⋅ Kuk-Jin Yoon
ExHall D Poster #306
HORP: Human-Object Relation Priors Guided HOI Detection Poster Session 5
Pei Geng ⋅ Jian Yang ⋅ Shanshan Zhang
ExHall D Poster #409
InteractVLM: 3D Interaction Reasoning from 2D Foundational Models Poster Session 5
Sai Kumar Dwivedi ⋅ Dimitrije Antić ⋅ Shashank Tripathi ⋅ Omid Taheri ⋅ Cordelia Schmid ⋅ Michael J. Black ⋅ Dimitrios Tzionas
ExHall D Poster #147
Golden Cudgel Network for Real-Time Semantic Segmentation Poster Session 5
Guoyu Yang ⋅ Yuan Wang ⋅ Daming Shi ⋅ Yanzhong Wang
ExHall D Poster #413
Viewpoint Rosetta Stone: Unlocking Unpaired Ego-Exo Videos for View-invariant Representation Learning Poster Session 4
Mi Luo ⋅ Zihui Xue ⋅ Alex Dimakis ⋅ Kristen Grauman
ExHall D Poster #88
Black Hole-Driven Identity Absorbing in Diffusion Models Poster Session 6
Muhammad Shaheryar ⋅ Jong Taek Lee ⋅ Soon Ki Jung
ExHall D Poster #227
Toward Robust Neural Reconstruction from Sparse Point Sets Poster Session 2
Amine Ouasfi ⋅ Shubhendu Jena ⋅ Eric Marchand ⋅ Adnane Boukhayma
ExHall D Poster #112
DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion Poster Session 2
Qitao Zhao ⋅ Amy Lin ⋅ Jeff Tan ⋅ Jason Y. Zhang ⋅ Deva Ramanan ⋅ Shubham Tulsiani
ExHall D Poster #87
Gyro-based Neural Single Image Deblurring Poster Session 5
Heemin Yang ⋅ Jaesung Rim ⋅ Seungyong Lee ⋅ Seung-Hwan Baek ⋅ Sunghyun Cho
ExHall D Poster #195
HSI: A Holistic Style Injector for Arbitrary Style Transfer Poster Session 5
Shuhao Zhang ⋅ Hui Kang ⋅ Yang Liu ⋅ Fang Mei ⋅ Hongjuan Li
ExHall D Poster #227
Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction Poster Session 5
Huiwon Jang ⋅ Sihyun Yu ⋅ Jinwoo Shin ⋅ Pieter Abbeel ⋅ Younggyo Seo
ExHall D Poster #171
Reconstructing Animals and the Wild Poster Session 4
Peter Kulits ⋅ Michael J. Black ⋅ Silvia Zuffi
ExHall D Poster #70
NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting Poster Session 6
Yulong Zheng ⋅ Zicheng Jiang ⋅ Shengfeng He ⋅ Yandu Sun ⋅ Junyu Dong ⋅ Huaidong Zhang ⋅ Yong Du
ExHall D Poster #63
Decentralized Diffusion Models Poster Session 5
David McAllister ⋅ Matthew Tancik ⋅ Jiaming Song ⋅ Angjoo Kanazawa
ExHall D Poster #217
CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation Poster Session 5
Yuxing Long ⋅ Jiyao Zhang ⋅ Mingjie Pan ⋅ Tianshu Wu ⋅ Taewhan Kim ⋅ Hao Dong
ExHall D Poster #146
Forensic Self-Descriptions Are All You Need for Zero-Shot Detection, Open-Set Source Attribution, and Clustering of AI-generated Images Poster Session 1
Tai Nguyen ⋅ Aref Azizpour ⋅ Matthew Stamm
ExHall D Poster #275
CARL: A Framework for Equivariant Image Registration Poster Session 5
Hastings Greer ⋅ Lin Tian ⋅ François-Xavier Vialard ⋅ Roland Kwitt ⋅ Raúl San José Estépar ⋅ Marc Niethammer
ExHall D Poster #478
Autoregressive Distillation of Diffusion Transformers Poster Session 4
Yeongmin Kim ⋅ Sotiris Anagnostidis ⋅ Yuming Du ⋅ Edgar Schoenfeld ⋅ Jonas Kohler ⋅ Markos Georgopoulos ⋅ Albert Pumarola ⋅ Ali Thabet ⋅ Artsiom Sanakoyeu
ExHall D Poster #230
Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries Poster Session 1
Wei Xu ⋅ Charlie Wagner ⋅ Junjie Luo ⋅ Qi Guo
ExHall D Poster #25
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute Poster Session 6
Sotiris Anagnostidis ⋅ Gregor Bachmann ⋅ Yeongmin Kim ⋅ Jonas Kohler ⋅ Markos Georgopoulos ⋅ Artsiom Sanakoyeu ⋅ Yuming Du ⋅ Albert Pumarola ⋅ Ali Thabet ⋅ Edgar Schoenfeld
ExHall D Poster #205
CrossSDF: 3D Reconstruction of Thin Structures From Cross-Sections Poster Session 6
Thomas Walker ⋅ Salvatore Esposito ⋅ Daniel Rebain ⋅ Amir Vaxman ⋅ Arno Onken ⋅ Changjian Li ⋅ Oisin Mac Aodha
ExHall D Poster #457
POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation Poster Session 3
Jian Wang ⋅ Tianhong Dai ⋅ Bingfeng Zhang ⋅ Siyue Yu ⋅ ENG GEE LIM ⋅ Jimin Xiao
ExHall D Poster #422
Perceptual Inductive Bias Is What You Need Before Contrastive Learning Poster Session 2
Junru Zhao ⋅ Tianqin Li ⋅ Dunhan Jiang ⋅ Shenghao Wu ⋅ Alan Ramirez ⋅ Tai Sing Lee
ExHall D Poster #405
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping Poster Session 2
Aashish Rai ⋅ Dilin Wang ⋅ Mihir Jain ⋅ Nikolaos Sarafianos ⋅ Kefan Chen ⋅ Srinath Sridhar ⋅ Aayush Prakash
ExHall D Poster #46
Z-Magic: Zero-shot Multiple Attributes Guided Image Creator Poster Session 4
Yingying Deng ⋅ Xiangyu He ⋅ Fan Tang ⋅ Weiming Dong
ExHall D Poster #247
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles Poster Session 1
Rui Zhao ⋅ Weijia Mao ⋅ Mike Zheng Shou
ExHall D Poster #256
Link-based Contrastive Learning for One-Shot Unsupervised Domain Adaptation Poster Session 1
Yue Zhang ⋅ Mingyue Bin ⋅ Yuyang Zhang ⋅ Zhongyuan Wang ⋅ Zhen Han ⋅ Chao Liang
ExHall D Poster #454
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes Poster Session 6
Yuji Wang ⋅ Haoran Xu ⋅ Yong Liu ⋅ Jiaze Li ⋅ Yansong Tang
ExHall D Poster #264
MC^2: Multi-concept Guidance for Customized Multi-concept Generation Poster Session 1
Jiaxiu Jiang ⋅ Yabo Zhang ⋅ Kailai Feng ⋅ Xiaohe Wu ⋅ Wenbo Li ⋅ Renjing Pei ⋅ Fan Li ⋅ Wangmeng Zuo
ExHall D Poster #253
SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation Poster Session 4
Leigang Qu ⋅ Haochuan Li ⋅ Wenjie Wang ⋅ Xiang Liu ⋅ Juncheng Li ⋅ Liqiang Nie ⋅ Tat-seng Chua
ExHall D Poster #260
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning Poster Session 3
Xiao-Hui Li ⋅ Fei Yin ⋅ Cheng-Lin Liu
ExHall D Poster #418
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete Poster Session 1
Yuheng Ji ⋅ Huajie Tan ⋅ Jiayu Shi ⋅ Xiaoshuai Hao ⋅ Yuan Zhang ⋅ Hengyuan Zhang ⋅ Pengwei Wang ⋅ Mengdi Zhao ⋅ Yao Mu ⋅ Pengju An ⋅ Xinda Xue ⋅ Qinghang Su ⋅ Huaihai Lyu ⋅ Xiaolong Zheng ⋅ Jiaming Liu ⋅ Zhongyuan Wang ⋅ Shanghang Zhang
ExHall D Poster #145
RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories Poster Session 4
Huiyang Shao ⋅ Xin Xia ⋅ Yuhong Yang ⋅ Ren Yuxi ⋅ XING WANG ⋅ Xuefeng Xiao
ExHall D Poster #220
Generative Sparse-View Gaussian Splatting Poster Session 6
Hanyang Kong ⋅ Xingyi Yang ⋅ Xinchao Wang
ExHall D Poster #58
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting Poster Session 5
Jingyi Xu ⋅ Xieyuanli Chen ⋅ Junyi Ma ⋅ Jiawei Huang ⋅ Jintao Xu ⋅ Yue Wang ⋅ Ling Pei
ExHall D Poster #123
Diffusion Model is Effectively Its Own Teacher Poster Session 3
Xinyin Ma ⋅ Runpeng Yu ⋅ Songhua Liu ⋅ Gongfan Fang ⋅ Xinchao Wang
ExHall D Poster #215
AVQACL: A Novel Benchmark for Audio-Visual Question Answering Continual Learning Poster Session 1
Kaixuan Wu ⋅ Xinde Li ⋅ Xinglin Li ⋅ Chuanfei Hu ⋅ Guoliang Wu
ExHall D Poster #295
Science-T2I: Addressing Scientific Illusions in Image Synthesis Poster Session 1
Jialuo Li ⋅ Wenhao Chai ⋅ XINGYU FU ⋅ Haiyang Xu ⋅ Saining Xie
ExHall D Poster #247
EASEMVC: Efficient Dual Selection Mechanism for Deep Multi-View Clustering Poster Session 4
Baili Xiao ⋅ Zhibin Dong ⋅ KE LIANG ⋅ Suyuan Liu ⋅ Siwei Wang ⋅ Tianrui Liu ⋅ Xingchen Hu ⋅ En Zhu ⋅ Xinwang Liu
ExHall D Poster #467
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer Poster Session 4
ruojun xu ⋅ Weijie Xi ⋅ Xiaodi Wang ⋅ Yongbo Mao ⋅ Zach Cheng
ExHall D Poster #235
SoftShadow: Leveraging Soft Masks for Penumbra-Aware Shadow Removal Poster Session 5
Xinrui Wang ⋅ Lanqing Guo ⋅ Xiyu Wang ⋅ Siyu Huang ⋅ Bihan Wen
ExHall D Poster #206
Hearing Anywhere in Any Environment Poster Session 2
Xiulong Liu ⋅ Anurag Kumar ⋅ Paul Calamia ⋅ Sebastia Vicenc Amengual Gari ⋅ Calvin Murdock ⋅ Ishwarya Ananthabhotla ⋅ Philip W Robinson ⋅ Eli Shlizerman ⋅ Vamsi Krishna Ithapu ⋅ Ruohan Gao
ExHall D Poster #27
ProjAttacker: A Configurable Physical Adversarial Attack for Face Recognition via Projector Poster Session 5
Yuanwei Liu ⋅ Hui Wei ⋅ Chengyu Jia ⋅ Ruqi Xiao ⋅ Weijian Ruan ⋅ Xingxing Wei ⋅ Joey Tianyi Zhou ⋅ Zheng Wang
ExHall D Poster #19
Reasoning to Attend: Try to Understand How <SEG> Token Works Poster Session 5
Rui Qian ⋅ Xin Yin ⋅ Dejing Dou
ExHall D Poster #353
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines Poster Session 6
Chen Tang ⋅ Xinzhu Ma ⋅ Encheng Su ⋅ Xiufeng Song ⋅ Xiaohong Liu ⋅ Wei-Hong Li ⋅ Lei Bai ⋅ Wanli Ouyang ⋅ Xiangyu Yue
ExHall D Poster #293
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations Poster Session 1
Xunzhi Zheng ⋅ Dan Xu
ExHall D Poster #77
AMR-Transformer: Enabling Efficient Long-range Interaction for Complex Neural Fluid Simulation Poster Session 2
Zeyi Xu ⋅ Jinfan Liu ⋅ Kuangxu Chen ⋅ Ye Chen ⋅ Zhangli Hu ⋅ Bingbing Ni
ExHall D Poster #34
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Poster Session 2
Wenyi Hong ⋅ Yean Cheng ⋅ Zhuoyi Yang ⋅ Weihan Wang ⋅ Lefan Wang ⋅ Xiaotao Gu ⋅ Shiyu Huang ⋅ Yuxiao Dong ⋅ Jie Tang
ExHall D Poster #294
Image Referenced Sketch Colorization Based on Animation Creation Workflow Poster Session 5
Dingkun Yan ⋅ Xinrui Wang ⋅ Zhuoru Li ⋅ Suguru Saito ⋅ Yusuke Iwasawa ⋅ Yutaka Matsuo ⋅ Jiaxian Guo
ExHall D Poster #223
The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation Poster Session 1
Yanis Benidir ⋅ Nicolas Gonthier ⋅ Clement Mallet
ExHall D Poster #191
RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments Poster Session 6
Haisheng Su ⋅ Feixiang Song ⋅ CONG MA ⋅ Wei Wu ⋅ Junchi Yan
ExHall D Poster #123
Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment Poster Session 6
Chen Liu ⋅ Peike Li ⋅ Liying Yang ⋅ Dadong Wang ⋅ Lincheng Li ⋅ Xin Yu
ExHall D Poster #262
OmniStyle: Filtering High Quality Style Transfer Data at Scale Poster Session 2
Ye Wang ⋅ Ruiqi Liu ⋅ Jiang Lin ⋅ Fei Liu ⋅ Zili Yi ⋅ Yilin Wang ⋅ Rui Ma
ExHall D Poster #237
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text Poster Session 1
Guotao liang ⋅ Baoquan Zhang ⋅ Zhiyuan Wen ⋅ Junteng Zhao ⋅ Yunming Ye ⋅ Guangming Ye ⋅ Yao He
ExHall D Poster #371
StyleMaster: Stylize Your Video with Artistic Generation and Translation Poster Session 1
Zixuan Ye ⋅ Huijuan Huang ⋅ Xintao Wang ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Wenhan Luo
ExHall D Poster #236
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Poster Session 2
Yilun Zhao ⋅ Lujing Xie ⋅ Haowei Zhang ⋅ Guo Gan ⋅ Weiyuan Chen ⋅ Yitao Long ⋅ Tongyan Hu ⋅ Zhijian Xu ⋅ Chengye Wang ⋅ Chuhan Li ⋅ Ziyao Shangguan ⋅ Yixin Liu ⋅ Zhenwen Liang ⋅ Zhiyuan Hu ⋅ Chen Zhao ⋅ Arman Cohan
ExHall D Poster #296
Learning Visual Composition through Improved Semantic Guidance Poster Session 1
Austin Stone ⋅ Hagen Soltau ⋅ Robert Geirhos ⋅ Xi Yi ⋅ Ye Xia ⋅ Bingyi Cao ⋅ Kaifeng Chen ⋅ Abhijit Ogale ⋅ Jonathon Shlens
ExHall D Poster #340
High Dynamic Range Video Compression: A Large-Scale Benchmark Dataset and A Learned Bit-depth Scalable Compression Algorithm Poster Session 2
Zhaoyi Tian ⋅ Feifeng Wang ⋅ Shiwei Wang ⋅ Zihao Zhou ⋅ Yao Zhu ⋅ Liquan Shen
ExHall D Poster #187
Adaptive Parameter Selection for Tuning Vision-Language Models Poster Session 1
Yi Zhang ⋅ Yi-Xuan Deng ⋅ Meng-Hao Guo ⋅ Shi-Min Hu
ExHall D Poster #392
DL2G: Degradation-guided Local-to-Global Restoration for Eyeglass Reflection Removal Poster Session 4
Yizhilv ⋅ Xiao Lu ⋅ Hong Ding ⋅ Jingbo Hu ⋅ Zhi Jiang ⋅ Chunxia Xiao
ExHall D Poster #19
UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping Poster Session 3
Wenbo Wang ⋅ Fangyun Wei ⋅ Lei Zhou ⋅ Xi Chen ⋅ Lin Luo ⋅ Xiaohan Yi ⋅ Yizhong Zhang ⋅ Yaobo Liang ⋅ Chang Xu ⋅ Yan Lu ⋅ Jiaolong Yang ⋅ Baining Guo
ExHall D Poster #149
RestorGS: Depth-aware Gaussian Splatting for Efficient 3D Scene Restoration Poster Session 3
Yuanjian Qiao ⋅ Mingwen Shao ⋅ Lingzhuang Meng ⋅ Kai Xu
ExHall D Poster #50
ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning Poster Session 2
Kailin Li ⋅ Puhao Li ⋅ Tengyu Liu ⋅ Yuyang Li ⋅ Siyuan Huang
ExHall D Poster #155
Enhancing Testing-Time Robustness for Trusted Multi-View Classification in the Wild Poster Session 3
Wei Liu ⋅ Yufei Chen ⋅ Xiaodong Yue
ExHall D Poster #465
Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition Poster Session 4
Zhiyuan Chen ⋅ Keyi Li ⋅ Yifan Jia ⋅ Le Ye ⋅ Yufei Ma
ExHall D Poster #210
GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill Poster Session 4
Jieming Cui ⋅ Tengyu Liu ⋅ Ziyu Meng ⋅ Jiale Yu ⋅ Ran Song ⋅ Wei Zhang ⋅ Yixin Zhu ⋅ Siyuan Huang
ExHall D Poster #145
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting Poster Session 2
Yue Chen ⋅ Xingyu Chen ⋅ Anpei Chen ⋅ Gerard Pons-Moll ⋅ Yuliang Xiu
ExHall D Poster #93
STEPS: Sequential Probability Tensor Estimation for Text-to-Image Hard Prompt Search Poster Session 6
Yuning Qiu ⋅ Andong Wang ⋅ Chao Li ⋅ Haonan Huang ⋅ Guoxu Zhou ⋅ Qibin Zhao
ExHall D Poster #236
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Poster Session 3
Shenghai Yuan ⋅ Jinfa Huang ⋅ Xianyi He ⋅ Yunyang Ge ⋅ Yujun Shi ⋅ Liuhan Chen ⋅ Jiebo Luo ⋅ Li Yuan
ExHall D Poster #222
Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes Poster Session 1
Yiming Dou ⋅ Wonseok Oh ⋅ Yuqing Luo ⋅ Antonio Loquercio ⋅ Andrew Owens
ExHall D Poster #151
Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation Poster Session 3
Shivam Duggal ⋅ Yushi Hu ⋅ Oscar Michel ⋅ Aniruddha Kembhavi ⋅ William Freeman ⋅ Noah A. Smith ⋅ Ranjay Krishna ⋅ Antonio Torralba ⋅ Ali Farhadi ⋅ Wei-Chiu Ma
ExHall D Poster #255
PromptHMR: Promptable Human Mesh Recovery Poster Session 1
Yufu Wang ⋅ Yu Sun ⋅ Priyanka Patel ⋅ Kostas Daniilidis ⋅ Michael J. Black ⋅ Muhammed Kocabas
ExHall D Poster #91
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness Poster Session 4
Tianyu Yu ⋅ Haoye Zhang ⋅ Qiming Li ⋅ Qixin Xu ⋅ Yuan Yao ⋅ Da Chen ⋅ Xiaoman Lu ⋅ Ganqu Cui ⋅ Yunkai Dang ⋅ Taiwen He ⋅ Xiaocheng Feng ⋅ Jun Song ⋅ Bo Zheng ⋅ Zhiyuan Liu ⋅ Tat-seng Chua ⋅ Maosong Sun
ExHall D Poster #399
RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing Poster Session 1
Zhipeng Huang ⋅ Wangbo Yu ⋅ Xinhua Cheng ⋅ ChengShu Zhao ⋅ Yunyang Ge ⋅ Mingyi Guo ⋅ Li Yuan ⋅ Yonghong Tian
ExHall D Poster #38
Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians Poster Session 6
Changfeng Ma ⋅ Ran Bi ⋅ Jie Guo ⋅ Chongjun Wang ⋅ Yanwen Guo
ExHall D Poster #108
MBQ: Modality-Balanced Quantization for Large Vision-Language Models Poster Session 1
Shiyao Li ⋅ Yingchun Hu ⋅ Xuefei Ning ⋅ Xihui Liu ⋅ Ke Hong ⋅ xiaotao jia ⋅ Xiuhong Li ⋅ Yaqi Yan ⋅ PEI RAN ⋅ Guohao Dai ⋅ Shengen Yan ⋅ Huazhong Yang ⋅ Yu Wang
ExHall D Poster #382
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction Poster Session 6
Zijian He ⋅ Yuwei Ning ⋅ Yipeng Qin ⋅ Guangrun Wang ⋅ Sibei Yang ⋅ Liang Lin ⋅ Guanbin Li
ExHall D Poster #19
Associative Transformer Poster Session 1
Yuwei Sun ⋅ Hideya Ochiai ⋅ Zhirong Wu ⋅ Stephen Lin ⋅ Ryota Kanai
ExHall D Poster #417
USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting Poster Session 4
Kang Chen ⋅ Jiyuan Zhang ⋅ Zecheng Hao ⋅ Yajing Zheng ⋅ Tiejun Huang ⋅ Zhaofei Yu
ExHall D Poster #74
SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction Poster Session 2
Xinran Yang ⋅ Donghao Ji ⋅ Yuanqi Li ⋅ Jie Guo ⋅ Yanwen Guo ⋅ Junyuan Xie
ExHall D Poster #33
EdgeMovingNet: Edge-preserving Point Cloud Reconstruction via Joint Geometry Features Poster Session 5
Xinran Yang ⋅ Donghao Ji ⋅ Yuanqi Li ⋅ Junyuan Xie ⋅ Jie Guo ⋅ Yanwen Guo
ExHall D Poster #105
VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding Poster Session 1
Kangsan Kim ⋅ Geon Park ⋅ Youngwan Lee ⋅ Woongyeong Yeo ⋅ Sung Ju Hwang
ExHall D Poster #299
Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos Poster Session 5
Chiara Plizzari ⋅ Alessio Tonioni ⋅ Yongqin Xian ⋅ Achin Kulshrestha ⋅ Federico Tombari
ExHall D Poster #297
MambaIC: State Space Models for High-Performance Learned Image Compression Poster Session 4
Fanhu Zeng ⋅ Hao Tang ⋅ Yihua Shao ⋅ Siyu Chen ⋅ Ling Shao ⋅ Yan Wang
ExHall D Poster #213
Six-CD: Benchmarking Concept Removals for Text-to-image Diffusion Models Poster Session 6
Jie Ren ⋅ Kangrui Chen ⋅ Yingqian Cui ⋅ Shenglai Zeng ⋅ Hui Liu ⋅ Yue Xing ⋅ Jiliang Tang ⋅ Lingjuan Lyu
ExHall D Poster #248
Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space Poster Session 1
Yifan Zhou ⋅ Zeqi Xiao ⋅ Shuai Yang ⋅ Xingang Pan
ExHall D Poster #214
UniScene: Unified Occupancy-centric Driving Scene Generation Poster Session 3
Bohan Li ⋅ Jiazhe Guo ⋅ Hongsi Liu ⋅ Yingshuang Zou ⋅ Yikang Ding ⋅ Xiwu Chen ⋅ Hu ZHU ⋅ Feiyang Tan ⋅ Chi Zhang ⋅ Tiancai Wang ⋅ Shuchang Zhou ⋅ Li Zhang ⋅ Xiaojuan Qi ⋅ Hao Zhao ⋅ Mu Yang ⋅ Wenjun Zeng ⋅ Xin Jin
ExHall D Poster #128
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset Poster Session 1
Zhao Dong ⋅ Ka chen ⋅ Zhaoyang Lv ⋅ Hong-Xing Yu ⋅ Yunzhi Zhang ⋅ Cheng Zhang ⋅ Yufeng Zhu ⋅ Stephen Tian ⋅ Zhengqin Li ⋅ Geordie Moffatt ⋅ Sean Christofferson ⋅ James Fort ⋅ Xiaqing Pan ⋅ Mingfei Yan ⋅ Jiajun Wu ⋅ Carl Ren ⋅ Richard Newcombe
ExHall D Poster #55
Visual Persona: Foundation Model for Full-Body Human Customization Poster Session 4
Jisu Nam ⋅ Soowon Son ⋅ Zhan Xu ⋅ Jing Shi ⋅ Difan Liu ⋅ Feng Liu ⋅ Seungryong Kim ⋅ Yang Zhou
ExHall D Poster #272
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation Poster Session 5
Dingcheng Zhen ⋅ Shunshun Yin ⋅ Shiyang Qin ⋅ Hou Yi ⋅ Ziwei Zhang ⋅ Siyuan Liu ⋅ Gan Qi ⋅ Ming Tao
ExHall D Poster #3
Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties Poster Session 2
wenqiao Li ⋅ BoZhong Zheng ⋅ Xiaohao Xu ⋅ Jinye Gan ⋅ Fading Lu ⋅ Xiang Li ⋅ Na Ni ⋅ Zheng Tian ⋅ Xiaonan Huang ⋅ Shenghua Gao ⋅ Yingna Wu
ExHall D Poster #439
Blood Flow Speed Estimation with Optical Coherence Tomography Angiography Images Poster Session 2
Wensheng Cheng ⋅ Zhenghong Li ⋅ Jiaxiang Ren ⋅ Hyomin Jeong ⋅ Congwu Du ⋅ Yingtian Pan ⋅ Haibin Ling
ExHall D Poster #485
SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts Poster Session 6
Shijia Zhao ⋅ Qiming Xia ⋅ Xusheng Guo ⋅ Pufan Zou ⋅ Maoji Zheng ⋅ Hai Wu ⋅ Chenglu Wen ⋅ Cheng Wang
ExHall D Poster #308
NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval Poster Session 2
Zengrong Lin ⋅ Zheng Wang ⋅ Tianwen Qian ⋅ Pan Mu ⋅ Sixian Chan ⋅ Cong Bai
ExHall D Poster #371
K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs Poster Session 3
Ziheng Ouyang ⋅ Zhen Li ⋅ Qibin Hou
ExHall D Poster #228
HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator Poster Session 1
Fan Yang ⋅ Ru Zhen ⋅ Jianing Wang ⋅ Yanhao Zhang ⋅ Haoxiang Chen ⋅ Haonan Lu ⋅ Sicheng Zhao ⋅ Guiguang Ding
ExHall D Poster #351
CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI Poster Session 3
Siyuan Cheng ⋅ Lingjuan Lyu ⋅ Zhenting Wang ⋅ Xiangyu Zhang ⋅ Vikash Sehwag
ExHall D Poster #268
MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks Poster Session 1
Yifei Liu ⋅ Zhihang Zhong ⋅ Yifan Zhan ⋅ Sheng Xu ⋅ Xiao Sun
ExHall D Poster #48
CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-Scale Reinforcement Learning in Autonomous Driving Poster Session 4
Dongkun Zhang ⋅ Jiaming Liang ⋅ Ke Guo ⋅ Sha Lu ⋅ Qi Wang ⋅ Rong Xiong ⋅ Zhenwei Miao ⋅ Yue Wang
ExHall D Poster #136
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models Poster Session 1
Yunzhi Yan ⋅ Zhen Xu ⋅ Haotong Lin ⋅ Haian Jin ⋅ Haoyu Guo ⋅ Yida Wang ⋅ Kun Zhan ⋅ XianPeng Lang ⋅ Hujun Bao ⋅ Xiaowei Zhou ⋅ Sida Peng
ExHall D Poster #61
FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction Poster Session 5
Yifan Wang ⋅ Peishan Yang ⋅ Zhen Xu ⋅ Jiaming Sun ⋅ Zhanhua Zhang ⋅ chen yong ⋅ Hujun Bao ⋅ Sida Peng ⋅ Xiaowei Zhou
ExHall D Poster #66
DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds Poster Session 3
Youyu Chen ⋅ Junjun Jiang ⋅ Kui Jiang ⋅ Xiao Tang ⋅ Zhihao Li ⋅ Xianming Liu ⋅ Yinyu Nie
ExHall D Poster #47
Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable Poster Session 1
Xin Jin ⋅ Simon Niklaus ⋅ Zhoutong Zhang ⋅ Zhihao Xia ⋅ Chun-Le Guo ⋅ Yuting Yang ⋅ Jiawen Chen ⋅ Chongyi Li
ExHall D Poster #180
Words or Vision: Do Vision-Language Models Have Blind Faith in Text? Poster Session 1
Ailin Deng ⋅ Tri Cao ⋅ Zhirui Chen ⋅ Bryan Hooi
ExHall D Poster #352
Steepest Descent Density Control for Compact 3D Gaussian Splatting Poster Session 6
Peihao Wang ⋅ Yuehao Wang ⋅ Dilin Wang ⋅ Sreyas Mohan ⋅ Zhiwen Fan ⋅ Lemeng Wu ⋅ Ruisi Cai ⋅ Yu-Ying Yeh ⋅ Zhangyang Wang ⋅ Qiang Liu ⋅ Rakesh Ranjan
ExHall D Poster #48
Unboxed: Geometrically and Temporally Consistent Video Outpainting Poster Session 2
Zhongrui Yu ⋅ Martina Megaro-Boldini ⋅ Robert Sumner ⋅ Abdelaziz Djelouah
ExHall D Poster #186
Unlearning through Knowledge Overwriting: Reversible Federated Unlearning via Selective Sparse Adapter Poster Session 6
Zhengyi Zhong ⋅ Weidong Bao ⋅ Ji Wang ⋅ Shuai Zhang ⋅ Jingxuan Zhou ⋅ Lingjuan Lyu ⋅ Wei Yang Bryan Lim
ExHall D Poster #432
Improving the Transferability of Adversarial Attacks on Face Recognition with Diverse Parameters Augmentation Poster Session 1
Fengfan Zhou ⋅ Bangjie Yin ⋅ Hefei Ling ⋅ Qianyu Zhou ⋅ Wenxuan Wang
ExHall D Poster #319
Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement Poster Session 3
Xinjie Li ⋅ Ziyi Chen ⋅ Xinlu Yu ⋅ Iek-Heng Chu ⋅ Peng Chang ⋅ Jing Xiao
ExHall D Poster #69
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge Poster Session 3
Vishwesh Nath ⋅ Wenqi Li ⋅ Dong Yang ⋅ Andriy Myronenko ⋅ Yao Lu ⋅ Zhijian Liu ⋅ Danny Yin ⋅ Yucheng Tang ⋅ Pengfei Guo ⋅ Ziyue Xu ⋅ Can Zhao ⋅ Yufan He ⋅ Greg Heinrich ⋅ Mingxin Zheng ⋅ Benjamin D. Simon ⋅ Stephanie Anne Harmon ⋅ Michael Zephyr ⋅ Marc Edgar ⋅ Stephen R. Aylward ⋅ Pavlo Molchanov ⋅ Yan Mee LAW ⋅ Baris Turkbey ⋅ Holger R. Roth ⋅ Daguang Xu
ExHall D Poster #396
Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence Poster Session 1
Haolin Liu ⋅ Xiaohang Zhan ⋅ Zizheng Yan ⋅ Zhongjin Luo ⋅ Yuxin Wen ⋅ Xiaoguang Han
ExHall D Poster #70
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Poster Session 3
Chengyue Wu ⋅ Xiaokang Chen ⋅ Zhiyu Wu ⋅ Yiyang Ma ⋅ Xingchao Liu ⋅ Zizheng Pan ⋅ Wen Liu ⋅ Zhenda Xie ⋅ Xingkai Yu ⋅ Chong Ruan ⋅ Ping Luo
ExHall D Poster #221
Improved Video VAE for Latent Video Diffusion Model Poster Session 4
Pingyu Wu ⋅ Kai Zhu ⋅ Yu Liu ⋅ Liming Zhao ⋅ Wei Zhai ⋅ Yang Cao ⋅ Zheng-Jun Zha
ExHall D Poster #221
Towards Continual Universal Segmentation Poster Session 6
Zihan Lin ⋅ Zilei Wang ⋅ Xu Wang
ExHall D Poster #312
Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs Poster Session 5
Zeyi Huang ⋅ Yuyang Ji ⋅ Xiaofang Wang ⋅ Nikhil Mehta ⋅ Tong Xiao ⋅ Donghyun Lee ⋅ Sigmund VanValkenburgh ⋅ Shengxin Zha ⋅ Bolin Lai ⋅ Licheng Yu ⋅ Ning Zhang ⋅ Yong Jae Lee ⋅ Miao Liu
ExHall D Poster #301
Your Scale Factors are My Weapon: Targeted Bit-Flip Attacks on Vision Transformers via Scale Factor Manipulation Poster Session 4
Jialai Wang ⋅ Yuxiao Wu ⋅ Weiye Xu ⋅ Yating Huang ⋅ Chao Zhang ⋅ Zongpeng Li ⋅ Mingwei Xu ⋅ Zhenkai Liang
ExHall D Poster #410
HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation Poster Session 4
Hongwei Zheng ⋅ Han Li ⋅ Wenrui Dai ⋅ Ziyang Zheng ⋅ Chenglin Li ⋅ Junni Zou ⋅ Hongkai Xiong
ExHall D Poster #94
Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution Poster Session 2
Hang Xu ⋅ Jie Huang ⋅ Wei Yu ⋅ Jiangtong Tan ⋅ Zhen Zou ⋅ Feng Zhao
ExHall D Poster #205
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation Poster Session 5
Zehuan Huang ⋅ Yuanchen Guo ⋅ Xingqiao An ⋅ Yunhan Yang ⋅ Yangguang Li ⋅ Zi-Xin Zou ⋅ Ding Liang ⋅ Xihui Liu ⋅ Yan-Pei Cao ⋅ Lu Sheng
ExHall D Poster #250
EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting Poster Session 2
Suzhen Wang ⋅ Weijie Chen ⋅ Wei Zhang ⋅ Minda Zhao ⋅ Lincheng Li ⋅ Rongsheng Zhang ⋅ Zhipeng Hu ⋅ Xin Yu
ExHall D Poster #13
World-consistent Video Diffusion with Explicit 3D Modeling Poster Session 5
Qihang Zhang ⋅ Shuangfei Zhai ⋅ Miguel Ángel Bautista ⋅ Kevin Miao ⋅ Alexander Toshev ⋅ Joshua Susskind ⋅ Jiatao Gu
ExHall D Poster #60
Generative Gaussian Splatting for Unbounded 3D City Generation Poster Session 2
Haozhe Xie ⋅ Zhaoxi Chen ⋅ Fangzhou Hong ⋅ Ziwei Liu
ExHall D Poster #64
From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization Poster Session 5
Chao Yuan ⋅ Guiwei Zhang ⋅ Changxiao Ma ⋅ Tianyi Zhang ⋅ Guanglin Niu
ExHall D Poster #323
Tightening Robustness Verification of MaxPool-based Neural Networks via Minimizing the Over-Approximation Zone Poster Session 4
Yuan Xiao ⋅ Yuchen Chen ⋅ Shiqing Ma ⋅ Chunrong Fang ⋅ Tongtong Bai ⋅ Mingzheng Gu ⋅ Yuxin Cheng ⋅ Yanwei Chen ⋅ Zhenyu Chen
ExHall D Poster #465
Human Motion Instruction Tuning Poster Session 4
Lei Li ⋅ Sen Jia ⋅ Jianhao Wang ⋅ Zhongyu Jiang ⋅ Feng Zhou ⋅ Ju Dai ⋅ Tianfang Zhang ⋅ Zongkai Wu ⋅ Jenq-Neng Hwang
ExHall D Poster #170
Spectral Informed Mamba for Robust Point Cloud Processing Poster Session 3
Ali Bahri ⋅ Moslem Yazdanpanah ⋅ Mehrdad Noori ⋅ Sahar Dastani ⋅ Milad Cheraghalikhani ⋅ David OSOWIECHI ⋅ Gustavo Vargas Hakim ⋅ Farzad Beizaee ⋅ Ismail Ben Ayed ⋅ Christian Desrosiers
ExHall D Poster #112
What Makes a Good Dataset for Knowledge Distillation? Poster Session 5
Logan Frank ⋅ Jim Davis
ExHall D Poster #262
Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video Poster Session 3
Hoang Chuong Nguyen ⋅ Wei Mao ⋅ Jose M. Alvarez ⋅ Miaomiao Liu
ExHall D Poster #79
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table Poster Session 6
Yusuke Matsui
ExHall D Poster #410
Sea-ing in Low-light Poster Session 4
Nisha Varghese ⋅ A. N. Rajagopalan
ExHall D Poster #76
Exploring Simple Open-Vocabulary Semantic Segmentation Poster Session 6
Zihang Lai
ExHall D Poster #390
G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation Poster Session 1
Tianxing Chen ⋅ Yao Mu ⋅ Zhixuan Liang ⋅ Zanxin Chen ⋅ Shijia Peng ⋅ Qiangyu Chen ⋅ Mingkun Xu ⋅ Ruizhen Hu ⋅ Hongyuan Zhang ⋅ Xuelong Li ⋅ Ping Luo
ExHall D Poster #146
DropoutGS: Dropping Out Gaussians for Better Sparse-view Rendering Poster Session 1
Yexing Xu ⋅ Longguang Wang ⋅ Minglin Chen ⋅ Sheng Ao ⋅ Li Li ⋅ Yulan Guo
ExHall D Poster #50
Structure from Collision Poster Session 4
Takuhiro Kaneko
ExHall D Poster #44
SyncSDE: A Probabilistic Framework for Diffusion Synchronization Poster Session 4
Hyunjun Lee ⋅ Hyunsoo Lee ⋅ Sookwan Han
ExHall D Poster #163
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation Poster Session 1
Xiaozhong Ji ⋅ Xiaobin Hu ⋅ Zhihong Xu ⋅ Junwei Zhu ⋅ Chuming Lin ⋅ Qingdong He ⋅ Jiangning Zhang ⋅ Donghao Luo ⋅ Yi Chen ⋅ Qin Lin ⋅ qinglin lu ⋅ Chengjie Wang
ExHall D Poster #3
Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning Poster Session 1
Jeongryong Lee ⋅ Yejee Shin ⋅ Geonhui Son ⋅ Dosik Hwang
ExHall D Poster #369
SEEN-DA: SEmantic ENtropy guided Domain-aware Attention for Domain Adaptive Object Detection Poster Session 5
Haochen Li ⋅ Rui Zhang ⋅ Hantao Yao ⋅ Xin Zhang ⋅ Yifan Hao ⋅ Xinkai Song ⋅ Shaohui Peng ⋅ Yongwei Zhao ⋅ Zhao Chen ⋅ Yanjun Wu ⋅ Ling Li
ExHall D Poster #422
OffsetOPT: Explicit Surface Reconstruction without Normals Poster Session 3
Huan Lei
ExHall D Poster #104
InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation Poster Session 2
Sirui Xu ⋅ Dongting Li ⋅ Yucheng Zhang ⋅ Xiyan Xu ⋅ Qi Long ⋅ Ziyin Wang ⋅ Yunzhi Lu ⋅ Shuchang Dong ⋅ Hezi Jiang ⋅ Akshat Gupta ⋅ Yu-Xiong Wang ⋅ Liangyan Gui
ExHall D Poster #162
Q-PART: Quasi-Periodic Adaptive Regression with Test-time Training for Pediatric Left Ventricular Ejection Fraction Regression Poster Session 3
Jie Liu ⋅ Tiexin Qin ⋅ Hui Liu ⋅ Yilei Shi ⋅ Lichao Mou ⋅ Xiao Xiang Zhu ⋅ Shiqi Wang ⋅ Haoliang Li
ExHall D Poster #470
One-for-More: Continual Diffusion Model for Anomaly Detection Poster Session 1
Xiaofan Li ⋅ Xin Tan ⋅ Zhuo Chen ⋅ Zhizhong Zhang ⋅ Ruixin Zhang ⋅ Rizen Guo ⋅ Guannan Jiang ⋅ Yulong Chen ⋅ Yanyun Qu ⋅ Lizhuang Ma ⋅ Yuan Xie
ExHall D Poster #440
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation Poster Session 6
Weijia Wu ⋅ Mingyu Liu ⋅ Zeyu Zhu ⋅ Haoen Feng ⋅ Xi Xia ⋅ Wen Wang ⋅ Kevin Qinghong Lin ⋅ Chunhua Shen ⋅ Mike Zheng Shou
ExHall D Poster #271
Zero-Shot Styled Text Image Generation, but Make It Autoregressive Poster Session 2
Vittorio Pippi ⋅ Fabio Quattrini ⋅ Silvia Cascianelli ⋅ Alessio Tonioni ⋅ Rita Cucchiara
ExHall D Poster #243
Event-based Video Super-Resolution via State Space Models Poster Session 3
Zeyu Xiao ⋅ Xinchao Wang
ExHall D Poster #182
BiM-VFI: Bidirectional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions Poster Session 2
Wonyong Seo ⋅ Jihyong Oh ⋅ Munchurl Kim
ExHall D Poster #180
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter Poster Session 4
Yaohua Zha ⋅ Yanzi Wang ⋅ Hang Guo ⋅ Jinpeng Wang ⋅ Tao Dai ⋅ Bin Chen ⋅ Zhihao Ouyang ⋅ Xue Yuerong ⋅ Ke Chen ⋅ Shu-Tao Xia
ExHall D Poster #111
FG^2: Fine-Grained Cross-View Localization by Fine-Grained Feature Matching Poster Session 2
Zimin Xia ⋅ Alex Alahi
ExHall D Poster #94
IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments Poster Session 3
Can Zhang ⋅ Gim Hee Lee
ExHall D Poster #143
Scaling Properties of Diffusion Models For Perceptual Tasks Poster Session 3
Rahul Ravishankar ⋅ Zeeshan Patel ⋅ Jathushan Rajasegaran ⋅ Jitendra Malik
ExHall D Poster #219
Population Normalization for Federated Learning Poster Session 2
Zhuoyao Wang ⋅ Fan Yi ⋅ Peizhu Gong ⋅ Caitou He ⋅ Cheng Jin ⋅ Weizhong Zhang
ExHall D Poster #461
Active Event-based Stereo Vision Poster Session 1
Jianing Li ⋅ Yunjian Zhang ⋅ Haiqian Han ⋅ Xiangyang Ji
ExHall D Poster #75
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation Poster Session 6
Sitong Gong ⋅ Yunzhi Zhuge ⋅ Lu Zhang ⋅ Zongxin Yang ⋅ Pingping Zhang ⋅ Huchuan Lu
ExHall D Poster #290
ESC: Erasing Space Concept for Knowledge Deletion Poster Session 1
Tae-Young Lee ⋅ Sundong Park ⋅ Minwoo Jeon ⋅ Hyoseok Hwang ⋅ Gyeong-Moon Park
ExHall D Poster #463
Perceptual Video Compression with Neural Wrapping Poster Session 4
Muhammad Umar Karim Khan ⋅ Aaron Chadha ⋅ Mohammad Ashraful Anam ⋅ Yiannis Andreopoulos
ExHall D Poster #185
Towards More General Video-based Deepfake Detection through Facial Component Guided Adaptation for Foundation Model Poster Session 5
Yue-Hua Han ⋅ Tai-Ming Huang ⋅ Kailung Hua ⋅ Jun-Cheng Chen
ExHall D Poster #184
LaTexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending Poster Session 5
Jian Jin ⋅ Zhenbo Yu ⋅ Yang Shen ⋅ Zhenyong Fu ⋅ Jian Yang
ExHall D Poster #242
Generative Hard Example Augmentation for Semantic Point Cloud Segmentation Poster Session 5
Qi Zhang ⋅ Jibin Peng ⋅ Zhao Huang ⋅ Wei Feng ⋅ Di Lin
ExHall D Poster #110
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization Poster Session 2
Hongrui Jia ⋅ Chaoya Jiang ⋅ Haiyang Xu ⋅ Wei Ye ⋅ Mengfan Dong ⋅ Ming Yan ⋅ Ji Zhang ⋅ Fei Huang ⋅ Shikun Zhang
ExHall D Poster #380
A Unified Model for Compressed Sensing MRI Across Undersampling Patterns Poster Session 5
Armeet Singh Jatyani ⋅ Jiayun Wang ⋅ Aditi Chandrashekar ⋅ Zihui Wu ⋅ Miguel Liu-Schiaffini ⋅ Bahareh Tolooshams ⋅ Anima Anandkumar
ExHall D Poster #477
Geometry-guided Online 3D Video Synthesis with Multi-View Temporal Consistency Poster Session 3
Hyunho Ha ⋅ Lei Xiao ⋅ Christian Richardt ⋅ Thu Nguyen-Phuoc ⋅ Changil Kim ⋅ Min H. Kim ⋅ Douglas Lanman ⋅ Numair Khan
ExHall D Poster #59
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation Poster Session 6
Chuhao Chen ⋅ Zhiyang Dou ⋅ Chen Wang ⋅ Yiming Huang ⋅ Anjun Chen ⋅ Qiao Feng ⋅ Jiatao Gu ⋅ Lingjie Liu
ExHall D Poster #37
Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization Poster Session 2
zefeng zhang ⋅ Hengzhu Tang ⋅ Jiawei Sheng ⋅ Zhenyu Zhang ⋅ YiMing Ren ⋅ Zhenyang Li ⋅ Dawei Yin ⋅ Duohe Ma ⋅ Tingwen Liu
ExHall D Poster #386
FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video Poster Session 4
Andrea Boscolo Camiletto ⋅ Jian Wang ⋅ Eduardo Alvarado ⋅ Rishabh Dabral ⋅ Thabo Beeler ⋅ Marc Habermann ⋅ Christian Theobalt
ExHall D Poster #162
REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning Poster Session 2
Jihyun Lee ⋅ Weipeng Xu ⋅ Alexander Richard ⋅ Shih-En Wei ⋅ Shunsuke Saito ⋅ Shaojie Bai ⋅ Te-Li Wang ⋅ Minhyuk Sung ⋅ Tae-Kyun Kim ⋅ Jason Saragih
ExHall D Poster #166
MDP: Multidimensional Vision Model Pruning with Latency Constraint Poster Session 4
Xinglong Sun ⋅ Barath Lakshmanan ⋅ Maying Shen ⋅ Shiyi Lan ⋅ Jingde Chen ⋅ Jose M. Alvarez
ExHall D Poster #411
End-to-End Implicit Neural Representations for Classification Poster Session 4
Alexander Gielisse ⋅ Jan van Gemert
ExHall D Poster #281
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge Poster Session 3
Xuan Shen ⋅ Weize Ma ⋅ Jing Liu ⋅ Changdi Yang ⋅ Rui Ding ⋅ Quanyi Wang ⋅ Henghui Ding ⋅ Wei Niu ⋅ Yanzhi Wang ⋅ Pu Zhao ⋅ Jun Lin ⋅ Jiuxiang Gu
ExHall D Poster #75
Polarized Color Screen Matting Poster Session 1
Kenji Enomoto ⋅ Scott Cohen ⋅ Brian Price ⋅ TJ Rhodes
ExHall D Poster #21
Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models Poster Session 6
Yuhao Cui ⋅ Xinxing Zu ⋅ Wenhua Zhang ⋅ Zhongzhou Zhao ⋅ Jinyang Gao
ExHall D Poster #344
DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation Poster Session 3
Chun-Hung Wu ⋅ Shih-Hong Chen ⋅ Chih Yao Hu ⋅ Hsin-Yu Wu ⋅ Kai-Hsin Chen ⋅ Yu-You Chen ⋅ Chih-Hai Su ⋅ Chih-Kuo Lee ⋅ Yu-Lun Liu
ExHall D Poster #482
STEP: Enhancing Video-LLMs’ Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training Poster Session 1
Haiyi Qiu ⋅ Minghe Gao ⋅ Long Qian ⋅ Kaihang Pan ⋅ Qifan Yu ⋅ Juncheng Li ⋅ Wenjie Wang ⋅ Siliang Tang ⋅ Yueting Zhuang ⋅ Tat-seng Chua
ExHall D Poster #298
FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs Poster Session 2
Xiaoqin Wang ⋅ Xusen Ma ⋅ Xianxu Hou ⋅ Meidan Ding ⋅ Yudong Li ⋅ Junliang Chen ⋅ Wenting Chen ⋅ Xiaoyang Peng ⋅ Linlin Shen
ExHall D Poster #361
SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis Poster Session 5
Bangbang Zhou ⋅ Zuan Gao ⋅ Zixiao Wang ⋅ Boqiang Zhang ⋅ Yuxin Wang ⋅ Zhineng Chen ⋅ Hongtao Xie
ExHall D Poster #360
DrVideo: Document Retrieval Based Long Video Understanding Poster Session 4
Ziyu Ma ⋅ Chenhui Gou ⋅ Hengcan Shi ⋅ Bin Sun ⋅ Shutao Li ⋅ Hamid Rezatofighi ⋅ Jianfei Cai
ExHall D Poster #300
Event Ellipsometer: Event-based Mueller-Matrix Video Imaging Poster Session 5
Ryota Maeda ⋅ Yunseong Moon ⋅ Seung-Hwan Baek
ExHall D Poster #72
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects Poster Session 5
Lei Fan ⋅ Dongdong Fan ⋅ Zhiguang Hu ⋅ Yiwen Ding ⋅ Donglin Di ⋅ Kai Yi ⋅ Maurice Pagnucco ⋅ Yang Song
ExHall D Poster #428
Gradient-Guided Annealing for Domain Generalization Poster Session 4
Aristotelis Ballas ⋅ Christos Diou
ExHall D Poster #452
Continuous Locomotive Crowd Behavior Generation Poster Session 5
Inhwan Bae ⋅ Junoh Lee ⋅ Hae-Gon Jeon
ExHall D Poster #130
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation Poster Session 1
Zihao Zhang ⋅ Haoran Chen ⋅ Haoyu Zhao ⋅ Guansong Lu ⋅ Yanwei Fu ⋅ Hang Xu ⋅ Zuxuan Wu
ExHall D Poster #182
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation Poster Session 2
Lunhao Duan ⋅ Shanshan Zhao ⋅ Wenjun Yan ⋅ Yinglun Li ⋅ Qing-Guo Chen ⋅ Zhao Xu ⋅ Weihua Luo ⋅ Kaifu Zhang ⋅ Mingming Gong ⋅ Gui-Song Xia
ExHall D Poster #248
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos Poster Session 4
Tiantian Geng ⋅ Jinrui Zhang ⋅ Qingni Wang ⋅ Teng Wang ⋅ Jinming Duan ⋅ Feng Zheng
ExHall D Poster #302
CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images Poster Session 3
Chen Cheng ⋅ Jiacheng Wei ⋅ Tianrun Chen ⋅ Chi Zhang ⋅ Xiaofeng Yang ⋅ Shangzhan Zhang ⋅ Bingchen Yang ⋅ Chuan-Sheng Foo ⋅ Guosheng Lin ⋅ Qixing Huang ⋅ Fayao Liu
ExHall D Poster #40
Disco4D: Disentangled 4D Human Generation and Animation from a Single Image Poster Session 6
Hui En Pang ⋅ Shuai Liu ⋅ Zhongang Cai ⋅ Lei Yang ⋅ Tianwei Zhang ⋅ Ziwei Liu
ExHall D Poster #14
Anatomical Consistency and Adaptive Prior-informed Transformation for Multi-contrast MR Image Synthesis via Diffusion Model Poster Session 6
Yejee Shin ⋅ Yeeun Lee ⋅ Hanbyol Jang ⋅ Geonhui Son ⋅ Hyeongyu Kim ⋅ Dosik Hwang
ExHall D Poster #456
SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks Poster Session 5
Shining Wang ⋅ Yunlong Wang ⋅ Ruiqi Wu ⋅ Bingliang Jiao ⋅ Wenxuan Wang ⋅ Peng Wang
ExHall D Poster #102
AlphaPre: Amplitude-Phase Disentanglement Model for Precipitation Nowcasting Poster Session 4
Kenghong Lin ⋅ Baoquan Zhang ⋅ Demin Yu ⋅ Wenzhi Feng ⋅ Shidong Chen ⋅ Feifan Gao ⋅ Xutao Li ⋅ Yunming Ye
ExHall D Poster #194
MODA: Motion-Drift Augmentation for Inertial Human Motion Analysis Poster Session 6
Yinghao Wu ⋅ Shihui Guo ⋅ Yipeng Qin
ExHall D Poster #153
Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation Poster Session 3
Jiaxin Cai ⋅ Jingze Su ⋅ Qi Li ⋅ Wenjie Yang ⋅ Shu Wang ⋅ Tiesong Zhao ⋅ Shengfeng He ⋅ Wenxi Liu
ExHall D Poster #410
TopoCellGen: Generating Histopathology Cell Topology with a Diffusion Model Poster Session 5
Meilong Xu ⋅ Saumya Gupta ⋅ Xiaoling Hu ⋅ Chen Li ⋅ Shahira Abousamra ⋅ Dimitris Samaras ⋅ Prateek Prasanna ⋅ Chao Chen
ExHall D Poster #458
LLM-driven Multimodal and Multi-Identity Listening Head Generation Poster Session 3
Peiwen Lai ⋅ Weizhi Zhong ⋅ Yipeng Qin ⋅ Xiaohang Ren ⋅ Baoyuan Wang ⋅ Guanbin Li
ExHall D Poster #1
GOAL: Global-local Object Alignment Learning Poster Session 1
Hyungyu Choi ⋅ Young Kyun Jang ⋅ Chanho Eom
ExHall D Poster #372
MEGA: Masked Generative Autoencoder for Human Mesh Recovery Poster Session 2
Guénolé Fiche ⋅ Simon Leglaive ⋅ Xavier Alameda-Pineda ⋅ Francesc Moreno-Noguer
ExHall D Poster #90
Incomplete Multi-modal Brain Tumor Segmentation via Learnable Sorting State Space Model Poster Session 5
Zheyu Zhang ⋅ Yayuan Lu ⋅ Feipeng Ma ⋅ Yueyi Zhang ⋅ Huanjing Yue ⋅ Xiaoyan Sun
ExHall D Poster #475
FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy Poster Session 1
Xingchao Yang ⋅ Takafumi Taketomi ⋅ Yuki Endo ⋅ Yoshihiro Kanamori
ExHall D Poster #15
CADDreamer: CAD Object Generation from Single-view Images Poster Session 5
Yuan Li ⋅ Cheng Lin ⋅ Yuan Liu ⋅ Xiaoxiao Long ⋅ Chenxu Zhang ⋅ Ningna Wang ⋅ Xin Li ⋅ Wenping Wang ⋅ Xiaohu Guo
ExHall D Poster #38
Vision-Language Model IP Protection via Prompt-based Learning Poster Session 2
Lianyu Wang ⋅ Meng Wang ⋅ Huazhu Fu ⋅ Daoqiang Zhang
ExHall D Poster #393
Where's the Liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content Poster Session 6
Haoyue Bai ⋅ Yiyou Sun ⋅ Wei Cheng ⋅ Haifeng Chen
ExHall D Poster #253
DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations Poster Session 5
Krishna Sri Ipsit Mantri ⋅ Carola-Bibiane Schönlieb ⋅ Bruno Ribeiro ⋅ Chaim Baskin ⋅ Moshe Eliasof
ExHall D Poster #399
AvatarArtist: Open-Domain 4D Avatarization Poster Session 3
Hongyu Liu ⋅ Xuan Wang ⋅ Ziyu Wan ⋅ Yue Ma ⋅ Jingye Chen ⋅ Yanbo Fan ⋅ Yujun Shen ⋅ Yibing Song ⋅ Qifeng Chen
ExHall D Poster #10
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models Poster Session 5
Zhendong Wang ⋅ Jianmin Bao ⋅ Shuyang Gu ⋅ Dong Chen ⋅ Wengang Zhou ⋅ Houqiang Li
ExHall D Poster #239
Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing Poster Session 4
Chen Liao ⋅ Yan Shen ⋅ Dan Li ⋅ Zhongli Wang
ExHall D Poster #209
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content Poster Session 2
Qiuheng Wang ⋅ Yukai Shi ⋅ Jiarong Ou ⋅ Rui Chen ⋅ Ke Lin ⋅ Jiahao Wang ⋅ Boyuan Jiang ⋅ Haotian Yang ⋅ Mingwu Zheng ⋅ Xin Tao ⋅ Fei Yang ⋅ Pengfei Wan ⋅ Di ZHANG
ExHall D Poster #292
Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation Poster Session 6
Ying Jin ⋅ Jinlong Peng ⋅ Qingdong He ⋅ Teng Hu ⋅ Jiafu Wu ⋅ Hao Chen ⋅ Haoxuan Wang ⋅ wenbing zhu ⋅ Mingmin Chi ⋅ Jun Liu ⋅ Yabiao Wang
ExHall D Poster #409
Semantic and Sequential Alignment for Referring Video Object Segmentation Poster Session 4
Feiyu Pan ⋅ Hao Fang ⋅ Fangkai Li ⋅ Yanyu Xu ⋅ Yawei Li ⋅ Luca Benini ⋅ Xiankai Lu
ExHall D Poster #312
ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models Poster Session 4
Yahan Tu ⋅ Rui Hu ⋅ Jitao Sang
ExHall D Poster #384
Self-Supervised Learning for Color Spike Camera Reconstruction Poster Session 2
Yanchen Dong ⋅ Ruiqin Xiong ⋅ Xiaopeng Fan ⋅ Zhaofei Yu ⋅ Yonghong Tian ⋅ Tiejun Huang
ExHall D Poster #76
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized? Poster Session 5
Jianyang Xie ⋅ Yitian Zhao ⋅ Yanda Meng ⋅ He Zhao ⋅ Anh Nguyen ⋅ Yalin Zheng
ExHall D Poster #314
DA-VPT: Semantic-Guided Visual Prompt Tuning for Vision Transformers Poster Session 1
Li Ren ⋅ Chen Chen ⋅ Liqiang Wang ⋅ Kien A. Hua
ExHall D Poster #402
MambaIRv2: Attentive State Space Restoration Poster Session 6
Hang Guo ⋅ Yong Guo ⋅ Yaohua Zha ⋅ Yulun Zhang ⋅ Wenbo Li ⋅ Tao Dai ⋅ Shu-Tao Xia ⋅ Yawei Li
ExHall D Poster #186
Towards Lossless Implicit Neural Representation via Bit Plane Decomposition Poster Session 1
Woo Kyoung Han ⋅ Byeonghun Lee ⋅ Hyunmin Cho ⋅ Sunghoon Im ⋅ Kyong Hwan Jin
ExHall D Poster #198
CoMatcher: Multi-View Collaborative Feature Matching Poster Session 5
Jintao Zhang ⋅ Zimin Xia ⋅ Mingyue Dong ⋅ Shuhan Shen ⋅ Linwei Yue ⋅ Xianwei Zheng
ExHall D Poster #88
Spectral State Space Model for Rotation-Invariant Visual Representation Learning Poster Session 5
Sahar Dastani ⋅ Ali Bahri ⋅ Moslem Yazdanpanah ⋅ Mehrdad Noori ⋅ David OSOWIECHI ⋅ Gustavo Vargas Hakim ⋅ Farzad Beizaee ⋅ Milad Cheraghalikhani ⋅ Arnab Mondal ⋅ Herve Lombaert ⋅ Christian Desrosiers
ExHall D Poster #274
AffordDP: Generalizable Diffusion Policy with Transferable Affordance Poster Session 2
Shijie Wu ⋅ Yihang Zhu ⋅ Yunao Huang ⋅ Kaizhen Zhu ⋅ Jiayuan Gu ⋅ Jingyi Yu ⋅ Ye Shi ⋅ Jingya Wang
ExHall D Poster #153
HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation Poster Session 1
Hermann Kumbong ⋅ Xian Liu ⋅ Tsung-Yi Lin ⋅ Ming-Yu Liu ⋅ Xihui Liu ⋅ Ziwei Liu ⋅ Daniel Y Fu ⋅ Christopher Re ⋅ David W. Romero
ExHall D Poster #227
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning Poster Session 5
Shihao Wang ⋅ Zhiding Yu ⋅ Xiaohui Jiang ⋅ Shiyi Lan ⋅ Min Shi ⋅ Nadine Chang ⋅ Jan Kautz ⋅ Ying Li ⋅ Jose M. Alvarez
ExHall D Poster #132
MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing Poster Session 6
Cong Wang ⋅ Di Kang ⋅ Heyi Sun ⋅ SHENHAN QIAN ⋅ Zixuan Wang ⋅ Linchao Bao ⋅ Song-Hai Zhang
ExHall D Poster #7
Image Generation Diversity Issues and How to Tame Them Poster Session 1
Mischa Dombrowski ⋅ Weitong Zhang ⋅ Hadrien Reynaud ⋅ Sarah Cechnicka ⋅ Bernhard Kainz
ExHall D Poster #274
Annotation Ambiguity Aware Semi-Supervised Medical Image Segmentation Poster Session 2
Suruchi Kumari ⋅ Pravendra Singh
ExHall D Poster #479
CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models Poster Session 2
Felix Taubner ⋅ Ruihang Zhang ⋅ Mathieu Tuli ⋅ David B. Lindell
ExHall D Poster #9
Comprehensive Information Bottleneck for Unveiling Universal Attribution to Interpret Vision Transformers Poster Session 5
Jung-Ho Hong ⋅ Ho-Joong Kim ⋅ Kyu-Sung Jeon ⋅ Seong-Whan Lee
ExHall D Poster #394
OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit Poster Session 3
Benquan Wang ⋅ Ruyi An ⋅ Jin-Kyu So ⋅ Sergei Kurdiumov ⋅ Eng Aik Chan ⋅ Giorgio Adamo ⋅ Yuhan Peng ⋅ Yewen Li ⋅ Bo An
ExHall D Poster #23
Dataset Distillation with Neural Characteristic Function: A Minmax Perspective Poster Session 5
Shaobo Wang ⋅ Yicun Yang ⋅ Zhiyuan Liu ⋅ Chenghao Sun ⋅ Xuming Hu ⋅ Conghui He ⋅ Linfeng Zhang
ExHall D Poster #433
Free-viewpoint Human Animation with Pose-correlated Reference Selection Poster Session 6
Fa-Ting Hong ⋅ Zhan Xu ⋅ Haiyang Liu ⋅ Qinjie Lin ⋅ Luchuan Song ⋅ ZHIXIN SHU ⋅ Yang Zhou ⋅ Duygu Ceylan ⋅ Dan Xu
ExHall D Poster #5
Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking Poster Session 3
Hongkai Wei ⋅ YANG YANG ⋅ Shijie Sun ⋅ Mingtao Feng ⋅ Xiangyu Song ⋅ Qi Lei ⋅ Hongli Hu ⋅ Rong Wang ⋅ Huansheng Song ⋅ Naveed Akhtar ⋅ Ajmal Mian
ExHall D Poster #309
Towards Universal Dataset Distillation via Task-Driven Diffusion Poster Session 3
Ding Qi ⋅ Jian Li ⋅ Junyao Gao ⋅ Shuguang Dou ⋅ Ying Tai ⋅ Jianlong Hu ⋅ Bo Zhao ⋅ Yabiao Wang ⋅ Chengjie Wang ⋅ Cai Rong Zhao
ExHall D Poster #262
Parametric Point Cloud Completion for Polygonal Surface Reconstruction Poster Session 3
Zhaiyu Chen ⋅ Yuqing Wang ⋅ Liangliang Nan ⋅ Xiao Xiang Zhu
ExHall D Poster #106
SEAL: Semantic Attention Learning for Long Video Representation Poster Session 6
Lan Wang ⋅ Yujia Chen ⋅ Wen-Sheng Chu ⋅ Vishnu Naresh Boddeti ⋅ Du Tran
ExHall D Poster #268
LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping Poster Session 1
Pascal Chang ⋅ Sergio Sancho ⋅ Jingwei Tang ⋅ Markus Gross ⋅ Vinicius C. Azevedo
ExHall D Poster #215
Paint by Inpaint: Learning to Add Image Objects by Removing Them First Poster Session 4
Navve Wasserman ⋅ Noam Rotstein ⋅ Roy Ganz ⋅ Ron Kimmel
ExHall D Poster #240
Dual Semantic Guidance for Open Vocabulary Semantic Segmentation Poster Session 4
ZhengYang Wang ⋅ Tingliang Feng ⋅ Fan Lyu ⋅ Fanhua Shang ⋅ Wei Feng ⋅ Liang Wan
ExHall D Poster #420
TraF-Align: Trajectory-aware Feature Alignment for Asynchronous Multi-agent Perception Poster Session 3
Zhiying Song ⋅ Lei Yang ⋅ Fuxi Wen ⋅ Jun Li
ExHall D Poster #135
DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching Poster Session 3
Emanuele Aiello ⋅ Umberto Michieli ⋅ Diego Valsesia ⋅ Mete Ozay ⋅ Enrico Magli
ExHall D Poster #174
3D Gaussian Inpainting with Depth-Guided Cross-View Consistency Poster Session 6
Sheng-Yu Huang ⋅ Zi-Ting Chou ⋅ Yu-Chiang Frank Wang
ExHall D Poster #52
Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation Poster Session 5
David T. Hoffmann ⋅ Syed Haseeb Raza ⋅ Hanqiu Jiang ⋅ Steffen Klingenhoefer ⋅ Denis Tananaev ⋅ Martin Meinke
ExHall D Poster #122
Identity-preserving Distillation Sampling by Fixed-Point Iterator Poster Session 3
SeonHwa Kim ⋅ Jiwon Kim ⋅ Soobin Park ⋅ Donghoon Ahn ⋅ Jiwon Kang ⋅ Seungryong Kim ⋅ Kyong Hwan Jin ⋅ Eunju Cha
ExHall D Poster #44
VladVA: Discriminative Fine-tuning of LVLMs Poster Session 1
Yassine Ouali ⋅ Adrian Bulat ⋅ ALEXANDROS XENOS ⋅ Anestis Zaganidis ⋅ Ioannis Maniadis Metaxas ⋅ Brais Martinez ⋅ Georgios Tzimiropoulos
ExHall D Poster #375
Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction Poster Session 1
Cecilia Curreli ⋅ Dominik Muhle ⋅ Abhishek Saroha ⋅ Zhenzhang Ye ⋅ Riccardo Marin ⋅ Daniel Cremers
ExHall D Poster #158
LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene Poster Session 4
Xiaoyu Zhang ⋅ Weihong Pan ⋅ Chong Bao ⋅ Xiyu Zhang ⋅ Xiaojun Xiang ⋅ Hanqing Jiang ⋅ Hujun Bao
ExHall D Poster #26
PICO: Reconstructing 3D People In Contact with Objects Poster Session 1
Alpár Cseke ⋅ Shashank Tripathi ⋅ Sai Kumar Dwivedi ⋅ Arjun Lakshmipathy ⋅ Agniv Chatterjee ⋅ Michael J. Black ⋅ Dimitrios Tzionas
ExHall D Poster #150
TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features Poster Session 5
Dana Cohen-Bar ⋅ Daniel Cohen-Or ⋅ Gal Chechik ⋅ Yoni Kasten
ExHall D Poster #34
Do Your Best and Get Enough Rest for Continual Learning Poster Session 2
Hankyul Kang ⋅ Gregor Seifer ⋅ Donghyun Lee ⋅ Jongbin Ryu
ExHall D Poster #448
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation Poster Session 3
Xin Zhang ⋅ Robby T. Tan
ExHall D Poster #370
Vision-Guided Action: Enhancing 3D Human Motion Prediction with Gaze-informed Affordance in 3D Scenes Poster Session 3
Ting Yu ⋅ Yi Lin ⋅ Jun Yu ⋅ Zhenyu Lou ⋅ Qiongjie Cui
ExHall D Poster #161
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes Poster Session 6
Xiang Xu ⋅ Lingdong Kong ⋅ hui shuai ⋅ Liang Pan ⋅ Ziwei Liu ⋅ Qingshan Liu
ExHall D Poster #116
WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models Poster Session 1
Fu Feng ⋅ Yucheng Xie ⋅ Jing Wang ⋅ Xin Geng
ExHall D Poster #445
PI-HMR: Towards Robust In-bed Temporal Human Shape Reconstruction with Contact Pressure Sensing Poster Session 6
Ziyu Wu ⋅ Yufan Xiong ⋅ Mengting Niu ⋅ Fangting Xie ⋅ Quan Wan ⋅ Qijun Ying ⋅ Boyan Liu ⋅ Xiaohui Cai
ExHall D Poster #150
CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset Poster Session 1
Xiao Wang ⋅ Fuling Wang ⋅ Yuehang Li ⋅ Qingchuan Ma ⋅ Shiao Wang ⋅ Bo Jiang ⋅ Jin Tang
ExHall D Poster #474
VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding Poster Session 3
Chaoyu Li ⋅ Eun Woo Im ⋅ Pooyan Fazli
ExHall D Poster #294
From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification Poster Session 2
Yan Jiang ⋅ Hao Yu ⋅ Xu Cheng ⋅ Haoyu Chen ⋅ Zhaodong Sun ⋅ Guoying Zhao
ExHall D Poster #330
AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward Poster Session 5
Haonan Han ⋅ Xiangzuo Wu ⋅ Huan Liao ⋅ Zunnan Xu ⋅ Zhongyuan Hu ⋅ Ronghui Li ⋅ Yachao Zhang ⋅ Xiu Li
ExHall D Poster #160
4Deform: Neural Surface Deformation for Robust Shape Interpolation Poster Session 2
Lu Sang ⋅ Zehranaz Canfes ⋅ Dongliang Cao ⋅ Riccardo Marin ⋅ Florian Bernard ⋅ Daniel Cremers
ExHall D Poster #111
TSAM: Temporal SAM Augmented with Multimodal Prompts for Referring Audio-Visual Segmentation Poster Session 5
Abduljalil Radman ⋅ Jorma Laaksonen
ExHall D Poster #280
Dense Match Summarization for Faster Two-view Estimation Poster Session 1
Jonathan Astermark ⋅ Anders Heyden ⋅ Viktor Larsson
ExHall D Poster #86
MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction Poster Session 1
Gangjian Zhang ⋅ Nanjie Yao ⋅ Shunsi Zhang ⋅ Hanfeng Zhao ⋅ Guoliang Pang ⋅ Jian Shu ⋅ Hao Wang
ExHall D Poster #16
Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing Poster Session 1
Shengzhi Wang ⋅ Yingkang Zhong ⋅ Jiangchuan Mu ⋅ Kai WU ⋅ Mingliang Xiong ⋅ Wen Fang ⋅ Mingqing Liu ⋅ Hao Deng ⋅ Bin He ⋅ Gang Li ⋅ Qingwen Liu
ExHall D Poster #179
Interpreting Object-level Foundation Models via Visual Precision Search Poster Session 6
Ruoyu Chen ⋅ Siyuan Liang ⋅ Jingzhi Li ⋅ Shiming Liu ⋅ Maosen Li ⋅ Zhen Huang ⋅ Hua Zhang ⋅ Xiaochun Cao
ExHall D Poster #372
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Poster Session 1
Guangda Ji ⋅ Silvan Weder ⋅ Francis Engelmann ⋅ Marc Pollefeys ⋅ Hermann Blum
ExHall D Poster #406
Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration Poster Session 3
JUNSEONG KIM ⋅ GeonU Kim ⋅ Kim Yu-Ji ⋅ Yu-Chiang Frank Wang ⋅ Jaesung Choe ⋅ Tae-Hyun Oh
ExHall D Poster #334
Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes Poster Session 4
Kaiwei Zhang ⋅ Dandan Zhu ⋅ Xiongkuo Min ⋅ Guangtao Zhai
ExHall D Poster #35
Wavelet and Prototype Augmented Query-based Transformer for Pixel-level Surface Defect Detection Poster Session 5
Feng Yan ⋅ Xiaoheng Jiang ⋅ Yang Lu ⋅ Jiale Cao ⋅ Dong Chen ⋅ Mingliang Xu
ExHall D Poster #272
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM Poster Session 4
Wang Jiarui ⋅ Huiyu Duan ⋅ Guangtao Zhai ⋅ Juntong Wang ⋅ Xiongkuo Min
ExHall D Poster #294
GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection Poster Session 1
Jeffri Erwin Murrugarra Llerena ⋅ José Henrique Marques ⋅ Claudio Jung
ExHall D Poster #326
Alignment, Mining and Fusion: Representation Alignment with Hard Negative Mining and Selective Knowledge Fusion for Medical Visual Question Answering Poster Session 6
Yuanhao Zou ⋅ Zhaozheng Yin
ExHall D Poster #332
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs Poster Session 1
Zicheng Zhang ⋅ Ziheng Jia ⋅ Haoning Wu ⋅ Chunyi Li ⋅ Zijian Chen ⋅ Yingjie Zhou ⋅ Wei Sun ⋅ Xiaohong Liu ⋅ Xiongkuo Min ⋅ Weisi Lin ⋅ Guangtao Zhai
ExHall D Poster #293
Leveraging SD Map to Augment HD Map-based Trajectory Prediction Poster Session 4
Zhiwei Dong ⋅ Ran Ding ⋅ Wei Li ⋅ Zhang Peng ⋅ Guobin Tang ⋅ Jia Guo
ExHall D Poster #134
4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video Poster Session 1
Qiang Hu ⋅ Zihan Zheng ⋅ Houqiang Zhong ⋅ Sihua Fu ⋅ Li Song ⋅ Xiaoyun Zhang ⋅ Guangtao Zhai ⋅ Yanfeng Wang
ExHall D Poster #66
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation Poster Session 4
Yiming Qin ⋅ Zhu Xu ⋅ Yang Liu
ExHall D Poster #262
MAD: Memory-Augmented Detection of 3D Objects Poster Session 1
Ben Agro ⋅ Sergio Casas ⋅ Patrick Wang ⋅ Thomas Gilles ⋅ Raquel Urtasun
ExHall D Poster #120
MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos Poster Session 3
Zhengqi Li ⋅ Richard Tucker ⋅ Forrester Cole ⋅ Qianqian Wang ⋅ Linyi Jin ⋅ Vickie Ye ⋅ Angjoo Kanazawa ⋅ Aleksander Holynski ⋅ Noah Snavely
ExHall D Poster #78
Distilled Prompt Learning for Incomplete Multimodal Survival Prediction Poster Session 1
Yingxue Xu ⋅ Fengtao ZHOU ⋅ Chenyu Zhao ⋅ Yihui Wang ⋅ Can Yang ⋅ Hao Chen
ExHall D Poster #472
PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation Poster Session 5
Xinting Hu ⋅ Haoran Wang ⋅ Jan Lenssen ⋅ Bernt Schiele
ExHall D Poster #264
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models Poster Session 5
Jinhui Yi ⋅ Syed Talal Wasim ⋅ Yanan Luo ⋅ Muzammal Naseer ⋅ Jürgen Gall
ExHall D Poster #296
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts Poster Session 4
Yuxuan Wang ⋅ Yueqian Wang ⋅ Bo Chen ⋅ Tong Wu ⋅ Dongyan Zhao ⋅ Zilong Zheng
ExHall D Poster #299
AirRoom: Objects Matter in Room Reidentification Poster Session 1
Runmao Yao ⋅ Yi Du ⋅ Zhuoqun Chen ⋅ Haoze Zheng ⋅ Chen Wang
ExHall D Poster #113
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach Poster Session 1
Lingchen Sun ⋅ Rongyuan Wu ⋅ Zhiyuan Ma ⋅ Shuaizheng Liu ⋅ Qiaosi Yi ⋅ Lei Zhang
ExHall D Poster #204
SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization Poster Session 4
Yi Du ⋅ Zhipeng Zhao ⋅ Shaoshu Su ⋅ Sharath Golluri ⋅ Haoze Zheng ⋅ Runmao Yao ⋅ Chen Wang
ExHall D Poster #109
Interpretable Image Classification via Non-parametric Part Prototype Learning Poster Session 2
Zhijie Zhu ⋅ Lei Fan ⋅ Maurice Pagnucco ⋅ Yang Song
ExHall D Poster #418
Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples Poster Session 3
WEIWEI LI ⋅ Junzhuo Liu ⋅ Yuanyuan Ren ⋅ Yuchen Zheng ⋅ Yahao Liu ⋅ Wen Li
ExHall D Poster #463
Mr. DETR: Instructive Multi-Route Training for Detection Transformers Poster Session 2
Chang-Bin Zhang ⋅ Yujie Zhong ⋅ Kai Han
ExHall D Poster #434
Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Models Poster Session 5
Davide Berasi ⋅ Matteo Farina ⋅ Massimiliano Mancini ⋅ Elisa Ricci ⋅ Nicola Strisciuglio
ExHall D Poster #371
SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction Poster Session 3
Enrico Pallotta ⋅ Sina Mokhtarzadeh Azar ⋅ Shuai Li ⋅ Olga Zatsarynna ⋅ Jürgen Gall
ExHall D Poster #300
HUSH: Holistic Panoramic 3D Scene Understanding using Spherical Harmonics Poster Session 4
Jongsung Lee ⋅ HARIN PARK ⋅ Byeong-Uk Lee ⋅ Kyungdon Joo
ExHall D Poster #73
SkillMimic: Learning Basketball Interaction Skills from Demonstrations Poster Session 4
Yinhuai Wang ⋅ Qihan Zhao ⋅ Runyi Yu ⋅ Hok Wai Tsui ⋅ Ailing Zeng ⋅ Jing Lin ⋅ Zhengyi Luo ⋅ Jiwen Yu ⋅ Xiu Li ⋅ Qifeng Chen ⋅ Jian Zhang ⋅ Lei Zhang ⋅ Ping Tan
ExHall D Poster #166
STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding Poster Session 1
Aaryan Garg ⋅ Akash Kumar ⋅ Yogesh S. Rawat
ExHall D Poster #307
SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis Poster Session 1
Junho Kim ⋅ Hyunjun Kim ⋅ Hosu Lee ⋅ Yong Man Ro
ExHall D Poster #304
RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head Avatars Poster Session 3
Linzhou Li ⋅ Yumeng Li ⋅ Yanlin Weng ⋅ Youyi Zheng ⋅ Kun Zhou
ExHall D Poster #9
EEE-Bench: A Comprehensive Multimodal Electrical And Electronics Engineering Benchmark Poster Session 3
Ming Li ⋅ Jike Zhong ⋅ Tianle Chen ⋅ Yuxiang Lai ⋅ Konstantinos Psounis
ExHall D Poster #256
Text-Driven Fashion Image Editing with Compositional Concept Learning and Counterfactual Abduction Poster Session 6
Shanshan Huang ⋅ Haoxuan Li ⋅ Chunyuan Zheng ⋅ Mingyuan Ge ⋅ Wei Gao ⋅ Lei Wang ⋅ Li Liu
ExHall D Poster #244
Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion Poster Session 5
Jona Ballé ⋅ Luca Versari ⋅ Emilien Dupont ⋅ Hyunjik Kim ⋅ Matthias Bauer
ExHall D Poster #210
AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting Poster Session 4
Chung-Ho Wu ⋅ Yang-Jung Chen ⋅ Ying-Huan Chen ⋅ Jie-Ying Lee ⋅ Bo-Hsu Ke ⋅ Chun-Wei Tuan Mu ⋅ Yichuan Huang ⋅ Chin-Yang Lin ⋅ Min-Hung Chen ⋅ Yen-Yu Lin ⋅ Yu-Lun Liu
ExHall D Poster #50
Towards Realistic Example-based Modeling via 3D Gaussian Stitching Poster Session 6
Xinyu Gao ⋅ Ziyi Yang ⋅ Bingchen Gong ⋅ Xiaoguang Han ⋅ Sipeng Yang ⋅ Xiaogang Jin
ExHall D Poster #42
WISNet: Pseudo Label Generation on Unbalanced and Patch Annotated Waste Images Poster Session 3
Shifan Zhang ⋅ Hongzi Zhu ⋅ Yinan He ⋅ Minyi Guo ⋅ Ziyang Lou ⋅ Shan Chang
ExHall D Poster #424
Saliuitl: Ensemble Salience Guided Recovery of Adversarial Patches against CNNs Poster Session 4
Mauricio Byrd Victorica ⋅ György Dán ⋅ Henrik Sandberg
ExHall D Poster #434
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach Poster Session 1
Jing Bi ⋅ Lianggong Bruce Wen ⋅ Zhang Liu ⋅ JunJia Guo ⋅ Yunlong Tang ⋅ Bingjie Wang ⋅ Chenliang Xu
ExHall D Poster #378
SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models Poster Session 3
Jaerin Lee ⋅ Daniel Jung ⋅ Kanggeon Lee ⋅ Kyoung Mu Lee
ExHall D Poster #226
Learning Partonomic 3D Reconstruction from Image Collections Poster Session 6
Xiaoqian Ruan ⋅ Pei Yu ⋅ Dian Jia ⋅ Hyeonjeong Park ⋅ Peixi Xiong ⋅ Wei Tang
ExHall D Poster #56
EVOS: Efficient Implicit Neural Training via EVOlutionary Selector Poster Session 6
Weixiang Zhang ⋅ Shuzhao Xie ⋅ Chengwei Ren ⋅ Siyi Xie ⋅ Chen Tang ⋅ Shijia Ge ⋅ Mingzi Wang ⋅ Zhi Wang
ExHall D Poster #414
CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation Poster Session 5
Kai Fang ⋅ Anqi Zhang ⋅ Guangyu Gao ⋅ Jianbo Jiao ⋅ Chi Harold Liu ⋅ Yunchao Wei
ExHall D Poster #442
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation Poster Session 4
Henghui Du ⋅ Guangyao Li ⋅ Chang Zhou ⋅ Chunjie Zhang ⋅ Alan Zhao ⋅ Di Hu
ExHall D Poster #288
Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection Poster Session 3
Le Yang ⋅ Ziwei Zheng ⋅ Boxu Chen ⋅ Zhengyu Zhao ⋅ Chenhao Lin ⋅ Chao Shen
ExHall D Poster #382
UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation Poster Session 3
Yinqiao Wang ⋅ Hao Xu ⋅ Pheng-Ann Heng ⋅ Chi-Wing Fu
ExHall D Poster #152
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Poster Session 3
Uri Gadot ⋅ Shie Mannor ⋅ Assaf Shocher ⋅ Gal Chechik ⋅ Assaf Hallak
ExHall D Poster #179
PHGC: Procedural Heterogeneous Graph Completion for Natural Language Task Verification in Egocentric Videos Poster Session 2
Xun Jiang ⋅ Zhiyi Huang ⋅ Xing Xu ⋅ Jingkuan Song ⋅ Fumin Shen ⋅ Heng Tao Shen
ExHall D Poster #309
Recognition-Synergistic Scene Text Editing Poster Session 3
Zhengyao Fang ⋅ Pengyuan Lyu ⋅ Jingjing Wu ⋅ Chengquan Zhang ⋅ Jun Yu ⋅ Guangming Lu ⋅ Wenjie Pei
ExHall D Poster #234
Fuzzy Multimodal Learning for Trusted Cross-modal Retrieval Poster Session 4
Siyuan Duan ⋅ Yuan Sun ⋅ Dezhong Peng ⋅ Zheng Liu ⋅ Xiaomin Song ⋅ Peng Hu
ExHall D Poster #470
GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration Poster Session 4
Yuchen Sun ⋅ Shanhui Zhao ⋅ Tao Yu ⋅ Hao Wen ⋅ Samith Va ⋅ Mengwei Xu ⋅ Yuanchun Li ⋅ Chongyang Zhang
ExHall D Poster #350
Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization Poster Session 2
Lingyun Zhang ⋅ Yu Xie ⋅ Yanwei Fu ⋅ Ping Chen
ExHall D Poster #267
LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model Poster Session 1
Xi Wang ⋅ Hongzhen Li ⋅ Heng Fang ⋅ YICHEN PENG ⋅ Haoran Xie ⋅ Xi Yang ⋅ Chuntao Li
ExHall D Poster #263
UniPhy: Learning a Unified Constitutive Model for Inverse Physics Simulation Poster Session 4
Himangi Mittal ⋅ Peiye Zhuang ⋅ Hsin-Ying Lee ⋅ Shubham Tulsiani
ExHall D Poster #34
SparseAlign: a Fully Sparse Framework for Cooperative Object Detection Poster Session 5
Yunshuang Yuan ⋅ Yan Xia ⋅ Daniel Cremers ⋅ Monika Sester
ExHall D Poster #119
4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion Poster Session 4
Chaoyang Wang ⋅ Peiye Zhuang ⋅ Tuan Duc Ngo ⋅ Willi Menapace ⋅ Aliaksandr Siarohin ⋅ Michael Vasilkovsky ⋅ Ivan Skorokhodov ⋅ Sergey Tulyakov ⋅ Peter Wonka ⋅ Hsin-Ying Lee
ExHall D Poster #183
Scene Map-based Prompt Tuning for Navigation Instruction Generation Poster Session 2
Sheng Fan ⋅ Rui Liu ⋅ Wenguan Wang ⋅ Yi Yang
ExHall D Poster #146
Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond) Poster Session 1
Tomer Garber ⋅ Tom Tirer
ExHall D Poster #210
HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models Poster Session 2
Mingzhen Huang ⋅ Fu-Jen Chu ⋅ Bugra Tekin ⋅ Kevin Liang ⋅ Haoyu Ma ⋅ Weiyao Wang ⋅ Xingyu Chen ⋅ Pierre Gleize ⋅ Hongfei Xue ⋅ Siwei Lyu ⋅ Kris Kitani ⋅ Matt Feiszli ⋅ Hao Tang
ExHall D Poster #170
I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models Poster Session 3
Dongnan Gui ⋅ Xun Guo ⋅ Wengang Zhou ⋅ Yan Lu
ExHall D Poster #186
Rashomon Sets for Prototypical-Part Networks: Editing Interpretable Models in Real-Time Poster Session 1
Jon Donnelly ⋅ Zhicheng Guo ⋅ Alina Jade Barnett ⋅ Hayden McTavish ⋅ Chaofan Chen ⋅ Cynthia Rudin
ExHall D Poster #418
HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery Poster Session 2
Yuto Matsubara ⋅ Ko Nishino
ExHall D Poster #99
GPS as a Control Signal for Image Generation Poster Session 1
Chao Feng ⋅ Ziyang Chen ⋅ Aleksander Holynski ⋅ Alexei A. Efros ⋅ Andrew Owens
ExHall D Poster #250
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM Poster Session 2
Vladimir Yugay ⋅ Theo Gevers ⋅ Martin R. Oswald
ExHall D Poster #131
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement Poster Session 4
Mark Boss ⋅ Zixuan Huang ⋅ Aaryaman Vasishta ⋅ Varun Jampani
ExHall D Poster #37
Towards Precise Embodied Dialogue Localization via Causality Guided Diffusion Poster Session 3
Haoyu Wang ⋅ Le Wang ⋅ Sanping Zhou ⋅ Jingyi Tian ⋅ Zheng Qin ⋅ Yabing Wang ⋅ Gang Hua ⋅ Wei Tang
ExHall D Poster #257
MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation Poster Session 6
Yukang Lin ⋅ Hokit Fung ⋅ Jianjin Xu ⋅ Zeping Ren ⋅ Adela S.M. Lau ⋅ Guosheng Yin ⋅ Xiu Li
ExHall D Poster #4
ProbPose: A Probabilistic Approach to 2D Human Pose Estimation Poster Session 6
Miroslav Purkrábek ⋅ Jiri Matas
ExHall D Poster #93
Model Poisoning Attacks to Federated Learning via Multi-Round Consistency Poster Session 3
Yueqi Xie ⋅ Minghong Fang ⋅ Neil Zhenqiang Gong
ExHall D Poster #460
Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition Poster Session 1
Wen Yin ⋅ Yong Wang ⋅ Guiduo Duan ⋅ Dongyang Zhang ⋅ XIN Hu ⋅ Yuan-Fang Li ⋅ Tao He
ExHall D Poster #354
Distilling Multi-modal Large Language Models for Autonomous Driving Poster Session 6
Deepti Hegde ⋅ Rajeev Yasarla ⋅ Hong Cai ⋅ Shizhong Han ⋅ Apratim Bhattacharyya ⋅ Shweta Mahajan ⋅ Litian Liu ⋅ Risheek Garrepalli ⋅ Vishal M. Patel ⋅ Fatih Porikli
ExHall D Poster #135
Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision Poster Session 3
Jinneyong Kim ⋅ Seung-Hwan Baek
ExHall D Poster #80
Camera Resection from Known Line Pencils and a Radially Distorted Scanline Poster Session 4
Juan Carlos Dibene Simental ⋅ Enrique Dunn
ExHall D Poster #80
LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos Poster Session 2
Daniel Etaat ⋅ Dvij Rajesh Kalaria ⋅ Nima Rahmanian ⋅ Shankar Sastry
ExHall D Poster #168
MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities Poster Session 3
Federico Lincetto ⋅ Gianluca Agresti ⋅ Mattia Rossi ⋅ Pietro Zanuttigh
ExHall D Poster #29
Dense-SfM: Structure from Motion with Dense Consistent Matching Poster Session 2
JongMin Lee ⋅ Sungjoo Yoo
ExHall D Poster #98
FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video Poster Session 6
Yue Gao ⋅ Hong-Xing Yu ⋅ Bo Zhu ⋅ Jiajun Wu
ExHall D Poster #32
SimVS: Simulating World Inconsistencies for Robust View Synthesis Poster Session 4
Alex Trevithick ⋅ Roni Paiss ⋅ Philipp Henzler ⋅ Dor Verbin ⋅ Rundi Wu ⋅ Hadi Alzayer ⋅ Ruiqi Gao ⋅ Ben Poole ⋅ Jonathan T. Barron ⋅ Aleksander Holynski ⋅ Ravi Ramamoorthi ⋅ Pratul P. Srinivasan
ExHall D Poster #60
Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning Poster Session 5
Qianli Ma ⋅ Xuefei Ning ⋅ Dongrui Liu ⋅ Li Niu ⋅ Linfeng Zhang
ExHall D Poster #212
HELVIPAD: A Real-World Dataset for Omnidirectional Stereo Depth Estimation Poster Session 6
Mehdi Zayene ⋅ Albias Havolli ⋅ Jannik Endres ⋅ Charles Corbière ⋅ Alexandre Ben Ahmed Kontouli ⋅ Salim Cherkaoui ⋅ Alex Alahi
ExHall D Poster #79
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation Poster Session 3
Ning Gao ⋅ Yilun Chen ⋅ Shuai Yang ⋅ Xinyi Chen ⋅ Yang Tian ⋅ Hao Li ⋅ Haifeng Huang ⋅ Hanqing Wang ⋅ Tai Wang ⋅ Jiangmiao Pang
ExHall D Poster #148
FrugalNeRF: Fast Convergence for Extreme Few-shot Novel View Synthesis without Learned Priors Poster Session 3
Chin-Yang Lin ⋅ Chung-Ho Wu ⋅ Changhan Yeh ⋅ Shih Han Yen ⋅ Cheng Sun ⋅ Yu-Lun Liu
ExHall D Poster #55
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons Poster Session 3
Andrew Szot ⋅ Bogdan Mazoure ⋅ Omar Attia ⋅ Aleksei Timofeev ⋅ Harsh Agrawal ⋅ R Devon Hjelm ⋅ Zhe Gan ⋅ Zsolt Kira ⋅ Alexander Toshev
ExHall D Poster #329
Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency Poster Session 5
Yikai Wang ⋅ Chenjie Cao ⋅ Junqiu Yu ⋅ Ke Fan ⋅ Xiangyang Xue ⋅ Yanwei Fu
ExHall D Poster #208
Audio-Visual Instance Segmentation Poster Session 3
Ruohao Guo ⋅ Xianghua Ying ⋅ Yaru Chen ⋅ Dantong Niu ⋅ Guangyao Li ⋅ Liao Qu ⋅ Yanyu Qi ⋅ Jinxing Zhou ⋅ Bowei Xing ⋅ Wenzhen Yue ⋅ Ji Shi ⋅ Qixun Wang ⋅ Peiliang Zhang ⋅ Buwen Liang
ExHall D Poster #277
ESCAPE: Equivariant Shape Completion via Anchor Point Encoding Poster Session 2
Burak Bekci ⋅ Nassir Navab ⋅ Federico Tombari ⋅ Mahdi Saleh
ExHall D Poster #105
SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs Poster Session 3
Guibiao Liao ⋅ Qing Li ⋅ Zhenyu Bao ⋅ Guoping Qiu ⋅ KANGLIN LIU
ExHall D Poster #58
IRGS: Inter-Reflective Gaussian Splatting with 2D Gaussian Ray Tracing Poster Session 3
Chun Gu ⋅ Xiaofei Wei ⋅ Zixuan Zeng ⋅ Yuxuan Yao ⋅ Li Zhang
ExHall D Poster #27
Variance-Based Membership Inference Attacks Against Large-Scale Image Captioning Models Poster Session 2
Daniel Samira ⋅ Edan Habler ⋅ Yuval Elovici ⋅ Asaf Shabtai
ExHall D Poster #366
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation Poster Session 4
Fu Feng ⋅ Yucheng Xie ⋅ Xu Yang ⋅ Jing Wang ⋅ Xin Geng
ExHall D Poster #255
Temporal Alignment-Free Video Matching for Few-shot Action Recognition Poster Session 2
SuBeen Lee ⋅ WonJun Moon ⋅ Hyun Seok Seong ⋅ Jae-Pil Heo
ExHall D Poster #302
FLAVC: Learned Video Compression with Feature Level Attention Poster Session 6
Chun Zhang ⋅ Heming Sun ⋅ Jiro Katto
ExHall D Poster #176
Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection Poster Session 1
Marc-Antoine Lavoie ⋅ Anas Mahmoud ⋅ Steven L. Waslander
ExHall D Poster #433
ERUPT: Efficient Rendering with Unposed Patch Transformer Poster Session 2
Maxim Shugaev ⋅ Vincent Chen ⋅ Maxim Karrenbach ⋅ Kyle Ashley ⋅ Bridget Kennedy ⋅ Naresh Cuntoor
ExHall D Poster #59
Empowering Large Language Models with 3D Situation Awareness Poster Session 4
Zhihao Yuan ⋅ Yibo Peng ⋅ Jinke Ren ⋅ Yinghong Liao ⋅ Yatong Han ⋅ Chun-Mei Feng ⋅ Hengshuang Zhao ⋅ Guanbin Li ⋅ Shuguang Cui ⋅ Zhen Li
ExHall D Poster #346
EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights Poster Session 4
Zhenghao Xing ⋅ Hao Chen ⋅ Binzhu Xie ⋅ Jiaqi Xu ⋅ Ziyu Guo ⋅ Xuemiao Xu ⋅ Jianye Hao ⋅ Chi-Wing Fu ⋅ Xiaowei Hu ⋅ Pheng-Ann Heng
ExHall D Poster #315
Improved Monocular Depth Prediction Using Distance Transform Over Pre-semantic Contours with Self-supervised Neural Networks Poster Session 5
Marwane Hariat ⋅ Antoine Manzanera ⋅ David Filliat
ExHall D Poster #78
FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering Poster Session 1
Jingqiu Zhou ⋅ Lue Fan ⋅ Linjiang Huang ⋅ Zhaoxiang Zhang ⋅ Xiaoyu Shi ⋅ Si Liu ⋅ Hongsheng Li
ExHall D Poster #129
Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs Poster Session 2
Yingji Zhong ⋅ Zhihao Li ⋅ Dave Zhenyu Chen ⋅ Lanqing Hong ⋅ Dan Xu
ExHall D Poster #66
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline Poster Session 4
Junlong Cheng ⋅ Bin Fu ⋅ Jin Ye ⋅ Guoan Wang ⋅ Tianbin Li ⋅ Haoyu Wang ⋅ Ruoyu Li ⋅ He Yao ⋅ Chen Junren ⋅ Jingwen Li ⋅ Yanzhou Su ⋅ Min Zhu ⋅ Junjun He
ExHall D Poster #479
FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error Poster Session 3
Beilin Chu ⋅ Xuan Xu ⋅ Xin Wang ⋅ Yufei Zhang ⋅ Weike You ⋅ Linna Zhou
ExHall D Poster #208
Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration Poster Session 6
Lizheng Zu ⋅ Lin Lin ⋅ Song Fu ⋅ Na Zhao ⋅ Pan Zhou
ExHall D Poster #321
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Poster Session 4
Alejandro Lozano ⋅ Min Woo Sun ⋅ James Burgess ⋅ Liangyu Chen ⋅ Jeffrey J Nirschl ⋅ Jeffrey Gu ⋅ Ivan Lopez ⋅ Josiah Aklilu ⋅ Austin Wolfgang Katzer ⋅ Collin Chiu ⋅ Anita Rau ⋅ Xiaohan Wang ⋅ Yuhui Zhang ⋅ Alfred Seunghoon Song ⋅ Robert Tibshirani ⋅ Serena Yeung
ExHall D Poster #374
Assessing and Learning Alignment of Unimodal Vision and Language Models Poster Session 3
Le Zhang ⋅ Qian Yang ⋅ Aishwarya Agrawal
ExHall D Poster #379
Samba: A Unified Mamba-based Framework for General Salient Object Detection Poster Session 5
Jiahao He ⋅ Keren Fu ⋅ Xiaohong Liu ⋅ Qijun Zhao
ExHall D Poster #408
PAVE: Patching and Adapting Video Large Language Models Poster Session 1
Zhuoming Liu ⋅ Yiquan Li ⋅ Khoi D Nguyen ⋅ Yiwu Zhong ⋅ Yin Li
ExHall D Poster #300
LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging Poster Session 6
Maximilian Rokuss ⋅ Yannick Kirchhoff ⋅ Seval Akbal ⋅ Balint Kovacs ⋅ Saikat Roy ⋅ Constantin Ulrich ⋅ Tassilo Wald ⋅ Lukas T. Rotkopf ⋅ Heinz-Peter Schlemmer ⋅ Klaus Maier-Hein
ExHall D Poster #452
Generative Map Priors for Collaborative BEV Semantic Segmentation Poster Session 3
Jiahui Fu ⋅ Yue Gong ⋅ Luting Wang ⋅ Shifeng Zhang ⋅ Xu Zhou ⋅ Si Liu
ExHall D Poster #123
3D-AVS: LiDAR-based 3D Auto-Vocabulary Segmentation Poster Session 2
Weijie Wei ⋅ Osman Ülger ⋅ Fatemeh Karimi Nejadasl ⋅ Theo Gevers ⋅ Martin R. Oswald
ExHall D Poster #339
FedCS: Coreset Selection for Federated Learning Poster Session 3
Chenhe Hao ⋅ Weiying Xie ⋅ Daixun Li ⋅ Haonan Qin ⋅ Hangyu Ye ⋅ Leyuan Fang ⋅ Yunsong Li
ExHall D Poster #458
DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models Poster Session 5
Haoyang Li ⋅ Liang Wang ⋅ Chao Wang ⋅ Jing Jiang ⋅ Yan Peng ⋅ Guodong Long
ExHall D Poster #438
Dual-Granularity Semantic Guided Sparse Routing Diffusion Model for General Pansharpening Poster Session 3
Yinghui Xing ⋅ Qu Li Tao ⋅ Shizhou Zhang ⋅ Di Xu ⋅ Yingkun Yang ⋅ Yanning Zhang
ExHall D Poster #192
SCSA: A Plug-and-Play Semantic Continuous-Sparse Attention for Arbitrary Semantic Style Transfer Poster Session 3
Chunnan Shang ⋅ Zhizhong Wang ⋅ Hongwei Wang ⋅ Xiangming Meng
ExHall D Poster #229
VIRES: Video Instance Repainting via Sketch and Text Guided Generation Poster Session 6
Shuchen Weng ⋅ Haojie Zheng ⋅ Peixuan Zhang ⋅ Yuchen Hong ⋅ Han Jiang ⋅ Si Li ⋅ Boxin Shi
ExHall D Poster #215
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model Poster Session 2
Feng Liu ⋅ Shiwei Zhang ⋅ Xiaofeng Wang ⋅ Yujie Wei ⋅ Haonan Qiu ⋅ Yuzhong Zhao ⋅ Yingya Zhang ⋅ Qixiang Ye ⋅ Fang Wan
ExHall D Poster #190
Towards Practical Real-Time Neural Video Compression Poster Session 3
Zhaoyang Jia ⋅ Bin Li ⋅ Jiahao Li ⋅ Wenxuan Xie ⋅ Linfeng Qi ⋅ Houqiang Li ⋅ Yan Lu
ExHall D Poster #180
Event Fields: Capturing Light Fields at High Speed, Resolution, and Dynamic Range Poster Session 6
Ziyuan Qu ⋅ Zihao Zou ⋅ Vivek Boominathan ⋅ Praneeth Chakravarthula ⋅ Adithya Pediredla
ExHall D Poster #73
Object-Shot Enhanced Grounding Network for Egocentric Video Poster Session 5
Yisen Feng ⋅ Haoyu Zhang ⋅ Meng Liu ⋅ Weili Guan ⋅ Liqiang Nie
ExHall D Poster #303
MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Action Anticipation Poster Session 1
Olga Zatsarynna ⋅ Emad Bahrami ⋅ Yazan Abu Farha ⋅ Gianpiero Francesca ⋅ Jürgen Gall
ExHall D Poster #312
METASCENES: Towards Automated Replica Creation for Real-world 3D Scans Poster Session 1
Huangyue Yu ⋅ Baoxiong Jia ⋅ Yixin Chen ⋅ Yandan Yang ⋅ Puhao Li ⋅ Rongpeng Su ⋅ Jiaxin Li ⋅ Qing Li ⋅ Wei Liang ⋅ Song-Chun Zhu ⋅ Tengyu Liu ⋅ Siyuan Huang
ExHall D Poster #140
PerLA: Perceptive 3D Language Assistant Poster Session 3
Guofeng Mei ⋅ Wei Lin ⋅ Luigi Riz ⋅ Yujiao Wu ⋅ Fabio Poiesi ⋅ Yiming Wang
ExHall D Poster #355
beta-FFT: Nonlinear Interpolation and Differentiated Training Strategies for Semi-Supervised Medical Image Segmentation Poster Session 6
Ming Hu ⋅ Jianfu Yin ⋅ Zhuangzhuang Ma ⋅ Jianheng Ma ⋅ Feiyu Zhu ⋅ Bingbing Wu ⋅ Ya Wen ⋅ Meng Wu ⋅ C Hu ⋅ Bingliang Hu ⋅ Quan Wang
ExHall D Poster #449
PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models Poster Session 3
Dhouib Mohamed ⋅ Davide Buscaldi ⋅ Vanier Sonia ⋅ Aymen Shabou
ExHall D Poster #377
LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors Poster Session 5
Han Zhou ⋅ Wei Dong ⋅ Jun Chen
ExHall D Poster #50
PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation Poster Session 4
Qiyao Xue ⋅ Xiangyu Yin ⋅ Boyuan Yang ⋅ Wei Gao
ExHall D Poster #290
Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation Poster Session 2
Hyeonho Jeong ⋅ Chun-Hao P. Huang ⋅ Jong Chul Ye ⋅ Niloy J. Mitra ⋅ Duygu Ceylan
ExHall D Poster #183
JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba Poster Session 3
Xiaoyong Lu ⋅ Songlin Du
ExHall D Poster #409
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models Poster Session 4
Keda Tao ⋅ Can Qin ⋅ Haoxuan You ⋅ Yang Sui ⋅ Huan Wang
ExHall D Poster #305
PersonaBooth: Personalized Text-to-Motion Generation Poster Session 5
Boeun Kim ⋅ Hea In Jeong ⋅ JungHoon Sung ⋅ Yihua Cheng ⋅ Jeongmin Lee ⋅ Ju Yong Chang ⋅ Sang-Il Choi ⋅ YOUNGGEUN CHOI ⋅ Saim Shin ⋅ Jungho Kim ⋅ Hyung Jin Chang
ExHall D Poster #161
MOS-Attack: A Scalable Multi-objective Adversarial Attack Framework Poster Session 1
Ping Guo ⋅ Cheng Gong ⋅ Fei Liu ⋅ Xi Lin ⋅ Zhichao Lu ⋅ Qingfu Zhang ⋅ Zhenkun Wang
ExHall D Poster #466
Motion Modes: What Could Happen Next? Poster Session 1
Karran Pandey ⋅ Yannick Hold-Geoffroy ⋅ Matheus Gadelha ⋅ Niloy J. Mitra ⋅ Karan Singh ⋅ Paul Guerrero
ExHall D Poster #175
Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation Poster Session 2
Ziheng Zhang ⋅ Jianyang Gu ⋅ Arpita Chowdhury ⋅ Zheda Mai ⋅ David Carlyn ⋅ Tanya Berger-Wolf ⋅ Yu Su ⋅ Wei-Lun Chao
ExHall D Poster #404
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations Poster Session 1
Savya Khosla ⋅ Sethuraman T V ⋅ Alexander G. Schwing ⋅ Derek Hoiem
ExHall D Poster #336
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views Poster Session 2
Ethan Griffiths ⋅ Maryam Haghighat ⋅ Simon Denman ⋅ Clinton Fookes ⋅ Milad Ramezani
ExHall D Poster #122
UniK3D: Universal Camera Monocular 3D Estimation Poster Session 1
Luigi Piccinelli ⋅ Christos Sakaridis ⋅ Mattia Segu ⋅ Yung-Hsu Yang ⋅ Siyuan Li ⋅ Wim Abbeloos ⋅ Luc Van Gool
ExHall D Poster #80
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer Poster Session 2
Jiayi Gao ⋅ Zijin Yin ⋅ Changcheng Hua ⋅ Yuxin Peng ⋅ Kongming Liang ⋅ Zhanyu Ma ⋅ Jun Guo ⋅ Yang Liu
ExHall D Poster #175
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery Poster Session 2
Sara Al-Emadi ⋅ Yin Yang ⋅ Ferda Ofli
ExHall D Poster #280
SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining Poster Session 2
Mingjin Zhang ⋅ Xiaolong Li ⋅ Fei Gao ⋅ Jie Guo ⋅ Xinbo Gao ⋅ Jing Zhang
ExHall D Poster #398
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models Poster Session 2
Zhejun Zhang ⋅ Peter Karkus ⋅ Maximilian Igl ⋅ Wenhao Ding ⋅ Yuxiao Chen ⋅ Boris Ivanovic ⋅ Marco Pavone
ExHall D Poster #334
Revisiting Fairness in Multitask Learning: A Performance-Driven Approach for Variance Reduction Poster Session 4
Xiaohan Qin ⋅ Xiaoxing Wang ⋅ Junchi Yan
ExHall D Poster #446
Learning from Synchronization: Self-Supervised Uncalibrated Multi-View Person Association in Challenging Scenes Poster Session 5
Keqi Chen ⋅ Vinkle Srivastav ⋅ Didier MUTTER ⋅ Nicolas Padoy
ExHall D Poster #324
RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network Poster Session 2
Van-Tin Luu ⋅ Yong-Lin Cai ⋅ Vu-Hoang Tran ⋅ Wei-Chen Chiu ⋅ Yi-Ting Chen ⋅ Ching-Chun Huang
ExHall D Poster #127
CLIP-driven Coarse-to-fine Semantic Guidance for Fine-grained Open-set Semi-supervised Learning Poster Session 6
Xiaokun Li ⋅ Yaping Huang ⋅ Qingji Guan
ExHall D Poster #399
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video Poster Session 3
Jiahe Li ⋅ Jiawei Zhang ⋅ Xiao Bai ⋅ Jin Zheng ⋅ Jun Zhou ⋅ Lin Gu
ExHall D Poster #4
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning Poster Session 4
Fan Lu ⋅ Wei Wu ⋅ Kecheng Zheng ⋅ Shuailei Ma ⋅ Biao Gong ⋅ Jiawei Liu ⋅ Wei Zhai ⋅ Yang Cao ⋅ Yujun Shen ⋅ Zheng-Jun Zha
ExHall D Poster #363
FreqDebias: Towards Generalizable Deepfake Detection via Consistency-Driven Frequency Debiasing Poster Session 2
Hossein Kashiani ⋅ Niloufar Alipour Talemi ⋅ Fatemeh Afghah
ExHall D Poster #325
4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians Poster Session 6
Hidenobu Matsuki ⋅ Gwangbin Bae ⋅ Andrew J. Davison
ExHall D Poster #74
Unseen Visual Anomaly Generation Poster Session 5
HAN SUN ⋅ Yunkang Cao ⋅ Hao Dong ⋅ Olga Fink
ExHall D Poster #427
ReNeg: Learning Negative Embedding with Reward Guidance Poster Session 5
Xiaomin Li ⋅ Yixuan Liu ⋅ Takashi Isobe ⋅ Xu Jia ⋅ Qinpeng Cui ⋅ Dong Zhou ⋅ Dong Li ⋅ You He ⋅ Huchuan Lu ⋅ Zhongdao Wang ⋅ Emad Barsoum
ExHall D Poster #249
Decoder Gradient Shield: Provable and High-Fidelity Prevention of Gradient-Based Box-Free Watermark Removal Poster Session 3
Haonan An ⋅ Guang Hua ⋅ Zhengru Fang ⋅ Guowen Xu ⋅ Susanto Rahardja ⋅ Yuguang Fang
ExHall D Poster #265
MotionPro: A Precise Motion Controller for Image-to-Video Generation Poster Session 6
Zhongwei Zhang ⋅ Fuchen Long ⋅ Zhaofan Qiu ⋅ Yingwei Pan ⋅ Wu Liu ⋅ Ting Yao ⋅ Tao Mei
ExHall D Poster #170
Goku: Flow Based Video Generative Foundation Models Poster Session 5
Shoufa Chen ⋅ Chongjian GE ⋅ Yuqi Zhang ⋅ Yida Zhang ⋅ Fengda Zhu ⋅ Hao Yang ⋅ Hongxiang Hao ⋅ Hui Wu ⋅ Zhichao Lai ⋅ Yifei Hu ⋅ Ting-Che Lin ⋅ Shilong Zhang ⋅ Fu Li ⋅ Chuan Li ⋅ Xing Wang ⋅ Yanghua Peng ⋅ Peize Sun ⋅ Ping Luo ⋅ Yi Jiang ⋅ Zehuan Yuan ⋅ BINGYUE PENG ⋅ Xiaobing Liu
ExHall D Poster #235
Learning Conditional Space-Time Prompt Distributions for Video Class-Incremental Learning Poster Session 1
Xiaohan Zou ⋅ Wenchao Ma ⋅ Shu Zhao
ExHall D Poster #449
Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering Poster Session 4
Cheng Sun ⋅ Jaesung Choe ⋅ Charles Loop ⋅ Wei-Chiu Ma ⋅ Yu-Chiang Frank Wang
ExHall D Poster #32
Rethinking Personalized Aesthetics Assessment: Employing Physique Aesthetics Assessment as An Exemplification Poster Session 1
Haobin Zhong ⋅ Shuai He ⋅ Anlong Ming ⋅ Huadong Ma
ExHall D Poster #265
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale Poster Session 1
Baorui Ma ⋅ Huachen Gao ⋅ Haoge Deng ⋅ Zhengxiong Luo ⋅ Tiejun Huang ⋅ Lulu Tang ⋅ Xinlong Wang
ExHall D Poster #172
Locality-Aware Zero-Shot Human-Object Interaction Detection Poster Session 4
Sanghyun Kim ⋅ Deunsol Jung ⋅ Minsu Cho
ExHall D Poster #418
PEACE: Empowering Geologic Map Holistic Understanding with MLLMs Poster Session 1
Yangyu Huang ⋅ Tianyi Gao ⋅ Haoran Xu ⋅ Qihao Zhao ⋅ Yang Song ⋅ Zhipeng Gui ⋅ Tengchao Lv ⋅ Hao Chen ⋅ Lei Cui ⋅ Scarlett Li ⋅ Furu Wei
ExHall D Poster #355
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion Poster Session 3
Xiyue Guo ⋅ Jiarui Hu ⋅ Junjie Hu ⋅ Hujun Bao ⋅ Guofeng Zhang
ExHall D Poster #124
MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation Poster Session 2
Sankalp Sinha ⋅ Mohammad Sadil Khan ⋅ Muhammad Usama ⋅ Shino Sam ⋅ Didier Stricker ⋅ Sk Aziz Ali ⋅ Muhammad Zeshan Afzal
ExHall D Poster #261
Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression Poster Session 4
Dohyun Kim ⋅ Sehwan Park ⋅ GeonHee Han ⋅ Seung Wook Kim ⋅ Paul Hongsuck Seo
ExHall D Poster #270
Interpretable Generative Models through Post-hoc Concept Bottlenecks Poster Session 2
Akshay R. Kulkarni ⋅ Ge Yan ⋅ Chung-En Sun ⋅ Tuomas Oikarinen ⋅ Tsui-Wei Weng
ExHall D Poster #266
SketchAgent: Language-Driven Sequential Sketch Generation Poster Session 5
Yael Vinker ⋅ Tamar Rott Shaham ⋅ Kristine Zheng ⋅ Alex Zhao ⋅ Judith Fan ⋅ Antonio Torralba
ExHall D Poster #220
DRAWER: Digital Reconstruction and Articulation With Environment Realism Poster Session 5
Hongchi Xia ⋅ Entong Su ⋅ Marius Memmel ⋅ Arhan Jain ⋅ Raymond Yu ⋅ Numfor Mbiziwo-Tiapo ⋅ Ali Farhadi ⋅ Abhishek Gupta ⋅ Shenlong Wang ⋅ Wei-Chiu Ma
ExHall D Poster #68
GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis Poster Session 5
You Wang ⋅ Li Fang ⋅ Hao Zhu ⋅ Fei Hu ⋅ Long Ye ⋅ Zhan Ma
ExHall D Poster #29
Deep Change Monitoring: A Hyperbolic Representative Learning Framework and a Dataset for Long-term Fine-grained Tree Change Detection Poster Session 6
Yante Li ⋅ Hanwen Qi ⋅ Haoyu Chen ⋅ Liang Xinlian ⋅ Guoying Zhao
ExHall D Poster #114
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training Poster Session 3
Kai Wang ⋅ Mingjia Shi ⋅ YuKun Zhou ⋅ Zekai Li ⋅ Xiaojiang Peng ⋅ Zhihang Yuan ⋅ Yuzhang Shang ⋅ Hanwang Zhang ⋅ Yang You
ExHall D Poster #218
Empowering LLMs to Understand and Generate Complex Vector Graphics Poster Session 4
XiMing Xing ⋅ Juncheng Hu ⋅ Guotao Liang ⋅ Jing Zhang ⋅ Dong Xu ⋅ Qian Yu
ExHall D Poster #351
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding Poster Session 3
Hongjia Zhai ⋅ Hai Li ⋅ Zhenzhe Li ⋅ Xiaokun Pan ⋅ Yijia He ⋅ Guofeng Zhang
ExHall D Poster #332
ROD-MLLM: Towards More Reliable Object Detection in Multimodal Large Language Models Poster Session 3
Heng Yin ⋅ Yuqiang Ren ⋅ Ke Yan ⋅ Shouhong Ding ⋅ Yongtao Hao
ExHall D Poster #354
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors Poster Session 5
Haifeng Huang ⋅ Xinyi Chen ⋅ Yilun Chen ⋅ Hao Li ⋅ Xiaoshen Han ⋅ Zehan Wang ⋅ Tai Wang ⋅ Jiangmiao Pang ⋅ Zhou Zhao
ExHall D Poster #141
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide Poster Session 1
Dohun Lee ⋅ Bryan Sangwoo Kim ⋅ Geon Yeong Park ⋅ Jong Chul Ye
ExHall D Poster #233
Illumination Spectrum Estimation for Multispectral Images via Surface Reflectance Modeling and Spatial-Spectral Feature Generation Poster Session 1
Hyejin Oh ⋅ Woo-Shik Kim ⋅ Sangyoon Lee ⋅ YungKyung Park ⋅ Jewon Kang
ExHall D Poster #192
UHD-processer: Unified UHD Image Restoration with Progressive Frequency Learning and Degradation-aware Prompts Poster Session 5
Yidi Liu ⋅ Dong Li ⋅ Xueyang Fu ⋅ Xin Lu ⋅ Jie Huang ⋅ Zheng-Jun Zha
ExHall D Poster #196
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models Poster Session 4
Jiacong Xu ⋅ Shao-Yuan Lo ⋅ Bardia Safaei ⋅ Vishal M. Patel ⋅ Isht Dwivedi
ExHall D Poster #435
AIpparel: A Multimodal Foundation Model for Digital Garments Poster Session 2
Kiyohiro Nakayama ⋅ Jan Ackermann ⋅ Timur Levent Kesdogan ⋅ Yang Zheng ⋅ Maria Korosteleva ⋅ Olga Sorkine-Hornung ⋅ Leonidas Guibas ⋅ Guandao Yang ⋅ Gordon Wetzstein
ExHall D Poster #264
Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction Poster Session 5
Dong Li ⋅ Wenqi Zhong ⋅ Wei Yu ⋅ Yingwei Pan ⋅ Dingwen Zhang ⋅ Ting Yao ⋅ Junwei Han ⋅ Tao Mei
ExHall D Poster #151
Exploring Contextual Attribute Density in Referring Expression Counting Poster Session 4
Zhicheng Wang ⋅ Zhiyu Pan ⋅ Zhan Peng ⋅ Jian Cheng ⋅ Liwen Xiao ⋅ Wei Jiang ⋅ Zhiguo Cao
ExHall D Poster #360
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment Poster Session 5
Dahyun Kang ⋅ Piotr Bojanowski ⋅ Huy V. Vo ⋅ Théo Moutakanni ⋅ Cijo Jose ⋅ Federico Baldassarre ⋅ Patrick Labatut ⋅ Michael Ramamonjisoa ⋅ Maxime Oquab ⋅ Timothée Darcet ⋅ Hu Xu ⋅ Shang-Wen Li ⋅ Oriane Simeoni ⋅ Marc Szafraniec
ExHall D Poster #370
Learning Affine Correspondences by Integrating Geometric Constraints Poster Session 6
Pengju Sun ⋅ Banglei Guan ⋅ Zhenbao Yu ⋅ Yang Shang ⋅ Qifeng Yu ⋅ Daniel Barath
ExHall D Poster #85
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows Poster Session 3
Shufan Li ⋅ Konstantinos Kallidromitis ⋅ Akash Gokul ⋅ Zichun Liao ⋅ Yusuke Kato ⋅ Kazuki Kozuka ⋅ Aditya Grover
ExHall D Poster #241
Multiple Object Tracking as ID Prediction Poster Session 6
Ruopeng Gao ⋅ Ji Qi ⋅ Limin Wang
ExHall D Poster #163
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos Poster Session 4
Yuzheng Liu ⋅ Siyan Dong ⋅ Shuzhe Wang ⋅ Yingda Yin ⋅ Yanchao Yang ⋅ Qingnan Fan ⋅ Baoquan Chen
ExHall D Poster #78
FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion Poster Session 1
Haosen Yang ⋅ Adrian Bulat ⋅ Isma Hadji ⋅ Hai X. Pham ⋅ Xiatian Zhu ⋅ Georgios Tzimiropoulos ⋅ Brais Martinez
ExHall D Poster #219
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing Poster Session 6
Yiheng Li ⋅ RuiBing Hou ⋅ Hong Chang ⋅ Shiguang Shan ⋅ Xilin Chen
ExHall D Poster #156
POMP: Physics-constrainable Motion Generative Model through Phase Manifolds Poster Session 5
Bin Ji ⋅ Ye Pan ⋅ Zhimeng Liu ⋅ Shuai Tan ⋅ Xiaogang Jin ⋅ Xiaokang Yang
ExHall D Poster #155
NN-Former: Rethinking Graph Structure in Neural Architecture Representation Poster Session 2
Ruihan Xu ⋅ Haokui Zhang ⋅ Yaowei Wang ⋅ Wei Zeng ⋅ Shiliang Zhang
ExHall D Poster #441
ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams Poster Session 6
Chris Dongjoo Kim ⋅ Jihwan Moon ⋅ Sangwoo Moon ⋅ Heeseung Yun ⋅ Sihaeng Lee ⋅ Aniruddha Kembhavi ⋅ Soonyoung Lee ⋅ Gunhee Kim ⋅ Sangho Lee ⋅ Christopher Clark
ExHall D Poster #277
Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization Poster Session 4
Siyan Dong ⋅ Shuzhe Wang ⋅ Shaohui Liu ⋅ Lulu Cai ⋅ Qingnan Fan ⋅ Juho Kannala ⋅ Yanchao Yang
ExHall D Poster #87
Multi-Modal Contrastive Masked Autoencoders: A Two-Stage Progressive Pre-training Approach for RGBD Datasets Poster Session 4
Muhammad Abdullah Jamal ⋅ Omid Mohareri
ExHall D Poster #204
HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos Poster Session 1
Jinglei Zhang ⋅ Jiankang Deng ⋅ Chao Ma ⋅ Rolandos Alexandros Potamias
ExHall D Poster #152
Font-Agent: Enhancing Font Understanding with Large Language Models Poster Session 4
Yingxin Lai ⋅ Cuijie Xu ⋅ Haitian Shi ⋅ Guoqing Yang ⋅ Xiaoning Li ⋅ Zhiming Luo ⋅ Shaozi Li
ExHall D Poster #368
RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models Poster Session 5
Greg Heinrich ⋅ Mike Ranzinger ⋅ Danny Yin ⋅ Yao Lu ⋅ Jan Kautz ⋅ Bryan Catanzaro ⋅ Andrew Tao ⋅ Pavlo Molchanov
ExHall D Poster #136
High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight Poster Session 1
Cédric Vincent ⋅ Taehyoung Kim ⋅ Henri Meeß
ExHall D Poster #121
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding Poster Session 4
Jinlong Li ⋅ Cristiano Saltori ⋅ Fabio Poiesi ⋅ Nicu Sebe
ExHall D Poster #342
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation Poster Session 3
Hao Du ⋅ Bo Wu ⋅ Yan Lu ⋅ Zhendong Mao
ExHall D Poster #301
Mixture of Submodules for Domain Adaptive Person Search Poster Session 3
Minsu Kim ⋅ Seungryong Kim ⋅ Kwanghoon Sohn
ExHall D Poster #320
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation Poster Session 4
Duc-Hai Pham ⋅ Tung Do ⋅ Phong Nguyen ⋅ Binh-Son Hua ⋅ Khoi Nguyen ⋅ Rang Nguyen
ExHall D Poster #119
MambaVision: A Hybrid Mamba-Transformer Vision Backbone Poster Session 5
Ali Hatamizadeh ⋅ Jan Kautz
ExHall D Poster #403
NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks Poster Session 2
Chenyi Zhang ⋅ Ting Liu ⋅ Xiaochao Qu ⋅ Luoqi Liu ⋅ Yao Zhao ⋅ Yunchao Wei
ExHall D Poster #340
Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation Poster Session 5
Markus Karmann ⋅ Onay Urfalioglu
ExHall D Poster #333
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval Poster Session 3
Yuanmin Tang ⋅ Jue Zhang ⋅ Xiaoting Qin ⋅ Jing Yu ⋅ Gaopeng Gou ⋅ Gang Xiong ⋅ Qingwei Lin ⋅ Saravan Rajmohan ⋅ Dongmei Zhang ⋅ Qi Wu
ExHall D Poster #359
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector Poster Session 6
Zechuan Li ⋅ Hongshan Yu ⋅ Yihao Ding ⋅ Jinhao Qiao ⋅ Basim Azam ⋅ Naveed Akhtar
ExHall D Poster #101
EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events Poster Session 4
Shuoyan Wei ⋅ Feng Li ⋅ Shengeng Tang ⋅ Yao Zhao ⋅ Huihui Bai
ExHall D Poster #186
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning Poster Session 5
Jinpeng Wang ⋅ Tianci Luo ⋅ Yaohua Zha ⋅ Yan Feng ⋅ Ruisheng Luo ⋅ Bin Chen ⋅ Tao Dai ⋅ Long Chen ⋅ Yaowei Wang ⋅ Shu-Tao Xia
ExHall D Poster #393
Rethinking Reconstruction and Denoising in the Dark: New Perspective, General Architecture and Beyond Poster Session 1
Long Ma ⋅ Tengyu Ma ⋅ Ziye Li ⋅ Yuetong Wang ⋅ Jinyuan Liu ⋅ Chengpei Xu ⋅ Risheng Liu
ExHall D Poster #203
Unity in Diversity: Video Editing via Gradient-Latent Purification Poster Session 5
Junyu Gao ⋅ Kunlin Yang ⋅ Xuan Yao ⋅ Yufan Hu
ExHall D Poster #224
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection Poster Session 6
Songhao Han ⋅ Wei Huang ⋅ Hairong Shi ⋅ Le Zhuo ⋅ Xiu Su ⋅ Shifeng Zhang ⋅ Xu Zhou ⋅ Xiaojuan Qi ⋅ Yue Liao ⋅ Si Liu
ExHall D Poster #266
INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations Poster Session 3
Yongming Zhu ⋅ Longhao Zhang ⋅ Zhengkun Rong ⋅ Tianshu Hu ⋅ Shuang Liang ⋅ Zhipengge
ExHall D Poster #2
Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D Motion Poster Session 5
Saad Lahlali ⋅ Sandra Kara ⋅ Hejer AMMAR ⋅ Florian Chabot ⋅ Nicolas Granger ⋅ Hervé Le Borgne ⋅ Quoc Cuong PHAM
ExHall D Poster #334
Deterministic Certification of Graph Neural Networks against Graph Poisoning Attacks with Arbitrary Perturbations Poster Session 1
Jiate Li ⋅ Meng Pang ⋅ Yun Dong ⋅ Binghui Wang
ExHall D Poster #464
Scene-Centric Unsupervised Panoptic Segmentation Poster Session 5
Oliver Hahn ⋅ Christoph Reich ⋅ Nikita Araslanov ⋅ Daniel Cremers ⋅ Christian Rupprecht ⋅ Stefan Roth
ExHall D Poster #330
UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing Poster Session 3
Yung-Hsuan Lai ⋅ Janek Ebbers ⋅ Yu-Chiang Frank Wang ⋅ François Germain ⋅ Michael J. Jones ⋅ Moitreya Chatterjee
ExHall D Poster #278
Mosaic of Modalities: A Comprehensive Benchmark for Multimodal Graph Learning Poster Session 3
Jing Zhu ⋅ Yuhang Zhou ⋅ Shengyi Qian ⋅ Zhongmou He ⋅ Tong Zhao ⋅ Neil Shah ⋅ Danai Koutra
ExHall D Poster #341
FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement Poster Session 3
Ian Huang ⋅ Yanan Bao ⋅ Karen Truong ⋅ Howard Zhou ⋅ Cordelia Schmid ⋅ Leonidas Guibas ⋅ Alireza Fathi
ExHall D Poster #269
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding Poster Session 1
Zhenxing Zhang ⋅ Yaxiong Wang ⋅ Lechao Cheng ⋅ Zhun Zhong ⋅ Dan Guo ⋅ Meng Wang
ExHall D Poster #365
MITracker: Multi-View Integration for Visual Object Tracking Poster Session 6
Mengjie Xu ⋅ Yitao Zhu ⋅ Haotian Jiang ⋅ Jiaming Li ⋅ Zhenrong Shen ⋅ Sheng Wang ⋅ Haolin Huang ⋅ Xinyu Wang ⋅ Han Zhang ⋅ Qing Yang ⋅ Qian Wang
ExHall D Poster #98
Dual Diffusion for Unified Image Generation and Understanding Poster Session 1
Zijie Li ⋅ Henry Li ⋅ Yichun Shi ⋅ Amir Barati Farimani ⋅ Yuval Kluger ⋅ Linjie Yang ⋅ Peng Wang
ExHall D Poster #251
StoryGPT-V: Large Language Models as Consistent Story Visualizers Poster Session 3
Xiaoqian Shen ⋅ Mohamed Elhoseiny
ExHall D Poster #250
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing Poster Session 2
Jinlu Zhang ⋅ Yixin Chen ⋅ Zan Wang ⋅ Jie Yang ⋅ Yizhou Wang ⋅ Siyuan Huang
ExHall D Poster #157
Towards Human-Understandable Multi-Dimensional Concept Discovery Poster Session 4
Arne Grobrügge ⋅ Niklas Kühl ⋅ Gerhard Satzger ⋅ Philipp Spitzer
ExHall D Poster #402
GPVK-VL: Geometry-Preserving Virtual Keyframes for Visual Localization under Large Viewpoint Changes Poster Session 4
Yunxuan Li ⋅ Lei Fan ⋅ Xiaoying Xing ⋅ Jianxiong Zhou ⋅ Ying Wu
ExHall D Poster #86
Graph Neural Network Combining Event Stream and Periodic Aggregation for Low-Latency Event-based Vision Poster Session 2
Manon Dampfhoffer ⋅ Thomas Mesquida ⋅ Damien Joubert ⋅ Thomas Dalgaty ⋅ Pascal Vivet ⋅ Christoph Posch
ExHall D Poster #147
Be More Specific: Evaluating Object-centric Realism in Synthetic Images Poster Session 6
Anqi Liang ⋅ Ciprian Adrian Corneanu ⋅ Qianli Feng ⋅ Giorgio Giannone ⋅ Aleix Martinez
ExHall D Poster #255
Towards Generalizable Trajectory Prediction using Dual-Level Representation Learning and Adaptive Prompting Poster Session 6
Kaouther Messaoud ⋅ Matthieu Cord ⋅ Alex Alahi
ExHall D Poster #134
Seeing A 3D World in A Grain of Sand Poster Session 3
Yufan Zhang ⋅ Yu Ji ⋅ Yu Guo ⋅ Jinwei Ye
ExHall D Poster #51
SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device Poster Session 1
Yushu Wu ⋅ Zhixing Zhang ⋅ Yanyu Li ⋅ Yanwu Xu ⋅ Anil Kag ⋅ Yang Sui ⋅ Huseyin Coskun ⋅ Ke Ma ⋅ Aleksei Lebedev ⋅ Ju Hu ⋅ Dimitris N. Metaxas ⋅ Yanzhi Wang ⋅ Sergey Tulyakov ⋅ Jian Ren
ExHall D Poster #221
Adapting Dense Matching for Homography Estimation with Grid-based Acceleration Poster Session 2
Kaining Zhang ⋅ Yuxin Deng ⋅ Jiayi Ma ⋅ Paolo Favaro
ExHall D Poster #84
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis Poster Session 3
Mengtian Li ⋅ Jinshu Chen ⋅ Wanquan Feng ⋅ Bingchuan Li ⋅ Fei Dai ⋅ Songtao Zhao ⋅ Qian HE
ExHall D Poster #235
DropGaussian: Structural Regularization for Sparse-view Gaussian Splatting Poster Session 5
Hyunwoo Park ⋅ Gun Ryu ⋅ Wonjun Kim
ExHall D Poster #52
DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation Poster Session 5
Ziyu Zhao ⋅ Xiaoguang Li ⋅ Lingjia Shi ⋅ Nasrin Imanpour ⋅ Song Wang
ExHall D Poster #411
One Diffusion to Generate Them All Poster Session 1
Duong H. Le ⋅ Tuan Pham ⋅ Sangho Lee ⋅ Christopher Clark ⋅ Aniruddha Kembhavi ⋅ Stephan Mandt ⋅ Ranjay Krishna ⋅ Jiasen Lu
ExHall D Poster #240
Bias for Action: Video Implicit Neural Representations with Bias Modulation Poster Session 6
Alper Kayabasi ⋅ Anil Kumar Vadathya ⋅ Guha Balakrishnan ⋅ Vishwanath Saragadam
ExHall D Poster #174
Let's Verify and Reinforce Image Generation Step by Step Poster Session 6
Renrui Zhang ⋅ Chengzhuo Tong ⋅ Zhizheng Zhao ⋅ Ziyu Guo ⋅ Haoquan Zhang ⋅ Manyuan Zhang ⋅ Jiaming Liu ⋅ Peng Gao ⋅ Hongsheng Li
ExHall D Poster #238
GLASS: Guided Latent Slot Diffusion for Object-Centric Learning Poster Session 6
Krishnakant Singh ⋅ Simone Schaub-Meyer ⋅ Stefan Roth
ExHall D Poster #239
Multimodal Autoregressive Pre-training of Large Vision Encoders Poster Session 2
Enrico Fini ⋅ Mustafa Shukor ⋅ Xiujun Li ⋅ Philipp Dufter ⋅ Michal Klein ⋅ David Haldimann ⋅ Sai Aitharaju ⋅ Victor Guilherme Turrisi da Costa ⋅ Louis Béthune ⋅ Zhe Gan ⋅ Alexander Toshev ⋅ Marcin Eichner ⋅ Moin Nabi ⋅ Yinfei Yang ⋅ Joshua Susskind ⋅ Alaaeldin El-Nouby
ExHall D Poster #407
UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning Poster Session 2
Long Zhou ⋅ Fereshteh Shakeri ⋅ Aymen Sadraoui ⋅ Mounir Kaaniche ⋅ Jean-Christophe Pesquet ⋅ Ismail Ben Ayed
ExHall D Poster #409
SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds Poster Session 6
Jinfeng Xu ⋅ Xianzhi Li ⋅ Yuan Tang ⋅ Xu Han ⋅ Qiao Yu ⋅ Yixue Hao ⋅ Long Hu ⋅ Min Chen
ExHall D Poster #109
Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption Poster Session 3
Du CHEN ⋅ Tianhe Wu ⋅ Kede Ma ⋅ Lei Zhang
ExHall D Poster #200
Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification Poster Session 2
Zequn Zeng ⋅ Yudi Su ⋅ Jianqiao Sun ⋅ Tiansheng Wen ⋅ Hao Zhang ⋅ Zhengjue Wang ⋅ Bo Chen ⋅ Hongwei Liu ⋅ Jiawei Ma
ExHall D Poster #395
Hazy Low-Quality Satellite Video Restoration Via Learning Optimal Joint Degradation Patterns and Continuous-Scale Super-Resolution Reconstruction Poster Session 3
Ning Ni ⋅ Libao Zhang
ExHall D Poster #195
Textured Gaussians for Enhanced 3D Scene Appearance Modeling Poster Session 2
Brian Chao ⋅ Hung-Yu Tseng ⋅ Lorenzo Porzi ⋅ Chen Gao ⋅ Tuotuo Li ⋅ Qinbo Li ⋅ Ayush Saraf ⋅ Jia-Bin Huang ⋅ Johannes Kopf ⋅ Gordon Wetzstein ⋅ Changil Kim
ExHall D Poster #344
OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary Poster Session 6
Yifeng Yang ⋅ Lin Zhu ⋅ Zewen Sun ⋅ Hengyu Liu ⋅ Qinying Gu ⋅ Nanyang Ye
ExHall D Poster #429
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering Poster Session 6
Md Mohaiminul Islam ⋅ Tushar Nagarajan ⋅ Huiyu Wang ⋅ Gedas Bertasius ⋅ Lorenzo Torresani
ExHall D Poster #282
Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment Poster Session 6
Johannes Schusterbauer ⋅ Ming Gui ⋅ Frank Fundel ⋅ Björn Ommer
ExHall D Poster #208
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Poster Session 2
Yiyang Ma ⋅ Xingchao Liu ⋅ Xiaokang Chen ⋅ Wen Liu ⋅ Chengyue Wu ⋅ Zhiyu Wu ⋅ Zizheng Pan ⋅ Zhenda Xie ⋅ Haowei Zhang ⋅ Xingkai Yu ⋅ Liang Zhao ⋅ Yisong Wang ⋅ Jiaying Liu ⋅ Chong Ruan
ExHall D Poster #227
Visual Prompting for One-shot Controllable Video Editing without Inversion Poster Session 2
Zhengbo Zhang ⋅ Yuxi Zhou ⋅ DUO PENG ⋅ Joo Lim ⋅ Zhigang Tu ⋅ De Soh Soh ⋅ Lin Geng Foo
ExHall D Poster #231
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data Poster Session 4
Hanwen Jiang ⋅ Zexiang Xu ⋅ Desai Xie ⋅ Chen Ziwen ⋅ Haian Jin ⋅ Fujun Luan ⋅ ZHIXIN SHU ⋅ Kai Zhang ⋅ Sai Bi ⋅ Xin Sun ⋅ Jiuxiang Gu ⋅ Qixing Huang ⋅ Georgios Pavlakos ⋅ Hao Tan
ExHall D Poster #57
Prof. Robot: Differentiable Robot Rendering Without Static and Self-Collisions Poster Session 5
Quanyuan Ruan ⋅ Jiabao Lei ⋅ Wenhao Yuan ⋅ Yanglin Zhang ⋅ Dekun Lu ⋅ Guiliang Liu ⋅ Kui Jia
ExHall D Poster #143
Flash-Split: 2D Reflection Removal with Flash Cues and Latent Diffusion Separation Poster Session 2
Tianfu Wang ⋅ Mingyang Xie ⋅ Haoming Cai ⋅ Sachin Shah ⋅ Christopher Metzler
ExHall D Poster #23
PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches Poster Session 2
Dennis Jacob ⋅ Chong Xiang ⋅ Prateek Mittal
ExHall D Poster #435
StarVector: Generating Scalable Vector Graphics Code from Images and Text Poster Session 4
Juan Rodriguez ⋅ Abhay Puri ⋅ Shubham Agarwal ⋅ Issam Laradji ⋅ Pau Rodriguez ⋅ Sai Rajeswar ⋅ David Vazquez ⋅ Christopher Pal ⋅ Marco Pedersoli
ExHall D Poster #31
Novel View Synthesis with Pixel-Space Diffusion Models Poster Session 6
Noam Elata ⋅ Bahjat Kawar ⋅ Yaron Ostrovsky-Berman ⋅ Miriam Farber ⋅ Ron Sokolovsky
ExHall D Poster #59
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction Poster Session 1
Zhimin Liao ⋅ Ping Wei ⋅ Shuaijia Chen ⋅ Haoxuan Wang ⋅ Ziyang Ren
ExHall D Poster #126
Link to the Past: Temporal Propagation for Fast 3D Human Reconstruction from Monocular Video Poster Session 2
Marchellus Matthew ⋅ Nadhira Noor ⋅ In Kyu Park
ExHall D Poster #72
SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering Poster Session 4
Hanxiao Sun ⋅ Yupeng Gao ⋅ Jin Xie ⋅ Jian Yang ⋅ Beibei Wang
ExHall D Poster #28
Beyond Single-Modal Boundary: Cross-Modal Anomaly Detection through Visual Prototype and Harmonization Poster Session 2
Kai Mao ⋅ Ping Wei ⋅ Yiyang Lian ⋅ Yangyang Wang ⋅ Nanning Zheng
ExHall D Poster #437
VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents Poster Session 5
Ryota Tanaka ⋅ Taichi Iki ⋅ Taku Hasegawa ⋅ Kyosuke Nishida ⋅ Kuniko Saito ⋅ Jun Suzuki
ExHall D Poster #363
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models Poster Session 1
Mahtab Bigverdi ⋅ Zelun Luo ⋅ Cheng-Yu Hsieh ⋅ Ethan Shen ⋅ Dongping Chen ⋅ Linda Shapiro ⋅ Ranjay Krishna
ExHall D Poster #349
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning Poster Session 1
David Junhao Zhang ⋅ Roni Paiss ⋅ Shiran Zada ⋅ Nikhil Karnad ⋅ David E. Jacobs ⋅ Yael Pritch ⋅ Inbar Mosseri ⋅ Mike Zheng Shou ⋅ Neal Wadhwa ⋅ Nataniel Ruiz
ExHall D Poster #177
Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers? Poster Session 6
Zebin You ⋅ Xinyu Zhang ⋅ Hanzhong Guo ⋅ Jingdong Wang ⋅ Chongxuan Li
ExHall D Poster #250
X-Dyna: Expressive Dynamic Human Image Animation Poster Session 2
Di Chang ⋅ Hongyi Xu ⋅ You Xie ⋅ Yipeng Gao ⋅ Zhengfei Kuang ⋅ Shengqu Cai ⋅ Chenxu Zhang ⋅ Guoxian Song ⋅ Chao Wang ⋅ Yichun Shi ⋅ Zeyuan Chen ⋅ Shijie Zhou ⋅ Linjie Luo ⋅ Gordon Wetzstein ⋅ Mohammad Soleymani
ExHall D Poster #5
Understanding Multi-layered Transmission Matrices Poster Session 5
Marina Alterman ⋅ Anat Levin
ExHall D Poster #201
GS-DiT: Advancing Video Generation with Dynamic 3D Gaussian Fields through Efficient Dense 3D Point Tracking Poster Session 5
Weikang Bian ⋅ Zhaoyang Huang ⋅ Xiaoyu Shi ⋅ Yijin Li ⋅ Fu-Yun Wang ⋅ Hongsheng Li
ExHall D Poster #63
Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation Poster Session 6
Xinhao Zhong ⋅ Hao Fang ⋅ Bin Chen ⋅ Xulin Gu ⋅ Meikang Qiu ⋅ Shuhan Qi ⋅ Shu-Tao Xia
ExHall D Poster #413
AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models Poster Session 6
Kwan Yun ⋅ Seokhyeon Hong ⋅ Chaelin Kim ⋅ Junyong Noh
ExHall D Poster #159
ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models Poster Session 6
Ozgur Kara ⋅ Krishna Kumar Singh ⋅ Feng Liu ⋅ Duygu Ceylan ⋅ James Rehg ⋅ Tobias Hinz
ExHall D Poster #213
Context-Aware Multimodal Pretraining Poster Session 1
Karsten Roth ⋅ Zeynep Akata ⋅ Dima Damen ⋅ Ivana Balazevic ⋅ Olivier J Henaff
ExHall D Poster #391
Sound Bridge: Associating Egocentric and Exocentric Videos via Audio Cues Poster Session 6
Sihong Huang ⋅ Jiaxin Wu ⋅ Xiaoyong Wei ⋅ Yi Cai ⋅ Dongmei Jiang ⋅ Yaowei Wang
ExHall D Poster #265
LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models Poster Session 6
Fan-Yun Sun ⋅ Weiyu Liu ⋅ Siyi Gu ⋅ Dylan Lim ⋅ Goutam Bhat ⋅ Federico Tombari ⋅ Manling Li ⋅ Nick Haber ⋅ Jiajun Wu
ExHall D Poster #317
ODA-GAN: Orthogonal Decoupling Alignment GAN Assisted by Weakly-supervised Learning for Virtual Immunohistochemistry Staining Poster Session 5
Tong Wang ⋅ Mingkang Wang ⋅ Zhongze Wang ⋅ Hongkai Wang ⋅ Qi Xu ⋅ Fengyu Cong ⋅ Hongming Xu
ExHall D Poster #469
Faster Parameter-Efficient Tuning with Token Redundancy Reduction Poster Session 6
Kwonyoung Kim ⋅ Jungin Park ⋅ Jin Kim ⋅ Hyeongjun Kwon ⋅ Kwanghoon Sohn
ExHall D Poster #387
HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation Poster Session 6
Trong-Thuan Nguyen ⋅ Pha Nguyen ⋅ Jackson Cothren ⋅ Alper Yilmaz ⋅ Khoa Luu
ExHall D Poster #287
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers Poster Session 3
Hui Zhang ⋅ Tingwei Gao ⋅ Jie Shao ⋅ Zuxuan Wu
ExHall D Poster #214
DynPose: Largely Improving the Efficiency of Human Pose Estimation by a Simple Dynamic Framework Poster Session 1
Yalong Xu ⋅ Lin Zhao ⋅ Chen Gong ⋅ Guangyu Li ⋅ Di Wang ⋅ Nannan Wang
ExHall D Poster #92
Dynamic Neural Surfaces for Elastic 4D Shape Representation and Analysis Poster Session 5
Awais Nizamani ⋅ Hamid Laga ⋅ Guanjin Wang ⋅ Farid Boussaid ⋅ Mohammed Bennamoun ⋅ Anuj Srivastava
ExHall D Poster #70
Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering Poster Session 3
Yuanlin Wang ⋅ Yiyang Zhang ⋅ Ruiqin Xiong ⋅ Jing Zhao ⋅ Jian Zhang ⋅ Xiaopeng Fan ⋅ Tiejun Huang
ExHall D Poster #72
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems Poster Session 5
Xiangyuan Xue ⋅ Zeyu Lu ⋅ Di Huang ⋅ ZiDong Wang ⋅ Wanli Ouyang ⋅ Lei Bai
ExHall D Poster #343
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos Poster Session 4
Shehan Munasinghe ⋅ Hanan Gani ⋅ Wenqi Zhu ⋅ Jiale Cao ⋅ Eric P. Xing ⋅ Fahad Shahbaz Khan ⋅ Salman Khan
ExHall D Poster #309
DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models Poster Session 3
Radu Alexandru Rosu ⋅ Keyu Wu ⋅ Yao Feng ⋅ Youyi Zheng ⋅ Michael J. Black
ExHall D Poster #18
Hyperbolic Category Discovery Poster Session 2
Yuanpei Liu ⋅ Zhenqi He ⋅ Kai Han
ExHall D Poster #430
CALICO: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models Poster Session 1
Kiet A. Nguyen ⋅ Adheesh Juvekar ⋅ Tianjiao Yu ⋅ Muntasir Wahed ⋅ Ismini Lourentzou
ExHall D Poster #420
Cross-modal Causal Relation Alignment for Video Question Grounding Poster Session 5
Weixing Chen ⋅ Yang Liu ⋅ Binglin Chen ⋅ Jiandong Su ⋅ Yongsen Zheng ⋅ Liang Lin
ExHall D Poster #293
Self-Expansion of Pre-trained Models with Mixture of Adapters for Continual Learning Poster Session 2
Huiyi Wang ⋅ Haodong Lu ⋅ Lina Yao ⋅ Dong Gong
ExHall D Poster #449
Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection Poster Session 1
Ziqi Li ⋅ Tao Gao ⋅ Yisheng An ⋅ Ting Chen ⋅ Jing Zhang ⋅ Yuanbo Wen ⋅ Mengkun Liu ⋅ Qianxi Zhang
ExHall D Poster #322
Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free Unsupervised Domain Adaptation Poster Session 6
Peihua Deng ⋅ Jiehua Zhang ⋅ Xichun Sheng ⋅ Chenggang Yan ⋅ Yaoqi Sun ⋅ Ying Fu ⋅ Liang Li
ExHall D Poster #423
A Polarization-Aided Transformer for Image Deblurring via Motion Vector Decomposition Poster Session 6
Duosheng Chen ⋅ Shihao Zhou ⋅ Jinshan Pan ⋅ Jinglei Shi ⋅ Lishen Qu ⋅ Jufeng Yang
ExHall D Poster #180
Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering Poster Session 2
Liang Chen ⋅ Zhe Xue ⋅ Yawen Li ⋅ Meiyu Liang ⋅ Yan Wang ⋅ Anton van den Hengel ⋅ Yuankai Qi
ExHall D Poster #469
Enhancing Creative Generation on Stable Diffusion-based Models Poster Session 6
Jiyeon Han ⋅ Dahee Kwon ⋅ Gayoung Lee ⋅ Junho Kim ⋅ Jaesik Choi
ExHall D Poster #233
Denoising Functional Maps: Diffusion Models for Shape Correspondence Poster Session 6
Aleksei Zhuravlev ⋅ Zorah Lähner ⋅ Vladislav Golyanik
ExHall D Poster #72
GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion Poster Session 2
Jiapeng Tang ⋅ Davide Davoli ⋅ Tobias Kirschstein ⋅ Liam Schoneveld ⋅ Matthias Nießner
ExHall D Poster #10
AutoURDF: Unsupervised Robot Modeling from Point Cloud Frames Using Cluster Registration Poster Session 6
Jiong Lin ⋅ Lechen Zhang ⋅ Kwansoo Lee ⋅ Jialong Ning ⋅ Judah A Goldfeder ⋅ Hod Lipson
ExHall D Poster #140
Building Vision Models upon Heat Conduction Poster Session 2
Zhaozhi Wang ⋅ Yue Liu ⋅ Yunjie Tian ⋅ Yunfan Liu ⋅ Yaowei Wang ⋅ Qixiang Ye
ExHall D Poster #413
ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping Poster Session 4
Shun Iwase ⋅ Muhammad Zubair Irshad ⋅ Katherine Liu ⋅ Vitor Guizilini ⋅ Robert Lee ⋅ Takuya Ikeda ⋅ Ayako Amma ⋅ Koichi Nishiwaki ⋅ Kris Kitani ⋅ Rares Andrei Ambrus ⋅ Sergey Zakharov
ExHall D Poster #154
Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation Poster Session 3
Jingxi Chen ⋅ Brandon Y. Feng ⋅ Haoming Cai ⋅ Tianfu Wang ⋅ Levi Burner ⋅ Dehao Yuan ⋅ Cornelia Fermuller ⋅ Christopher Metzler ⋅ Yiannis Aloimonos
ExHall D Poster #172
Progress-Aware Video Frame Captioning Poster Session 3
Zihui Xue ⋅ Joungbin An ⋅ Xitong Yang ⋅ Kristen Grauman
ExHall D Poster #285
Multi-modal Contrastive Learning with Negative Sampling Calibration for Phenotypic Drug Discovery Poster Session 6
Jiahua Rao ⋅ Hanjing Lin ⋅ Leyu Chen ⋅ Jiancong Xie ⋅ Shuangjia Zheng ⋅ Yuedong Yang
ExHall D Poster #441
R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning Poster Session 6
Lijun Sheng ⋅ Jian Liang ⋅ Zilei Wang ⋅ Ran He
ExHall D Poster #364
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis Poster Session 5
Hyojun Go ⋅ Byeongjun Park ⋅ Jiho Jang ⋅ Jin-Young Kim ⋅ Soonwoo Kwon ⋅ Changick Kim
ExHall D Poster #45
GBC-Splat: Generalizable Gaussian-Based Clothed Human Digitalization under Sparse RGB Cameras Poster Session 6
Hanzhang Tu ⋅ Zhanfeng Liao ⋅ Boyao Zhou ⋅ Shunyuan Zheng ⋅ Xilong Zhou ⋅ Liuxin ZHANG ⋅ QianYing Wang ⋅ Yebin Liu
ExHall D Poster #18
SoundVista: Novel-View Ambient Sound Synthesis via Visual-Acoustic Binding Poster Session 2
Mingfei Chen ⋅ Israel D. Gebru ⋅ Ishwarya Ananthabhotla ⋅ Christian Richardt ⋅ Dejan Markovic ⋅ Steven Krenn ⋅ Todd Keebler ⋅ Jacob Sandakly ⋅ Alexander Richard ⋅ Eli Shlizerman
ExHall D Poster #283
Omni-ID: Holistic Identity Representation Designed for Generative Tasks Poster Session 2
Guocheng Qian ⋅ Kuan-Chieh Wang ⋅ Or Patashnik ⋅ Negin Heravi ⋅ Daniil Ostashev ⋅ Sergey Tulyakov ⋅ Daniel Cohen-Or ⋅ Kfir Aberman
ExHall D Poster #326
IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos Poster Session 5
Yuan Li ⋅ Ziqian Bai ⋅ Feitong Tan ⋅ Zhaopeng Cui ⋅ Sean Fanello ⋅ Yinda Zhang
ExHall D Poster #6
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments Poster Session 4
Ege Özsoy ⋅ Chantal Pellegrini ⋅ Tobias Czempiel ⋅ Felix Tristram ⋅ Kun Yuan ⋅ David Bani-Harouni ⋅ Ulrich Eck ⋅ Benjamin Busam ⋅ Matthias Keicher ⋅ Nassir Navab
ExHall D Poster #341
MIRE: Matched Implicit Neural Representations Poster Session 2
Dhananjaya Jayasundara ⋅ Heng Zhao ⋅ Demetrio Labate ⋅ Vishal M. Patel
ExHall D Poster #278
AeSPa: Attention-guided Self-supervised Parallel Imaging for MRI Reconstruction Poster Session 1
Jinho Joo ⋅ Hyeseong Kim ⋅ Hyeyeon Won ⋅ Deukhee Lee ⋅ Taejoon Eo ⋅ Dosik Hwang
ExHall D Poster #483
Visual Consensus Prompting for Co-Salient Object Detection Poster Session 2
Jie Wang ⋅ Nana Yu ⋅ Zihao Zhang ⋅ Yahong Han
ExHall D Poster #402
MagicArticulate: Make Your 3D Models Articulation-Ready Poster Session 4
Chaoyue Song ⋅ Jianfeng Zhang ⋅ Xiu Li ⋅ Fan Yang ⋅ Yiwen Chen ⋅ Zhongcong Xu ⋅ Jun Hao Liew ⋅ Xiaoyang Guo ⋅ Fayao Liu ⋅ Jiashi Feng ⋅ Guosheng Lin
ExHall D Poster #13
CamPoint: Boosting Point Cloud Segmentation with Virtual Camera Poster Session 3
Jianhui Zhang ⋅ Luo Yizhi ⋅ Zicheng Zhang ⋅ Xuecheng Nie ⋅ Bonan Li
ExHall D Poster #114
Towards Generalizable Scene Change Detection Poster Session 5
Jae-Woo KIM ⋅ Ue-Hwan Kim
ExHall D Poster #328
LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions Poster Session 1
Faridoun Mehri ⋅ Mahdieh Baghshah ⋅ Mohammad Taher Pilehvar
ExHall D Poster #397
LightLoc: Learning Outdoor LiDAR Localization at Light Speed Poster Session 2
Wen Li ⋅ Chen Liu ⋅ Shangshu Yu ⋅ dq Liu ⋅ Yin Zhou ⋅ Siqi Shen ⋅ Chenglu Wen ⋅ Cheng Wang
ExHall D Poster #125
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection Poster Session 5
Gensheng Pei ⋅ Tao Chen ⋅ Yujia Wang ⋅ Xinhao Cai ⋅ Xiangbo Shu ⋅ Tianfei Zhou ⋅ Yazhou Yao
ExHall D Poster #366
Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification Poster Session 1
Dongseob Kim ⋅ Hyunjung Shim
ExHall D Poster #430
UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units Poster Session 2
Huakun Liu ⋅ Hiroki Ota ⋅ Xin Wei ⋅ Yutaro Hirao ⋅ Monica Perusquia-Hernandez ⋅ Hideaki Uchiyama ⋅ Kiyoshi Kiyokawa
ExHall D Poster #165
STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models Poster Session 5
Koushik Srivatsan ⋅ Fahad Shamshad ⋅ Muzammal Naseer ⋅ Vishal M. Patel ⋅ Karthik Nandakumar
ExHall D Poster #263
GenVDM: Generating Vector Displacement Maps From a Single Image Poster Session 6
Yuezhi Yang ⋅ Qimin Chen ⋅ Vladimir G. Kim ⋅ Siddhartha Chaudhuri ⋅ Qixing Huang ⋅ Zhiqin Chen
ExHall D Poster #44
Effective SAM Combination for Open-Vocabulary Semantic Segmentation Poster Session 6
Minhyeok Lee ⋅ Suhwan Cho ⋅ Jungho Lee ⋅ Sunghun Yang ⋅ Heeseung Choi ⋅ Ig-Jae Kim ⋅ Sangyoun Lee
ExHall D Poster #383
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding Poster Session 3
Pedro Hermosilla ⋅ Christian Stippel ⋅ Leon Sick
ExHall D Poster #400
ScribbleLight: Single Image Indoor Relighting with Scribbles Poster Session 2
Jun Myeong Choi ⋅ Annie N. Wang ⋅ Pieter Peers ⋅ Anand Bhattad ⋅ Roni Sengupta
ExHall D Poster #26
Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration Poster Session 4
Haipeng Fang ⋅ Sheng Tang ⋅ Juan Cao ⋅ Enshuo Zhang ⋅ Fan Tang ⋅ Tong-yee Lee
ExHall D Poster #217
Quantization without Tears Poster Session 1
Minghao Fu ⋅ Hao Yu ⋅ Jie Shao ⋅ Junjie Zhou ⋅ Ke Zhu ⋅ Jianxin Wu
ExHall D Poster #412
Turbo3D: Ultra-fast Text-to-3D Generation Poster Session 5
Hanzhe Hu ⋅ Tianwei Yin ⋅ Fujun Luan ⋅ Yiwei Hu ⋅ Hao Tan ⋅ Zexiang Xu ⋅ Sai Bi ⋅ Shubham Tulsiani ⋅ Kai Zhang
ExHall D Poster #252
SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes Poster Session 5
Weixiao Gao ⋅ Liangliang Nan ⋅ Hugo Ledoux
ExHall D Poster #329
Higher-Order Ratio Cycles for Fast and Globally Optimal Shape Matching Poster Session 5
Paul Roetzer ⋅ Viktoria Ehm ⋅ Daniel Cremers ⋅ Zorah Lähner ⋅ Florian Bernard
ExHall D Poster #71
Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception Poster Session 5
Luke Chen ⋅ Junyao Wang ⋅ Trier Mortlock ⋅ Pramod Khargonekar ⋅ Mohammad Al Faruque
ExHall D Poster #120
EDM: Equirectangular Projection-Oriented Dense Kernelized Feature Matching Poster Session 2
Dongki Jung ⋅ Jaehoon Choi ⋅ Yonghan Lee ⋅ Somi Jeong ⋅ Taejae Lee ⋅ Dinesh Manocha ⋅ Suyong Yeon
ExHall D Poster #92
O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models Poster Session 4
Ashshak Sharifdeen ⋅ Muhammad Akhtar Munir ⋅ Sanoojan Baliah ⋅ Salman Khan ⋅ Muhammad Haris Khan
ExHall D Poster #394
SVFR: A Unified Framework for Generalized Video Face Restoration Poster Session 2
Zhiyao Wang ⋅ Xu Chen ⋅ Chengming Xu ⋅ Junwei Zhu ⋅ Xiaobin Hu ⋅ Jiangning Zhang ⋅ Chengjie Wang ⋅ Yuqi Liu ⋅ Yiyi Zhou ⋅ Rongrong Ji
ExHall D Poster #195
Dinomaly: The Less Is More Philosophy in Multi-Class Unsupervised Anomaly Detection Poster Session 4
Jia Guo ⋅ Shuai Lu ⋅ Weihang Zhang ⋅ Fang Chen ⋅ Hongen Liao ⋅ Huiqi Li
ExHall D Poster #438
Test-Time Visual In-Context Tuning Poster Session 4
Jiahao Xie ⋅ Alessio Tonioni ⋅ Nathalie Rauschmayr ⋅ Federico Tombari ⋅ Bernt Schiele
ExHall D Poster #400
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Poster Session 3
Shijie Zhou ⋅ Hui Ren ⋅ Yijia Weng ⋅ Shuwang Zhang ⋅ Zhen Wang ⋅ Dejia Xu ⋅ Zhiwen Fan ⋅ Suya You ⋅ Zhangyang Wang ⋅ Leonidas Guibas ⋅ Achuta Kadambi
ExHall D Poster #338
Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models Poster Session 3
Hao Ren ⋅ Yiming Zeng ⋅ Zetong Bi ⋅ Zhaoliang Wan ⋅ Junlong Huang ⋅ Hui Cheng
ExHall D Poster #140
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images Poster Session 3
Kaiyu Li ⋅ Ruixun Liu ⋅ Xiangyong Cao ⋅ Xueru Bai ⋅ Feng Zhou ⋅ Deyu Meng ⋅ Wang Zhi
ExHall D Poster #319
EchoONE: Segmenting Multiple Echocardiography Planes in One Model Poster Session 1
Jiongtong Hu ⋅ Wei Zhuo ⋅ Jun Cheng ⋅ YINGYING LIU ⋅ Wufeng Xue ⋅ Dong Ni
ExHall D Poster #482
Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation Poster Session 4
Kendong Liu ⋅ Zhiyu Zhu ⋅ Hui LIU ⋅ Junhui Hou
ExHall D Poster #212
EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild Poster Session 2
Yumeng Liu ⋅ Xiaoxiao Long ⋅ Zemin Yang ⋅ Yuan Liu ⋅ Marc Habermann ⋅ Christian Theobalt ⋅ Yuexin Ma ⋅ Wenping Wang
ExHall D Poster #160
MVSAnywhere: Zero-Shot Multi-View Stereo Poster Session 3
Sergio Izquierdo ⋅ Mohamed Sayed ⋅ Michael Firman ⋅ Guillermo Garcia-Hernando ⋅ Daniyar Turmukhambetov ⋅ Javier Civera ⋅ Oisin Mac Aodha ⋅ Gabriel Brostow ⋅ Jamie Watson
ExHall D Poster #81
Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception Poster Session 1
Ruotian Peng ⋅ Haiying He ⋅ Yake Wei ⋅ Yandong Wen ⋅ Di Hu
ExHall D Poster #361
Attribute-Missing Multi-view Graph Clustering Poster Session 5
Bowen Zhao ⋅ Qianqian Wang ⋅ Zhengming Ding ⋅ Quanxue Gao
ExHall D Poster #461
Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision Poster Session 4
Tomoya Yoshida ⋅ Shuhei Kurita ⋅ Taichi Nishimura ⋅ Shinsuke Mori
ExHall D Poster #151
FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting Poster Session 3
Fangyu Wu ⋅ Yuhao Chen
ExHall D Poster #38
EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space Poster Session 4
Jianrong Zhang ⋅ Hehe Fan ⋅ Yi Yang
ExHall D Poster #171
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation Poster Session 4
Hang Yin ⋅ Xiuwei Xu ⋅ Linqing Zhao ⋅ Ziwei Wang ⋅ Jie Zhou ⋅ Jiwen Lu
ExHall D Poster #311
Structure-Aware Correspondence Learning for Relative Pose Estimation Poster Session 3
Yihan Chen ⋅ Wenfei Yang ⋅ Huan Ren ⋅ Shifeng Zhang ⋅ Tianzhu Zhang ⋅ Feng Wu
ExHall D Poster #93
LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs Poster Session 5
Zixuan Hu ⋅ Yongxian Wei ⋅ Li Shen ⋅ Chun Yuan ⋅ Dacheng Tao
ExHall D Poster #381
FIFA: Fine-grained Inter-frame Attention for Driver's Video Gaze Estimation Poster Session 4
Daosong Hu ⋅ Mingyue Cui ⋅ Kai Huang
ExHall D Poster #284
Monocular and Generalizable Gaussian Talking Head Animation Poster Session 2
Shengjie Gong ⋅ Haojie Li ⋅ Jiapeng Tang ⋅ Dongming Hu ⋅ Shuangping Huang ⋅ Hao Chen ⋅ Tianshui Chen ⋅ Zhuoman Liu
ExHall D Poster #7
Rethinking Token Reduction with Parameter-Efficient Fine-Tuning in ViT for Pixel-Level Tasks Poster Session 3
Cheng Lei ⋅ Ao Li ⋅ Hu Yao ⋅ Ce Zhu ⋅ Le Zhang
ExHall D Poster #412
SVDC: Consistent Direct Time-of-Flight Video Depth Completion with Frequency Selective Fusion Poster Session 4
Xuan Zhu ⋅ Jijun Xiang ⋅ Xianqi Wang ⋅ Longliang Liu ⋅ Yu Wang ⋅ Hong Zhang ⋅ Fei Guo ⋅ Xin Yang
ExHall D Poster #75
AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis Poster Session 5
Khiem Vuong ⋅ Anurag Ghosh ⋅ Deva Ramanan ⋅ Srinivasa G. Narasimhan ⋅ Shubham Tulsiani
ExHall D Poster #59
Towards Training-free Anomaly Detection with Vision and Language Foundation Models Poster Session 3
Jinjin Zhang ⋅ Guodong Wang ⋅ Yizhou Jin ⋅ Di Huang
ExHall D Poster #436
LiVOS: Light Video Object Segmentation with Gated Linear Matching Poster Session 2
Qin Liu ⋅ Jianfeng Wang ⋅ Zhengyuan Yang ⋅ Linjie Li ⋅ Kevin Lin ⋅ Marc Niethammer ⋅ Lijuan Wang
ExHall D Poster #315
Dynamic Content Prediction with Motion-aware Priors for Blind Face Video Restoration Poster Session 4
Lianxin Xie ⋅ csbingbing zheng ⋅ Si Wu ⋅ Hau San Wong
ExHall D Poster #192
HalLoc: Token-level Localization of Hallucinations for Vision Language Models Poster Session 6
Eunkyu Park ⋅ Minyeong Kim ⋅ Gunhee Kim
ExHall D Poster #358
DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis Poster Session 6
Yuming Gu ⋅ Phong Tran ⋅ Yujian Zheng ⋅ Hongyi Xu ⋅ Heyuan Li ⋅ Adilbek Karmanov ⋅ Hao Li
ExHall D Poster #6
Plug-and-Play Versatile Compressed Video Enhancement Poster Session 4
Huimin Zeng ⋅ Jiacheng Li ⋅ Zhiwei Xiong
ExHall D Poster #187
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting Poster Session 2
Dongliang Luo ⋅ Hanshen Zhu ⋅ Ziyang Zhang ⋅ Dingkang Liang ⋅ Xudong Xie ⋅ Yuliang Liu ⋅ Xiang Bai
ExHall D Poster #377
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models Poster Session 5
Tianwei Yin ⋅ Qiang Zhang ⋅ Richard Zhang ⋅ William Freeman ⋅ Fredo Durand ⋅ Eli Shechtman ⋅ Xun Huang
ExHall D Poster #181
PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution Poster Session 3
Zhu Li Bo ⋅ Jianze Li ⋅ Haotong Qin ⋅ Wenbo Li ⋅ Yulun Zhang ⋅ Yong Guo ⋅ Xiaokang Yang
ExHall D Poster #203
RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting Poster Session 4
Qiyu Dai ⋅ Xingyu Ni ⋅ Qianfan Shen ⋅ Mengyu Chu ⋅ Wenzheng Chen ⋅ Baoquan Chen
ExHall D Poster #29
Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis Poster Session 5
Boming Miao ⋅ Chunxiao Li ⋅ Xiaoxiao Wang ⋅ Andi Zhang ⋅ Rui Sun ⋅ Zizhe Wang ⋅ Yao Zhu
ExHall D Poster #241
MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction Poster Session 5
Wenyuan Zhang ⋅ Yixiao Yang ⋅ Han Huang ⋅ Liang Han ⋅ Kanle Shi ⋅ Yu-Shen Liu ⋅ Zhizhong Han
ExHall D Poster #56
Three-view Focal Length Recovery From Homographies Poster Session 3
Yaqing Ding ⋅ Viktor Kocur ⋅ Zuzana Berger Haladova ⋅ Qianliang Wu ⋅ Shen Cai ⋅ Jian Yang ⋅ Zuzana Kukelova
ExHall D Poster #82
RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models Poster Session 3
Haoran Hao ⋅ Jiaming Han ⋅ Changsheng Li ⋅ Yu-Feng Li ⋅ Xiangyu Yue
ExHall D Poster #371
FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation Poster Session 1
Tianyun Zhong ⋅ Chao Liang ⋅ Jianwen Jiang ⋅ Gaojie Lin ⋅ Jiaqi Yang ⋅ Zhou Zhao
ExHall D Poster #281
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Poster Session 6
Rundi Wu ⋅ Ruiqi Gao ⋅ Ben Poole ⋅ Alex Trevithick ⋅ Changxi Zheng ⋅ Jonathan T. Barron ⋅ Aleksander Holynski
ExHall D Poster #53
Exploring Semantic Feature Discrimination for Perceptual Image Super-Resolution and Opinion-Unaware No-Reference Image Quality Assessment Poster Session 6
Guanglu Dong ⋅ Xiangyu Liao ⋅ Mingyang Li ⋅ Guihuan Guo ⋅ Chao Ren
ExHall D Poster #192
Distilling Long-tailed Datasets Poster Session 6
Zhenghao Zhao ⋅ Haoxuan Wang ⋅ Yuzhang Shang ⋅ Kai Wang ⋅ Yan Yan
ExHall D Poster #427
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders Poster Session 6
Fiona Ryan ⋅ Ajay Bati ⋅ Sangmin Lee ⋅ Daniel Bolya ⋅ Judy Hoffman ⋅ James Rehg
ExHall D Poster #258
Beyond Words: Augmenting Discriminative Richness via Diffusions in Unsupervised Prompt Learning Poster Session 5
Hairui Ren ⋅ Fan Tang ⋅ He Zhao ⋅ Zixuan Wang ⋅ Dandan Guo ⋅ Yi Chang
ExHall D Poster #391
Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization Poster Session 6
Dongkwan Lee ⋅ Kyomin Hwang ⋅ Nojun Kwak
ExHall D Poster #426
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation Poster Session 1
Ruineng Li ⋅ Daitao Xing ⋅ Huiming Sun ⋅ Yuanzhou Ha ⋅ Jinglin Shen ⋅ Chiuman Ho
ExHall D Poster #165
CholecTrack20: A Multi-Perspective Tracking Dataset for Surgical Tools Poster Session 2
Chinedu Innocent Nwoye ⋅ Kareem Elgohary ⋅ Anvita A. Srinivas ⋅ Fauzan Zaid ⋅ Joël L. Lavanchy ⋅ Nicolas Padoy
ExHall D Poster #342
Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression Poster Session 4
Hsiang-Wei Huang ⋅ Fu-Chen Chen ⋅ Wenhao Chai ⋅ Che-Chun Su ⋅ Lu Xia ⋅ Sanghun Jung ⋅ Cheng-Yen Yang ⋅ Jenq-Neng Hwang ⋅ Min Sun ⋅ Cheng-Hao Kuo
ExHall D Poster #345
Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning Poster Session 4
Huajie Jiang ⋅ Zhengxian Li ⋅ Xiaohan Yu ⋅ Yongli Hu ⋅ Baocai Yin ⋅ Jian Yang ⋅ Yuankai Qi
ExHall D Poster #426
Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion Poster Session 6
Zexin He ⋅ Tengfei Wang ⋅ Xin Huang ⋅ Xingang Pan ⋅ Ziwei Liu
ExHall D Poster #34
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling Poster Session 4
Zeyue Tian ⋅ Zhaoyang Liu ⋅ Ruibin Yuan ⋅ Jiahao Pan ⋅ Qifeng Liu ⋅ Xu Tan ⋅ Qifeng Chen ⋅ Wei Xue ⋅ Yike Guo
ExHall D Poster #286
Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification Poster Session 3
Yang Qin ⋅ Chao Chen ⋅ Zhihang Fu ⋅ Dezhong Peng ⋅ Xi Peng ⋅ Peng Hu
ExHall D Poster #357
Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales Poster Session 1
Shuokai Pan ⋅ Gerti Tuzi ⋅ Sudarshan Sreeram ⋅ Dibakar Gope
ExHall D Poster #374
EdgeDiff: Edge-aware Diffusion Network for Building Reconstruction from Point Clouds Poster Session 4
Yujun Liu ⋅ Ruisheng Wang ⋅ Shangfeng Huang ⋅ GuoRong Cai
ExHall D Poster #114
A Dataset for Semantic Segmentation in the Presence of Unknowns Poster Session 1
Zakaria Laskar ⋅ Tomas Vojir ⋅ Matej Grcic ⋅ Iaroslav Melekhov ⋅ Shankar Gangisetty ⋅ Juho Kannala ⋅ Jiri Matas ⋅ Giorgos Tolias ⋅ C.V. Jawahar
ExHall D Poster #119
HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding Poster Session 2
Shehreen Azad ⋅ Vibhav Vineet ⋅ Yogesh S. Rawat
ExHall D Poster #303
Explainable Saliency: Articulating Reasoning with Contextual Prioritization Poster Session 2
Nuo Chen ⋅ Ming Jiang ⋅ Qi Zhao
ExHall D Poster #403
DH-Set: Improving Vision-Language Alignment with Diverse and Hybrid Set-Embeddings Learning Poster Session 5
Kun Zhang ⋅ Jingyu Li ⋅ Zhe Li ⋅ S Kevin Zhou
ExHall D Poster #378
Task-Aware Clustering for Prompting Vision-Language Models Poster Session 3
Fusheng Hao ⋅ Fengxiang He ⋅ Fuxiang Wu ⋅ Tichao Wang ⋅ Chengqun Song ⋅ Jun Cheng
ExHall D Poster #392
CASP: Compression of Large Multimodal Models Based on Attention Sparsity Poster Session 2
Mohsen Gholami ⋅ Mohammad Akbari ⋅ Kevin Cannons ⋅ Yong Zhang
ExHall D Poster #381
Towards Cost-Effective Learning: A Synergy of Semi-Supervised and Active Learning Poster Session 2
Tianxiang Yin ⋅ Ningzhong Liu ⋅ Han Sun
ExHall D Poster #456
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing Poster Session 6
Pengcheng Xu ⋅ Boyuan Jiang ⋅ Xiaobin Hu ⋅ Donghao Luo ⋅ Qingdong He ⋅ Jiangning Zhang ⋅ Chengjie Wang ⋅ Yunsheng Wu ⋅ Charles Ling ⋅ Boyu Wang
ExHall D Poster #221
Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes Poster Session 4
Ludwic Leonard ⋅ Nils Thuerey ⋅ Rüdiger Westermann
ExHall D Poster #30
DyCON: Dynamic Uncertainty-aware Consistency and Contrastive Learning for Semi-supervised Medical Image Segmentation Poster Session 6
Maregu Assefa ⋅ Muzammal Naseer ⋅ IYYAKUTTI IYAPPAN GANAPATHI ⋅ Syed Sadaf Ali ⋅ Mohamed L Seghier ⋅ Naoufel Werghi
ExHall D Poster #450
STiL: Semi-supervised Tabular-Image Learning for Comprehensive Task-Relevant Information Exploration in Multimodal Classification Poster Session 3
Siyi Du ⋅ Xinzhe Luo ⋅ Declan ORegan ⋅ Chen Qin
ExHall D Poster #469
DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers Poster Session 6
Mert Bülent Sarıyıldız ⋅ Philippe Weinzaepfel ⋅ Thomas Lucas ⋅ Pau de Jorge ⋅ Diane Larlus ⋅ Yannis Kalantidis
ExHall D Poster #376
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models Poster Session 6
Runhui Huang ⋅ Xinpeng Ding ⋅ Chunwei Wang ⋅ Jianhua Han ⋅ Yulong Liu ⋅ Hengshuang Zhao ⋅ Hang Xu ⋅ Lu Hou ⋅ Wei Zhang ⋅ Xiaodan Liang
ExHall D Poster #351
Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis Poster Session 2
Yu Yuan ⋅ Xijun Wang ⋅ Yichen Sheng ⋅ Prateek Chennuri ⋅ Xingguang Zhang ⋅ Stanley H. Chan
ExHall D Poster #244
SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning Poster Session 5
Ren Wang ⋅ Haoliang Sun ⋅ Yuxiu Lin ⋅ Chuanhui Zuo ⋅ Yongshun Gong ⋅ Yilong Yin ⋅ Wenjia Meng
ExHall D Poster #460
Gaussian Splatting Feature Fields for (Privacy-Preserving) Visual Localization Poster Session 1
Maxime Pietrantoni ⋅ Gabriela Csurka ⋅ Torsten Sattler
ExHall D Poster #85
Gazing Into Missteps: Leveraging Eye-Gaze for Unsupervised Mistake Detection in Egocentric Videos of Skilled Human Activities Poster Session 2
Michele Mazzamuto ⋅ Antonino Furnari ⋅ Yoichi Sato ⋅ Giovanni Maria Farinella
ExHall D Poster #281
Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport Poster Session 1
Hao Tan ⋅ Zichang Tan ⋅ Jun Li ⋅ Ajian Liu ⋅ Jun Wan ⋅ Zhen Lei
ExHall D Poster #429
RelationField: Relate Anything in Radiance Fields Poster Session 5
Sebastian Koch ⋅ Johanna Wald ⋅ Mirco Colosi ⋅ Narunas Vaskevicius ⋅ Pedro Hermosilla ⋅ Federico Tombari ⋅ Timo Ropinski
ExHall D Poster #62
DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding Poster Session 2
Geng Li ⋅ Jinglin Xu ⋅ Yunzhen Zhao ⋅ Yuxin Peng
ExHall D Poster #356
From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration Poster Session 2
Mingyang Song ⋅ Xiaoye Qu ⋅ Jiawei Zhou ⋅ Yu Cheng
ExHall D Poster #387
DEIM: DETR with Improved Matching for Fast Convergence Poster Session 3
Shihua Huang ⋅ Zhichao Lu ⋅ Xiaodong Cun ⋅ Yongjun YU ⋅ Xiao Zhou ⋅ Xi Shen
ExHall D Poster #432
BF-STVSR: B-Splines and Fourier - Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution Poster Session 6
Eunjin Kim ⋅ HYEONJIN KIM ⋅ Kyong Hwan Jin ⋅ Jaejun Yoo
ExHall D Poster #175
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis Poster Session 6
Jiangtong Tan ⋅ Hu Yu ⋅ Jie Huang ⋅ Jie Xiao ⋅ Feng Zhao
ExHall D Poster #172
Hierarchical Adaptive Filtering Network for Text Image Specular Highlight Removal Poster Session 1
Zhi Jiang ⋅ Jingbo Hu ⋅ Ling Zhang ⋅ Gang Fu ⋅ Chunxia Xiao
ExHall D Poster #211
Improving Semi-Supervised Semantic Segmentation with Sliced-Wasserstein Feature Alignment and Uniformity Poster Session 4
Chen Yi Lu ⋅ Kasra Derakhshandeh ⋅ Somali Chaterji
ExHall D Poster #422
Mind the Time: Temporally-Controlled Multi-Event Video Generation Poster Session 5
Ziyi Wu ⋅ Aliaksandr Siarohin ⋅ Willi Menapace ⋅ Ivan Skorokhodov ⋅ Yuwei Fang ⋅ Varnith Chordia ⋅ Igor Gilitschenski ⋅ Sergey Tulyakov
ExHall D Poster #284
NADER: Neural Architecture Design via Multi-Agent Collaboration Poster Session 1
Zekang Yang ⋅ Wang ZENG ⋅ Sheng Jin ⋅ Chen Qian ⋅ Ping Luo ⋅ Wentao Liu
ExHall D Poster #411
Move-in-2D: 2D-Conditioned Human Motion Generation Poster Session 5
Hsin-Ping Huang ⋅ Yang Zhou ⋅ Jui-Hsien Wang ⋅ Difan Liu ⋅ Feng Liu ⋅ Ming-Hsuan Yang ⋅ Zhan Xu
ExHall D Poster #162
PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation Poster Session 3
Uyoung Jeong ⋅ Jonathan Freer ⋅ Seungryul Baek ⋅ Hyung Jin Chang ⋅ Kwang In Kim
ExHall D Poster #156
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering Poster Session 5
Zhen Yang ⋅ Zhuo Tao ⋅ Qi Chen ⋅ Yuankai Qi ⋅ Liang Li ⋅ Anton van den Hengel ⋅ Qingming Huang
ExHall D Poster #356
Decision SpikeFormer: Spike-Driven Transformer for Decision Making Poster Session 4
Wei Huang ⋅ Qinying Gu ⋅ Nanyang Ye
ExHall D Poster #328
SF2T: Self-supervised Fragment Finetuning of Video-LLMs for Fine-Grained Understanding Poster Session 6
Yangliu Hu ⋅ Zikai Song ⋅ Na Feng ⋅ Yawei Luo ⋅ Junqing Yu ⋅ Yi-Ping Phoebe Chen ⋅ Wei Yang
ExHall D Poster #283
Theory-Inspired Deep Multi-View Multi-Label Learning with Incomplete Views and Noisy Labels Poster Session 4
Quanjiang Li ⋅ Tingjin Luo ⋅ Jiahui Liao
ExHall D Poster #466
Fitted Neural Lossless Image Compression Poster Session 5
Zhe Zhang ⋅ Zhenzhong Chen ⋅ Shan Liu
ExHall D Poster #209
Question-Aware Gaussian Experts for Audio-Visual Question Answering Poster Session 3
Hongyeob Kim ⋅ Inyoung Jung ⋅ Dayoon Suh ⋅ Youjia Zhang ⋅ Sangmin Lee ⋅ Sungeun Hong
ExHall D Poster #290
Multitwine: Multi-Object Compositing with Text and Layout Control Poster Session 2
Gemma Canet Tarrés ⋅ Zhe Lin ⋅ Zhifei Zhang ⋅ He Zhang ⋅ Andrew Gilbert ⋅ John Collomosse ⋅ Soo Ye Kim
ExHall D Poster #260
Adaptive Rectangular Convolution for Remote Sensing Pansharpening Poster Session 4
Xueyang Wang ⋅ Zhixin Zheng ⋅ Jiandong Shao ⋅ Yule Duan ⋅ Liang-Jian Deng
ExHall D Poster #197
Video Depth without Video Models Poster Session 2
Bingxin Ke ⋅ Dominik Narnhofer ⋅ Shengyu Huang ⋅ Lei Ke ⋅ Torben Peters ⋅ Katerina Fragkiadaki ⋅ Anton Obukhov ⋅ Konrad Schindler
ExHall D Poster #179
PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning Poster Session 2
Song Wang ⋅ Xiaolu Liu ⋅ Lingdong Kong ⋅ Jianyun Xu ⋅ Chunyong Hu ⋅ Gongfan Fang ⋅ Wentong Li ⋅ Jianke Zhu ⋅ Xinchao Wang
ExHall D Poster #118
HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset Poster Session 1
Zedong Chu ⋅ Feng Xiong ⋅ Meiduo Liu ⋅ Jinzhi Zhang ⋅ Mingqi Shao ⋅ Zhaoxu Sun ⋅ Di Wang ⋅ Mu Xu
ExHall D Poster #13
UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models Poster Session 4
Yuning Han ⋅ Bingyin Zhao ⋅ Rui Chu ⋅ Feng Luo ⋅ Biplab Sikdar ⋅ Yingjie Lao
ExHall D Poster #323
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding Poster Session 6
Feilong Tang ⋅ Chengzhi Liu ⋅ Zhongxing Xu ⋅ Ming Hu ⋅ Zile Huang ⋅ Haochen Xue ⋅ Ziyang Chen ⋅ Zelin Peng ⋅ Zhiwei Yang ⋅ Sijin Zhou ⋅ Wenxue Li ⋅ Yulong Li ⋅ Wenxuan Song ⋅ Shiyan Su ⋅ Wei Feng ⋅ Jionglong Su ⋅ Mingquan Lin ⋅ Yifan Peng ⋅ Xuelian Cheng ⋅ Imran Razzak ⋅ Zongyuan Ge
ExHall D Poster #272
Unbiased Video Scene Graph Generation via Visual and Semantic Dual Debiasing Poster Session 4
Yanjun Li ⋅ Zhaoyang Li ⋅ Honghui Chen ⋅ li'Zhi Xu
ExHall D Poster #310
Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes Poster Session 2
JunYong Choi ⋅ Min-Cheol Sagong ⋅ SeokYeong Lee ⋅ Seung-Won Jung ⋅ Ig-Jae Kim ⋅ Junghyun Cho
ExHall D Poster #31
Targeted Forgetting of Image Subgroups in CLIP Models Poster Session 2
Zeliang Zhang ⋅ Gaowen Liu ⋅ Charles Fleming ⋅ Ramana Kompella ⋅ Chenliang Xu
ExHall D Poster #428
Harnessing Frozen Unimodal Encoders for Flexible Multimodal Alignment Poster Session 6
Mayug Maniparambil ⋅ Raiymbek Akshulakov ⋅ YASSER ABDELAZIZ DAHOU DJILALI ⋅ Sanath Narayan ⋅ Ankit Singh ⋅ Noel O'Connor
ExHall D Poster #354
Enhancing Diversity for Data-free Quantization Poster Session 5
Kai Zhao ⋅ Zhihao Zhuang ⋅ Miao Zhang ⋅ Chenjuan Guo ⋅ Yang Shu ⋅ Bin Yang
ExHall D Poster #425
SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model Poster Session 1
Chunlin Yu ⋅ Hanqing Wang ⋅ Ye Shi ⋅ Haoyang Luo ⋅ Sibei Yang ⋅ Jingyi Yu ⋅ Jingya Wang
ExHall D Poster #142
DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation Poster Session 1
Amin Karimi ⋅ Charalambos Poullis
ExHall D Poster #423
Revisiting Generative Replay for Class Incremental Object Detection Poster Session 4
Shizhou Zhang ⋅ Xueqiang Lv ⋅ Yinghui Xing ⋅ Qirui Wu ⋅ Di Xu ⋅ Yanning Zhang
ExHall D Poster #432
Bridging Viewpoint Gaps: Geometric Reasoning Boosts Semantic Correspondence Poster Session 3
Qiyang Qian ⋅ Hansheng Chen ⋅ Masayoshi Tomizuka ⋅ Kurt Keutzer ⋅ Qianqian Wang ⋅ Chenfeng Xu
ExHall D Poster #90
Memories of Forgotten Concepts Poster Session 1
Matan Rusanovsky ⋅ Shimon Malnick ⋅ Amir Jevnisek ⋅ Ohad Fried ⋅ Shai Avidan
ExHall D Poster #268
PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Poster Session 6
Eduard Poesina ⋅ Adriana Valentina Costache ⋅ Adrian-Gabriel Chifu ⋅ Josiane Mothe ⋅ Radu Tudor Ionescu
ExHall D Poster #237
CheXwhatsApp: A Dataset for Exploring Challenges in the Diagnosis of Chest X-rays through Mobile Devices Poster Session 5
Mariamma Antony ⋅ Rajiv Porana ⋅ Sahil M. Lathiya ⋅ Siva Teja Kakileti ⋅ Chiranjib Bhattacharyya
ExHall D Poster #466
Degradation-Aware Feature Perturbation for All-in-One Image Restoration Poster Session 6
Xiangpeng Tian ⋅ Xiangyu Liao ⋅ Xiao Liu ⋅ Meng Li ⋅ Chao Ren
ExHall D Poster #191
ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning Poster Session 6
Haoyuan Yang ⋅ Xiaoou Li ⋅ Jiaming Lv ⋅ Xianjun Cheng ⋅ Qilong Wang ⋅ Peihua Li
ExHall D Poster #370
Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness Poster Session 5
Beier Zhu ⋅ Jiequan Cui ⋅ Hanwang Zhang ⋅ Chi Zhang
ExHall D Poster #424
Implicit Bias Injection Attacks against Text-to-Image Diffusion Models Poster Session 6
Huayang Huang ⋅ Xiangye Jin ⋅ Jiaxu Miao ⋅ Yu Wu
ExHall D Poster #249
ROICtrl: Boosting Instance Control for Visual Generation Poster Session 5
Yuchao Gu ⋅ Yipin Zhou ⋅ Yunfan Ye ⋅ Yixin Nie ⋅ Licheng Yu ⋅ Pingchuan Ma ⋅ Kevin Qinghong Lin ⋅ Mike Zheng Shou
ExHall D Poster #251
WonderWorld: Interactive 3D Scene Generation from a Single Image Poster Session 2
Hong-Xing Yu ⋅ Haoyi Duan ⋅ Charles Herrmann ⋅ William Freeman ⋅ Jiajun Wu
ExHall D Poster #45
A Lightweight UDF Learning Framework for 3D Reconstruction Based on Local Shape Functions Poster Session 1
Jiangbei Hu ⋅ Yanggeng Li ⋅ Fei Hou ⋅ Junhui Hou ⋅ Zhebin Zhang ⋅ Shengfa Wang ⋅ Na Lei ⋅ Ying He
ExHall D Poster #105
DiffCAM: Data-Driven Saliency Maps by Capturing Feature Differences Poster Session 2
Xingjian Li ⋅ Qiming Zhao ⋅ Neelesh Bisht ⋅ Mostofa Uddin Uddin ⋅ Jin Yu Kim ⋅ Bryan Zhang ⋅ Min Xu
ExHall D Poster #472
PolarNeXt: Rethink Instance Segmentation with Polar Representation Poster Session 4
Jiacheng Sun ⋅ Xinghong Zhou ⋅ Yiqiang Wu ⋅ Bin Zhu ⋅ Jiaxuan Lu ⋅ Yu Qin ⋅ Xiaomao Li
ExHall D Poster #335
SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything Model Poster Session 4
Chongkai Yu ⋅ Ting Liu ⋅ Li Anqi ⋅ Xiaochao Qu ⋅ WU CHENGJING ⋅ Luoqi Liu ⋅ Xiaolin Hu
ExHall D Poster #339
DarkIR: Robust Low-Light Image Restoration Poster Session 3
Daniel Feijoo ⋅ Juan C. Benito ⋅ Alvaro Garcia ⋅ Marcos Conde
ExHall D Poster #21
R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner Poster Session 4
Ziyi Bai ⋅ Hanxuan Li ⋅ Bin Fu ⋅ Chuyan Xiong ⋅ Ruiping Wang ⋅ Xilin Chen
ExHall D Poster #348
ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models Poster Session 5
Fernando Julio Cendra ⋅ Kai Han
ExHall D Poster #260
MultiMorph: On-demand Atlas Construction Poster Session 6
Mazdak Abulnaga ⋅ Andrew Hoopes ⋅ Neel Dey ⋅ Malte Hoffmann ⋅ Bruce Fischl ⋅ John Guttag ⋅ Adrian V. Dalca
ExHall D Poster #455
From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling Poster Session 4
Jinhong Lin ⋅ Cheng-En Wu ⋅ Huanran Li ⋅ Jifan Zhang ⋅ Yu Hen Hu ⋅ Pedro Morgado
ExHall D Poster #403
Synthetic Visual Genome Poster Session 2
Jae Sung Park ⋅ Zixian Ma ⋅ Linjie Li ⋅ Chenhao Zheng ⋅ Cheng-Yu Hsieh ⋅ Ximing Lu ⋅ Khyathi Chandu ⋅ Quan Kong ⋅ Norimasa Kobori ⋅ Ali Farhadi ⋅ Yejin Choi ⋅ Ranjay Krishna
ExHall D Poster #354
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation Poster Session 4
Hyunsoo Kim ⋅ Donghyun Kim ⋅ Suhyun Kim
ExHall D Poster #234
Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding Poster Session 6
Wei Suo ⋅ Lijun Zhang ⋅ Mengyang Sun ⋅ Lin Yuanbo Wu ⋅ Peng Wang ⋅ Yanning Zhang
ExHall D Poster #359
ShiftwiseConv: Small Convolutional Kernel with Large Kernel Effect Poster Session 5
Dachong Li ⋅ Li Li ⋅ Zhuangzhuang Chen ⋅ Jianqiang Li
ExHall D Poster #405
RealEdit: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations Poster Session 3
Peter Sushko ⋅ Ayana Bharadwaj ⋅ Zhi Yang Lim ⋅ Vasily Ilin ⋅ Ben Caffee ⋅ Dongping Chen ⋅ Reza Salehi ⋅ Cheng-Yu Hsieh ⋅ Ranjay Krishna
ExHall D Poster #263
MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting Poster Session 4
Jun Huang ⋅ Ting Liu ⋅ Yihang Wu ⋅ Xiaochao Qu ⋅ Luoqi Liu ⋅ Xiaolin Hu
ExHall D Poster #241
Can Generative Video Models Help Pose Estimation? Poster Session 4
Ruojin Cai ⋅ Jason Y. Zhang ⋅ Philipp Henzler ⋅ Zhengqi Li ⋅ Noah Snavely ⋅ Ricardo Martin
ExHall D Poster #90
DreamOmni: Unified Image Generation and Editing Poster Session 6
Bin Xia ⋅ Yuechen Zhang ⋅ Jingyao Li ⋅ Chengyao Wang ⋅ Yitong Wang ⋅ Xinglong Wu ⋅ Bei Yu ⋅ Jiaya Jia
ExHall D Poster #226
Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark Poster Session 6
Hao Guo ⋅ Xugong Qin ⋅ Jun Jie Ou Yang ⋅ Peng Zhang ⋅ Gangyan Zeng ⋅ Yubo Li ⋅ Hailun Lin
ExHall D Poster #343
PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models Poster Session 5
Jenny Schmalfuss ⋅ Nadine Chang ⋅ Vibashan VS ⋅ Maying Shen ⋅ Andrés Bruhn ⋅ Jose M. Alvarez
ExHall D Poster #386
DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh Poster Session 1
Jingyu Zhuang ⋅ Di Kang ⋅ Linchao Bao ⋅ Liang Lin ⋅ Guanbin Li
ExHall D Poster #12
Towards Source-Free Machine Unlearning Poster Session 1
Sk Miraj Ahmed ⋅ Umit Basaran ⋅ Dripta S. Raychaudhuri ⋅ Arindam Dutta ⋅ Rohit Kundu ⋅ Fahim Faisal Niloy ⋅ Basak Guler ⋅ Amit K. Roy-Chowdhury
ExHall D Poster #457
Fractal Calibration for Long-tailed Object Detection Poster Session 3
Konstantinos Alexandridis ⋅ Ismail Elezi ⋅ Jiankang Deng ⋅ Anh Nguyen ⋅ Shan Luo
ExHall D Poster #430
Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition Poster Session 5
Chengxiang Huang ⋅ Yake Wei ⋅ Zequn Yang ⋅ Di Hu
ExHall D Poster #463
Noise Calibration and Spatial-Frequency Interactive Network for STEM Image Enhancement Poster Session 5
Hesong Li ⋅ Ziqi Wu ⋅ Ruiwen Shao ⋅ Tao Zhang ⋅ Ying Fu
ExHall D Poster #23
FedSPA: Generalizable Federated Graph Learning under Homophily Heterogeneity Poster Session 3
Zihan Tan ⋅ Guancheng Wan ⋅ Wenke Huang ⋅ Guibin Zhang ⋅ He Li ⋅ Carl Yang ⋅ Mang Ye
ExHall D Poster #461
PRaDA: Projective Radial Distortion Averaging Poster Session 5
Daniil Sinitsyn ⋅ Linus Härenstam-Nielsen ⋅ Daniel Cremers
ExHall D Poster #81
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation Poster Session 2
Jiantao Lin ⋅ Xin Yang ⋅ Meixi Chen ⋅ Xu Yingjie ⋅ Dongyu Yan ⋅ Leyi Wu ⋅ Xinli Xu ⋅ Lie XU ⋅ Shunsi Zhang ⋅ Ying-Cong Chen
ExHall D Poster #41
Feature Information Driven Position Gaussian Distribution Estimation for Tiny Object Detection Poster Session 6
Jinghao Bian ⋅ Mingtao Feng ⋅ Weisheng Dong ⋅ Fangfang Wu ⋅ Jianqiao Luo ⋅ Yaonan Wang ⋅ Guangming Shi
ExHall D Poster #405
SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity Poster Session 6
Ke Ma ⋅ Jiaqi Tang ⋅ Bin Guo ⋅ Fan Dang ⋅ Sicong Liu ⋅ Zhui Zhu ⋅ Lei Wu ⋅ Cheng Fang ⋅ Ying-Cong Chen ⋅ Zhiwen Yu ⋅ Yunhao Liu
ExHall D Poster #418
Erase Diffusion: Empowering Object Removal Through Calibrating Diffusion Pathways Poster Session 1
Yi Liu ⋅ Hao Zhou ⋅ Benlei Cui ⋅ Wenxiang Shang ⋅ Ran Lin
ExHall D Poster #212
MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval Poster Session 5
Reno Kriz ⋅ Kate Sanders ⋅ David Etter ⋅ Kenton Murray ⋅ Cameron Carpenter ⋅ Hannah Recknor ⋅ Jimena Guallar-Blasco ⋅ Alexander Martin ⋅ Eugene Yang ⋅ Benjamin Van Durme
ExHall D Poster #299
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models Poster Session 3
Taha Koleilat ⋅ Hojat Asgariandehkordi ⋅ Hassan Rivaz ⋅ Yiming Xiao
ExHall D Poster #394
Shape and Texture: What Influences Reliable Optical Flow Estimation? Poster Session 6
Libo Long ⋅ Xiao Hu ⋅ Jochen Lang
ExHall D Poster #164
VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors Poster Session 4
Juil Koo ⋅ Paul Guerrero ⋅ Chun-Hao P. Huang ⋅ Duygu Ceylan ⋅ Minhyuk Sung
ExHall D Poster #180
Resilient Sensor Fusion Under Adverse Sensor Failures via Multi-Modal Expert Fusion Poster Session 2
Konyul Park ⋅ Yecheol Kim ⋅ Daehun Kim ⋅ Jun Won Choi
ExHall D Poster #129
Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution Poster Session 6
Siwei Tu ⋅ Ben Fei ⋅ Weidong Yang ⋅ Fenghua Ling ⋅ Hao Chen ⋅ Zili Liu ⋅ Kun Chen ⋅ Hang Fan ⋅ Wanli Ouyang ⋅ Lei Bai
ExHall D Poster #181
Event-Equalized Dense Video Captioning Poster Session 2
Kangyi Wu ⋅ Pengna Li ⋅ Jingwen Fu ⋅ Yizhe Li ⋅ Yang Wu ⋅ Yuhan Liu ⋅ Jinjun Wang ⋅ Sanping Zhou
ExHall D Poster #291
Multirate Neural Image Compression with Adaptive Lattice Vector Quantization Poster Session 2
Hao Xu ⋅ Xiaolin Wu ⋅ Xi Zhang
ExHall D Poster #216
VidTwin: Video VAE with Decoupled Structure and Dynamics Poster Session 5
Yuchi Wang ⋅ Junliang Guo ⋅ Xinyi Xie ⋅ Tianyu He ⋅ Xu Sun ⋅ Jiang Bian
ExHall D Poster #177
Reconstructing People, Places, and Cameras Poster Session 5
Lea Müller ⋅ Hongsuk Choi ⋅ Anthony Zhang ⋅ Brent Yi ⋅ Jitendra Malik ⋅ Angjoo Kanazawa
ExHall D Poster #86
Evaluating Model Perception of Color Illusions in Photorealistic Scenes Poster Session 2
Lingjun Mao ⋅ Zineng Tang ⋅ Alane Suhr
ExHall D Poster #233
Preserve or Modify? Context-Aware Evaluation for Balancing Preservation and Modification in Text-Guided Image Editing Poster Session 5
Yoonjeon Kim ⋅ Soohyun Ryu ⋅ Yeonsung Jung ⋅ Hyunkoo Lee ⋅ Joowon Kim ⋅ June Yong Yang ⋅ Jaeryong Hwang ⋅ Eunho Yang
ExHall D Poster #231
ProReflow: Progressive Reflow with Decomposed Velocity Poster Session 6
Lei Ke ⋅ Haohang Xu ⋅ Xuefei Ning ⋅ Yu Li ⋅ Jiajun Li ⋅ Haoling Li ⋅ Yuxuan Lin ⋅ Dongsheng Jiang ⋅ Yujiu Yang ⋅ Linfeng Zhang
ExHall D Poster #177
Object-aware Sound Source Localization via Audio-Visual Scene Understanding Poster Session 2
Sung Jin Um ⋅ Dongjin Kim ⋅ Sangmin Lee ⋅ Jung Uk Kim
ExHall D Poster #284
PCM: Picard Consistency Model for Fast Parallel Sampling of Diffusion Models Poster Session 5
Junhyuk So ⋅ Jiwoong Shin ⋅ Chaeyeon Jang ⋅ Eunhyeok Park
ExHall D Poster #216
Towards Precise Scaling Laws for Video Diffusion Transformers Poster Session 4
Yuanyang Yin ⋅ Yaqi Zhao ⋅ Mingwu Zheng ⋅ Ke Lin ⋅ Jiarong Ou ⋅ Rui Chen ⋅ Victor Shea-Jay Huang ⋅ Jiahao Wang ⋅ Xin Tao ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Baoqun Yin ⋅ Wentao Zhang ⋅ Kun Gai
ExHall D Poster #224
VideoGEM: Training-free Action Grounding in Videos Poster Session 1
Felix Vogel ⋅ Walid Bousselham ⋅ Anna Kukleva ⋅ Nina Shvetsova ⋅ Hilde Kuehne
ExHall D Poster #306
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment Poster Session 6
Yunhong Lu ⋅ Qichao Wang ⋅ Hengyuan Cao ⋅ Xierui Wang ⋅ Xiaoyin Xu ⋅ Min Zhang
ExHall D Poster #235
SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation Poster Session 1
Aleksei Bokhovkin ⋅ Quan Meng ⋅ Shubham Tulsiani ⋅ Angela Dai
ExHall D Poster #43
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text Poster Session 1
Roberto Henschel ⋅ Levon Khachatryan ⋅ Hayk Poghosyan ⋅ Daniil Hayrapetyan ⋅ Vahram Tadevosyan ⋅ Zhangyang Wang ⋅ Shant Navasardyan ⋅ Humphrey Shi
ExHall D Poster #230
DefMamba: Deformable Visual State Space Model Poster Session 2
Leiye Liu ⋅ Miao Zhang ⋅ Jihao Yin ⋅ Tingwei Liu ⋅ Wei Ji ⋅ Yongri Piao ⋅ Huchuan Lu
ExHall D Poster #331
Color Alignment in Diffusion Poster Session 6
Ka Chun SHUM ⋅ Binh-Son Hua ⋅ Thanh Nguyen ⋅ Sai-Kit Yeung
ExHall D Poster #218
Hand-held Object Reconstruction from RGB Video with Dynamic Interaction Poster Session 3
Shijian Jiang ⋅ Qi Ye ⋅ Rengan Xie ⋅ Yuchi Huo ⋅ Jiming Chen
ExHall D Poster #151
DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI Poster Session 3
Sangmin Lee ⋅ Sungyong Park ⋅ Heewon Kim
ExHall D Poster #146
Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation Poster Session 3
Yiping Wang ⋅ Xuehai He ⋅ Kuan Wang ⋅ Luyao Ma ⋅ Jianwei Yang ⋅ Shuohang Wang ⋅ Simon Shaolei Du ⋅ Yelong Shen
ExHall D Poster #284
Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture Poster Session 1
Xuanchen Li ⋅ Jianyu Wang ⋅ Yuhao Cheng ⋅ Yikun Zeng ⋅ Xingyu Ren ⋅ Wenhan Zhu ⋅ Weiming Zhao ⋅ Yichao Yan
ExHall D Poster #4
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding Poster Session 3
Haoyi Jiang ⋅ Liu Liu ⋅ Tianheng Cheng ⋅ Xinjie Wang ⋅ Tianwei Lin ⋅ Zhizhong Su ⋅ Wenyu Liu ⋅ Xinggang Wang
ExHall D Poster #127
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment Poster Session 5
Soumya Suvra Ghosal ⋅ Souradip Chakraborty ⋅ Vaibhav Singh ⋅ Tianrui Guan ⋅ Mengdi Wang ⋅ Ahmad Beirami ⋅ Furong Huang ⋅ Alvaro Velasquez ⋅ Dinesh Manocha ⋅ Amrit Singh Bedi
ExHall D Poster #382
Efficient Video Super-Resolution for Real-time Rendering with Decoupled G-buffer Guidance Poster Session 3
Mingjun Zheng ⋅ Long Sun ⋅ Jiangxin Dong ⋅ Jinshan Pan
ExHall D Poster #64
ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos Poster Session 5
Zetong Zhang ⋅ Manuel Kaufmann ⋅ Lixin Xue ⋅ Jie Song ⋅ Martin R. Oswald
ExHall D Poster #74
Generative Video Propagation Poster Session 4
Shaoteng Liu ⋅ Tianyu Wang ⋅ Jui-Hsien Wang ⋅ Qing Liu ⋅ Zhifei Zhang ⋅ Joon-Young Lee ⋅ Yijun Li ⋅ Bei Yu ⋅ Zhe Lin ⋅ Soo Ye Kim ⋅ Jiaya Jia
ExHall D Poster #182
PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting Poster Session 2
Alex Hanson ⋅ Allen Tu ⋅ Vasu Singla ⋅ Bethmage Mayuka Jayawardhana ⋅ Matthias Zwicker ⋅ Tom Goldstein
ExHall D Poster #48
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR Poster Session 5
Xugong Qin ⋅ Peng Zhang ⋅ Jun Jie Ou Yang ⋅ Gangyan Zeng ⋅ Yubo Li ⋅ Yuanyuan Wang ⋅ Wanqian Zhang ⋅ Pengwen Dai
ExHall D Poster #367
DreamTrack: Dreaming the Future for Multimodal Visual Object Tracking Poster Session 2
Mingzhe Guo ⋅ Weiping Tan ⋅ Wenyu Ran ⋅ Liping Jing ⋅ Zhipeng Zhang
ExHall D Poster #176
Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models Poster Session 2
Chen Chen ⋅ Daochang Liu ⋅ Mubarak Shah ⋅ Chang Xu
ExHall D Poster #268
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization Poster Session 3
Zhanhao Liang ⋅ Yuhui Yuan ⋅ Shuyang Gu ⋅ Bohan CHEN ⋅ Tiankai Hang ⋅ Mingxi Cheng ⋅ Ji Li ⋅ Liang Zheng
ExHall D Poster #243
Learning Textual Prompts for Open-World Semi-Supervised Learning Poster Session 3
Yuxin Fan ⋅ Junbiao Cui ⋅ Jiye Liang
ExHall D Poster #393
VITED: Video Temporal Evidence Distillation Poster Session 2
Yujie Lu ⋅ Yale Song ⋅ Lorenzo Torresani ⋅ William Yang Wang ⋅ Tushar Nagarajan
ExHall D Poster #298
Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model Poster Session 6
Yuting Zhang ⋅ Hao Lu ⋅ Qingyong Hu ⋅ Yin Wang ⋅ Kaishen Yuan ⋅ Xin Liu ⋅ Kaishun Wu
ExHall D Poster #295
PSBD: Prediction Shift Uncertainty Unlocks Backdoor Detection Poster Session 2
Wei Li ⋅ Pin-Yu Chen ⋅ Sijia Liu ⋅ Ren Wang
ExHall D Poster #465
Improve Representation for Imbalanced Regression through Geometric Constraints Poster Session 1
Zijian Dong ⋅ Yilei Wu ⋅ Chongyao Chen ⋅ Yingtian Zou ⋅ Yichi Zhang ⋅ Juan Helen Zhou
ExHall D Poster #470
Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding Poster Session 2
Seil Kang ⋅ Jinyeong Kim ⋅ Junhyeok Kim ⋅ Seong Jae Hwang
ExHall D Poster #378
AnySat: One Earth Observation Model for Many Resolutions, Scales, and Modalities Poster Session 4
Guillaume Astruc ⋅ Nicolas Gonthier ⋅ Clement Mallet ⋅ Loic Landrieu
ExHall D Poster #355
MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation Poster Session 3
Aviral Chharia ⋅ Wenbo Gou ⋅ Haoye Dong
ExHall D Poster #91
JTD-UAV: MLLM-Enhanced Joint Tracking and Description Framework for Anti-UAV Systems Poster Session 1
Yifan Wang ⋅ Jian Zhao ⋅ Zhaoxin Fan ⋅ Xin Zhang ⋅ Xuecheng Wu ⋅ Yudian Zhang ⋅ Lei Jin ⋅ Xinyue Li ⋅ Gang Wang ⋅ Mengxi Jia ⋅ Ping Hu ⋅ Zheng Zhu ⋅ Xuelong Li
ExHall D Poster #137
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Poster Session 1
Lingen Li ⋅ Zhaoyang Zhang ⋅ Yaowei Li ⋅ Jiale Xu ⋅ Wenbo Hu ⋅ Xiaoyu Li ⋅ Weihao Cheng ⋅ Jinwei Gu ⋅ Tianfan Xue ⋅ Ying Shan
ExHall D Poster #57
Stop Learning it all to Mitigate Visual Hallucination, Focus on the Hallucination Target. Poster Session 1
Dokyoon Yoon ⋅ Youngsook Song ⋅ Woomyoung Park
ExHall D Poster #385
Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior Poster Session 1
Haitao Wu ⋅ Qing Li ⋅ Changqing Zhang ⋅ Zhen He ⋅ Xiaomin Ying
ExHall D Poster #196
VLMs-Guided Representation Distillation for Efficient Vision-Based Reinforcement Learning Poster Session 6
Haoran Xu ⋅ Peixi Peng ⋅ Guang Tan ⋅ Yiqian Chang ⋅ Luntong Li ⋅ Yonghong Tian
ExHall D Poster #323
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents Poster Session 2
Yunseok Jang ⋅ Yeda Song ⋅ Sungryull Sohn ⋅ Lajanugen Logeswaran ⋅ Tiange Luo ⋅ Dong-Ki Kim ⋅ GyungHoon Bae ⋅ Honglak Lee
ExHall D Poster #308
Multi-Modal Aerial-Ground Cross-View Place Recognition with Neural ODEs Poster Session 3
Sijie Wang ⋅ Rui She ⋅ Qiyu Kang ⋅ Siqi Li ⋅ Disheng Li ⋅ Tianyu Geng ⋅ Shangshu Yu ⋅ Wee Peng Tay
ExHall D Poster #103
AniGrad: Anisotropic Gradient-Adaptive Sampling for 3D Reconstruction From Monocular Video Poster Session 5
Noah Stier ⋅ Alex Rich ⋅ Pradeep Sen ⋅ Tobias Höllerer
ExHall D Poster #73
Progressive Focused Transformer for Single Image Super-Resolution Poster Session 1
Wei Long ⋅ Xingyu Zhou ⋅ Leheng Zhang ⋅ Shuhang Gu
ExHall D Poster #199
Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation Poster Session 2
Reza Qorbani ⋅ Gianluca Villani ⋅ Theodoros Panagiotakopoulos ⋅ Marc Botet Colomer ⋅ Linus Härenstam-Nielsen ⋅ Mattia Segu ⋅ Pier Luigi Dovesi ⋅ Jussi Karlgren ⋅ Daniel Cremers ⋅ Federico Tombari ⋅ Matteo Poggi
ExHall D Poster #422
Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training Poster Session 2
Lexington Whalen ⋅ Zhenbang Du ⋅ Haoran You ⋅ Chaojian Li ⋅ Sixu Li ⋅ Yingyan (Celine) Lin
ExHall D Poster #221
Token Cropr: Faster ViTs for Quite a Few Tasks Poster Session 2
Benjamin Bergner ⋅ Christoph Lippert ⋅ Aravindh Mahendran
ExHall D Poster #416
ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting Poster Session 6
Guo Junfu ⋅ Yu Xin ⋅ Gaoyi Liu ⋅ Kai Xu ⋅ Ligang Liu ⋅ Ruizhen Hu
ExHall D Poster #95
Detecting Out-of-Distribution Through the Lens of Neural Collapse Poster Session 3
Litian Liu ⋅ Yao Qin
ExHall D Poster #457
Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features Poster Session 6
Yuanbo Xiangli ⋅ Ruojin Cai ⋅ Hanyu Chen ⋅ Jeffrey Byrne ⋅ Noah Snavely
ExHall D Poster #97
DART: Disease-aware Image-Text Alignment and Self-correcting Re-alignment for Trustworthy Radiology Report Generation Poster Session 3
Sang-Jun Park ⋅ Keun-Soo Heo ⋅ Dong-Hee Shin ⋅ Young-Han Son ⋅ Ji-Hye Oh ⋅ Tae-Eui Kam
ExHall D Poster #472
Argus: A Compact and Versatile Foundation Model for Vision Poster Session 1
Weiming Zhuang ⋅ Chen Chen ⋅ Zhizhong Li ⋅ Sina Sajadmanesh ⋅ Jingtao Li ⋅ Jiabo Huang ⋅ Vikash Sehwag ⋅ Vivek Sharma ⋅ Hirotaka Shinozaki ⋅ Felan Carlo Garcia ⋅ Yihao Zhan ⋅ Naohiro Adachi ⋅ Ryoji Eki ⋅ Michael Spranger ⋅ Peter Stone ⋅ Lingjuan Lyu
ExHall D Poster #408
Multi-View Pose-Agnostic Change Localization with Zero Labels Poster Session 3
Chamuditha Jayanga Galappaththige ⋅ Jason Lai ⋅ Lloyd Windrim ⋅ Donald G. Dansereau ⋅ Niko Suenderhauf ⋅ Dimity Miller
ExHall D Poster #92
TSP-Mamba: The Travelling Salesman Problem Meets Mamba for Image Super-resolution and Beyond Poster Session 6
Kun Zhou ⋅ Xinyu Lin ⋅ Jiangbo Lu
ExHall D Poster #187
Motions as Queries: One-Stage Multi-Person Holistic Human Motion Capture Poster Session 4
Kenkun Liu ⋅ Yurong Fu ⋅ Weihao Yuan ⋅ Jing Lin ⋅ Peihao Li ⋅ Xiaodong Gu ⋅ Lingteng Qiu ⋅ Haoqian Wang ⋅ Zilong Dong ⋅ Xiaoguang Han
ExHall D Poster #165
VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding Poster Session 5
Yujie Liang ⋅ Xiaobin Hu ⋅ Boyuan Jiang ⋅ Donghao Luo ⋅ Xu Peng ⋅ Kai WU ⋅ Chengming Xu ⋅ Wenhui Han ⋅ Taisong Jin ⋅ Chengjie Wang ⋅ Rongrong Ji
ExHall D Poster #148
Any6D: Model-free 6D Pose Estimation of Novel Object Poster Session 3
Taeyeop Lee ⋅ Bowen Wen ⋅ Minjun Kang ⋅ Gyuree Kang ⋅ In So Kweon ⋅ Kuk-Jin Yoon
ExHall D Poster #95
DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation Poster Session 1
Zhiqiang Shen ⋅ Ammar Sherif ⋅ Zeyuan Yin ⋅ Shitong Shao
ExHall D Poster #443
Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail Poster Session 1
Luca Bartolomei ⋅ Fabio Tosi ⋅ Matteo Poggi ⋅ Stefano Mattoccia
ExHall D Poster #79
SpiritSight Agent: Advanced GUI Agent with One Look Poster Session 6
Zhiyuan Huang ⋅ Ziming Cheng ⋅ Junting Pan ⋅ Zhaohui Hou ⋅ Mingjie Zhan
ExHall D Poster #319
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers Poster Session 4
Hang Zhou ⋅ Xinxin Zuo ⋅ Rui Ma ⋅ Li Cheng
ExHall D Poster #333
Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects Poster Session 5
Weimin Qiu ⋅ Jieke Wang ⋅ Meng Tang
ExHall D Poster #236
3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations Poster Session 5
yating wang ⋅ Xuan Wang ⋅ Ran Yi ⋅ Yanbo Fan ⋅ Jichen Hu ⋅ Jingcheng Zhu ⋅ Lizhuang Ma
ExHall D Poster #7
Dual Focus-Attention Transformer for Robust Point Cloud Registration Poster Session 3
Kexue Fu ⋅ Ming'zhi Yuan ⋅ Changwei Wang ⋅ Weiguang Pang ⋅ Jing Chi ⋅ Manning Wang ⋅ Longxiang Gao
ExHall D Poster #108
Forming Auxiliary High-confident Instance-level Loss to Promote Learning from Label Proportions Poster Session 4
Tianhao Ma ⋅ Han Chen ⋅ Juncheng Hu ⋅ Yungang Zhu ⋅ Ximing Li
ExHall D Poster #455
SMTPD: A New Benchmark for Temporal Prediction of Social Media Popularity Poster Session 4
Yijie Xu ⋅ Bolun Zheng ⋅ Wei Zhu ⋅ Hangjia Pan ⋅ Yuchen Yao ⋅ Ning Xu ⋅ An-An Liu ⋅ Quan Zhang ⋅ Chenggang Yan
ExHall D Poster #292
Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model Poster Session 2
Changchang Sun ⋅ Gaowen Liu ⋅ Charles Fleming ⋅ Yan Yan
ExHall D Poster #282
Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification Poster Session 2
S P Sharan ⋅ Minkyu Choi ⋅ Sahil Shah ⋅ Harsh Goel ⋅ Mohammad Omama ⋅ Sandeep P. Chinchali
ExHall D Poster #289
Spherical Manifold Guided Diffusion Model for Panoramic Image Generation Poster Session 2
Xiancheng Sun ⋅ Mai Xu ⋅ Shengxi Li ⋅ Senmao Ma ⋅ Xin Deng ⋅ Lai Jiang ⋅ Shen gang
ExHall D Poster #36
Learning on Model Weights using Tree Experts Poster Session 4
Eliahu Horwitz ⋅ Bar Cavia ⋅ Jonathan Kahana ⋅ Yedid Hoshen
ExHall D Poster #444
Rethinking Query-based Transformer for Continual Image Segmentation Poster Session 1
Yuchen Zhu ⋅ Cheng Shi ⋅ Dingyou Wang ⋅ Jiajin Tang ⋅ Zhengxuan Wei ⋅ Yu Wu ⋅ Guanbin Li ⋅ Sibei Yang
ExHall D Poster #424
Image Reconstruction from Readout-Multiplexed Single-Photon Detector Arrays Poster Session 3
Shashwath Bharadwaj ⋅ Ruangrawee Kitichotkul ⋅ Akshay Agarwal ⋅ Vivek K Goyal
ExHall D Poster #71
Towards Smart Point-and-Shoot Photography Poster Session 6
Jiawan Li ⋅ Fei Zhou ⋅ Zhipeng Zhong ⋅ Jiongzhi Lin ⋅ Guoping Qiu
ExHall D Poster #198
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding Poster Session 1
Ying Chen ⋅ Guoan Wang ⋅ Yuanfeng Ji ⋅ Yanjun Li ⋅ Jin Ye ⋅ Tianbin Li ⋅ Ming Hu ⋅ Rongshan Yu ⋅ Yu Qiao ⋅ Junjun He
ExHall D Poster #475
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance Poster Session 4
Shulei Wang ⋅ Wang Lin ⋅ Hai Huang ⋅ Hanting Wang ⋅ Sihang Cai ⋅ WenKang Han ⋅ Tao Jin ⋅ Jingyuan Chen ⋅ Jiacheng Sun ⋅ Jieming Zhu ⋅ Zhou Zhao
ExHall D Poster #256
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation Poster Session 3
Andrea Maracani ⋅ Savas Ozkan ⋅ Sijun Cho ⋅ Hyo-Won Kim ⋅ Eunchung Noh ⋅ Jeongwon Min ⋅ Cho Jung Min ⋅ Dookun Park ⋅ Mete Ozay
ExHall D Poster #369
On the Consistency of Video Large Language Models in Temporal Comprehension Poster Session 3
Minjoon Jung ⋅ Junbin Xiao ⋅ Byoung-Tak Zhang ⋅ Angela Yao
ExHall D Poster #293
Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation Poster Session 5
Jiaming Zhou ⋅ Teli Ma ⋅ Kun-Yu Lin ⋅ Zifan Wang ⋅ Ronghe Qiu ⋅ Junwei Liang
ExHall D Poster #142
One-Minute Video Generation with Test-Time Training Poster Session 4
Jiarui Xu ⋅ Shihao Han ⋅ Karan Dalal ⋅ Daniel Koceja ⋅ Yue Zhao ⋅ Ka Chun Cheung ⋅ Yejin Choi ⋅ Jan Kautz ⋅ Yu Sun ⋅ Xiaolong Wang
ExHall D Poster #181
Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding Poster Session 1
Wenxuan Guo ⋅ Xiuwei Xu ⋅ Ziwei Wang ⋅ Jianjiang Feng ⋅ Jie Zhou ⋅ Jiwen Lu
ExHall D Poster #333
Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space Poster Session 2
Leonhard Sommer ⋅ Olaf Dünkel ⋅ Christian Theobalt ⋅ Adam Kortylewski
ExHall D Poster #104
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Poster Session 1
Hongjie Wang ⋅ Chih-Yao Ma ⋅ Yen-Cheng Liu ⋅ Ji Hou ⋅ Tao Xu ⋅ Jialiang Wang ⋅ Felix Juefei-Xu ⋅ Yaqiao Luo ⋅ Peizhao Zhang ⋅ Tingbo Hou ⋅ Peter Vajda ⋅ Niraj Jha ⋅ Xiaoliang Dai
ExHall D Poster #231
EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting Poster Session 3
Dong In Lee ⋅ Hyeongcheol Park ⋅ Jiyoung Seo ⋅ Eunbyung Park ⋅ Hyunje Park ⋅ Ha Dam Baek ⋅ Shin sangheon ⋅ sangmin kim ⋅ Sangpil Kim
ExHall D Poster #46
SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language Poster Session 6
zehan wang ⋅ Sashuai zhou ⋅ Shaoxuan He ⋅ Haifeng Huang ⋅ Lihe Yang ⋅ Ziang Zhang ⋅ Xize Cheng ⋅ Shengpeng Ji ⋅ Tao Jin ⋅ Hengshuang Zhao ⋅ Zhou Zhao
ExHall D Poster #336
How to Merge Your Multimodal Models Over Time? Poster Session 4
Sebastian Dziadzio ⋅ Vishaal Udandarao ⋅ Karsten Roth ⋅ Ameya Prabhu ⋅ Zeynep Akata ⋅ Samuel Albanie ⋅ Matthias Bethge
ExHall D Poster #445
Identifying and Mitigating Position Bias of Multi-image Vision-Language Models Poster Session 3
Xinyu Tian ⋅ Shu Zou ⋅ Zhaoyuan Yang ⋅ Jing Zhang
ExHall D Poster #376
Unsupervised Foundation Model-Agnostic Slide-Level Representation Learning Poster Session 6
Tim Lenz ⋅ Peter Neidlinger ⋅ Marta Ligero ⋅ Georg Wölflein ⋅ Marko van Treeck ⋅ Jakob Nikolas Kather
ExHall D Poster #446
MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation Poster Session 6
Huaize Liu ⋅ WenZhang Sun ⋅ Donglin Di ⋅ Shibo Sun ⋅ Jiahui Yang ⋅ Hujun Bao ⋅ Changqing Zou
ExHall D Poster #2
ReCap: Better Gaussian Relighting with Cross-Environment Captures Poster Session 5
Jingzhi Li ⋅ Zongwei Wu ⋅ Eduard Zamfir ⋅ Radu Timofte
ExHall D Poster #25
Split Adaptation for Pre-trained Vision Transformers Poster Session 4
Lixu Wang ⋅ Bingqi Shang ⋅ Yi Li ⋅ Payal Mohapatra ⋅ Wei Dong ⋅ Xiao Wang ⋅ Qi Zhu
ExHall D Poster #409
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models Poster Session 4
Wufei Ma ⋅ Luoxin Ye ⋅ Nessa McWeeney ⋅ Celso M. de Melo ⋅ Alan L. Yuille ⋅ Jieneng Chen
ExHall D Poster #137
Consistency Posterior Sampling for Diverse Image Synthesis Poster Session 6
Vishal Purohit ⋅ Matthew Repasky ⋅ Jianfeng Lu ⋅ Qiang Qiu ⋅ Yao Xie ⋅ Xiuyuan Cheng
ExHall D Poster #206
IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement Poster Session 6
Zhihao Shi ⋅ Dong Huo ⋅ Yuhongze Zhou ⋅ Yan Min ⋅ Juwei Lu ⋅ Xinxin Zuo
ExHall D Poster #51
ActiveGAMER: Active GAussian Mapping through Efficient Rendering Poster Session 4
Liyan Chen ⋅ Huangying Zhan ⋅ Kevin Chen ⋅ Xiangyu Xu ⋅ Qingan Yan ⋅ Changjiang Cai ⋅ Yi Xu
ExHall D Poster #62
DeepCompress-ViT: Rethinking Model Compression to Enhance Efficiency of Vision Transformers at the Edge Poster Session 6
Sabbir Ahmed ⋅ Abdullah Al Arafat ⋅ Deniz Najafi ⋅ Akhlak Mahmood ⋅ Mamshad Nayeem Rizve ⋅ Mohaiminul Al Nahian ⋅ RANYANG ZHOU ⋅ Shaahin Angizi ⋅ Adnan Rakin Rakin
ExHall D Poster #382
EvOcc: Accurate Semantic Occupancy for Automated Driving Using Evidence Theory Poster Session 6
Jonas Kälble ⋅ Sascha Wirges ⋅ Maxim Tatarchenko ⋅ Eddy Ilg
ExHall D Poster #125
Positive2Negative: Breaking the Information-Lossy Barrier in Self-Supervised Single Image Denoising Poster Session 4
Tong Li ⋅ Lizhi Wang ⋅ Zhiyuan Xu ⋅ Lin Zhu ⋅ Wanxuan Lu ⋅ Hua Huang
ExHall D Poster #202
PGC: Physics-Based Gaussian Cloth from a Single Pose Poster Session 5
Michelle Guo ⋅ Matt Jen-Yuan Chiang ⋅ Igor Santesteban ⋅ Nikolaos Sarafianos ⋅ Hsiaoyu Chen ⋅ Oshri Halimi ⋅ Aljaž Božič ⋅ Shunsuke Saito ⋅ Jiajun Wu ⋅ Karen Liu ⋅ Tuur Stuyck ⋅ Egor Larionov
ExHall D Poster #16
Joint Vision-Language Social Bias Removal for CLIP Poster Session 1
Haoyu Zhang ⋅ Yangyang Guo ⋅ Mohan Kankanhalli
ExHall D Poster #389
StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation Poster Session 6
Shangjin Zhai ⋅ Zhichao Ye ⋅ Jialin Liu ⋅ Weijian Xie ⋅ Jiaqi Hu ⋅ Zhen Peng ⋅ Hua Xue ⋅ Danpeng Chen ⋅ Xiaomeng Wang ⋅ Lei Yang ⋅ Nan Wang ⋅ Haomin Liu ⋅ Guofeng Zhang
ExHall D Poster #65
MonSter: Marry Monodepth to Stereo Unleashes Power Poster Session 2
JunDa Cheng ⋅ Longliang Liu ⋅ Gangwei Xu ⋅ Xianqi Wang ⋅ Zhaoxing Zhang ⋅ Yong Deng ⋅ Jinliang Zang ⋅ Yurui Chen ⋅ zhipeng cai ⋅ Xin Yang
ExHall D Poster #82
Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting Poster Session 4
Shu-Wei Lu ⋅ Yi-Hsuan Tsai ⋅ Yi-Ting Chen
ExHall D Poster #125
Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models Poster Session 4
Reza Shirkavand ⋅ Peiran Yu ⋅ Shangqian Gao ⋅ Gowthami Somepalli ⋅ Tom Goldstein ⋅ Heng Huang
ExHall D Poster #271
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments Poster Session 3
Jianhao Zheng ⋅ Zihan Zhu ⋅ Valentin Bieri ⋅ Marc Pollefeys ⋅ Songyou Peng ⋅ Iro Armeni
ExHall D Poster #76
A Tale of Two Classes: Adapting Supervised Contrastive Learning to Binary Imbalanced Datasets Poster Session 2
David Mildenberger ⋅ Paul Hager ⋅ Daniel Rueckert ⋅ Martin J. Menten
ExHall D Poster #470
DEAL: Data-Efficient Adversarial Learning for High-Quality Infrared Imaging Poster Session 6
Zhu Liu ⋅ Zijun Wang ⋅ Jinyuan Liu ⋅ Fanqi Meng ⋅ Long Ma ⋅ Risheng Liu
ExHall D Poster #194
RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance Poster Session 3
Yuheng Jiang ⋅ Zhehao Shen ⋅ Chengcheng Guo ⋅ Yu Hong ⋅ Zhuo Su ⋅ Yingliang Zhang ⋅ Marc Habermann ⋅ Lan Xu
ExHall D Poster #66
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning Poster Session 4
Yang Yue ⋅ Yulin Wang ⋅ Chenxin Tao ⋅ Pan Liu ⋅ Shiji Song ⋅ Gao Huang
ExHall D Poster #473
Re-thinking Temporal Search for Long-Form Video Understanding Poster Session 2
Jinhui Ye ⋅ Zihan Wang ⋅ Haosen Sun ⋅ Keshigeyan Chandrasegaran ⋅ Zane Durante ⋅ Cristobal Eyzaguirre ⋅ Yonatan Bisk ⋅ Juan Carlos Niebles ⋅ Ehsan Adeli ⋅ Li Fei-Fei ⋅ Jiajun Wu ⋅ Manling Li
ExHall D Poster #306
RivuletMLP: An MLP-based Architecture for Efficient Compressed Video Quality Enhancement Poster Session 2
Gang He ⋅ Weiran Wang ⋅ Guancheng Quan ⋅ Shihao Wang ⋅ Dajiang Zhou ⋅ Yunsong Li
ExHall D Poster #189
OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints Poster Session 4
Mingjie Pan ⋅ Jiyao Zhang ⋅ Tianshu Wu ⋅ Yinghao Zhao ⋅ Wenlong Gao ⋅ Hao Dong
ExHall D Poster #150
Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models Poster Session 6
Itay Benou ⋅ Tammy Riklin Raviv
ExHall D Poster #374
Reanimating Images using Neural Representations of Dynamic Stimuli Poster Session 2
Jacob Yeung ⋅ Andrew Luo ⋅ Gabriel Sarch ⋅ Margaret Marie Henderson ⋅ Deva Ramanan ⋅ Michael J. Tarr
ExHall D Poster #220
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Poster Session 2
Chaehun Shin ⋅ Jooyoung Choi ⋅ Heeseung Kim ⋅ Sungroh Yoon
ExHall D Poster #250
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors Poster Session 4
Riku Murai ⋅ Eric Dexheimer ⋅ Andrew J. Davison
ExHall D Poster #83
Cross-modal Information Flow in Multimodal Large Language Models Poster Session 4
Zhi Zhang ⋅ Srishti Yadav ⋅ Fengze Han ⋅ Ekaterina Shutova
ExHall D Poster #379
Consistent and Controllable Image Animation with Motion Diffusion Models Poster Session 2
Xin Ma ⋅ Yaohui Wang ⋅ Gengyun Jia ⋅ Xinyuan Chen ⋅ Tien-Tsin Wong ⋅ Yuan-Fang Li ⋅ Cunjian Chen
ExHall D Poster #184
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards Poster Session 5
Zijing Hu ⋅ Fengda Zhang ⋅ Long Chen ⋅ Kun Kuang ⋅ Jiahui Li ⋅ Kaifeng Gao ⋅ Jun Xiao ⋅ Xin Wang ⋅ Wenwu Zhu
ExHall D Poster #245
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models Poster Session 5
Xingrui Wang ⋅ Wufei Ma ⋅ Tiezheng Zhang ⋅ Celso M. de Melo ⋅ Jieneng Chen ⋅ Alan L. Yuille
ExHall D Poster #348
Omnidirectional Multi-Object Tracking Poster Session 5
Kai Luo ⋅ Hao Shi ⋅ Sheng Wu ⋅ Fei Teng ⋅ Mengfei Duan ⋅ Chang Huang ⋅ Yuhang Wang ⋅ Kaiwei Wang ⋅ Kailun Yang
ExHall D Poster #87
Potential Field Based Deep Metric Learning Poster Session 5
Shubhang Bhatnagar ⋅ Narendra Ahuja
ExHall D Poster #431
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver Poster Session 2
Cong Wei ⋅ Haoxian Tan ⋅ Yujie Zhong ⋅ Yong Liu ⋅ Jie Hu ⋅ Dengjie Li ⋅ Zheng Zhao ⋅ Yujiu Yang
ExHall D Poster #341
Diffusion-based Event Generation for High-Quality Image Deblurring Poster Session 1
Xinan Xie ⋅ Qing Zhang ⋅ Wei-Shi Zheng
ExHall D Poster #190
Video Summarization with Large Language Models Poster Session 4
Min Jung Lee ⋅ Dayoung Gong ⋅ Minsu Cho
ExHall D Poster #304
Sketchtopia: A Dataset and Foundational Agents for Benchmarking Asynchronous Multimodal Communication with Iconic Feedback Poster Session 4
Mohd Hozaifa Khan ⋅ Ravi Kiran Sarvadevabhatla
ExHall D Poster #226
Seeing Speech and Sound: Distinguishing and Locating Audio Sources in Visual Scenes Poster Session 3
Hyeonggon Ryu ⋅ Seongyu Kim ⋅ Joon Chung ⋅ Arda Senocak
ExHall D Poster #276
Consistency-aware Self-Training for Iterative-based Stereo Matching Poster Session 4
Jingyi Zhou ⋅ Peng Ye ⋅ Haoyu Zhang ⋅ Jiakang Yuan ⋅ Rao Qiang ⋅ Liu YangChenXu ⋅ Wu Cailin ⋅ Feng Xu ⋅ Tao Chen
ExHall D Poster #77
Balanced Rate-Distortion Optimization in Learned Image Compression Poster Session 1
Yichi Zhang ⋅ Zhihao Duan ⋅ Yuning Huang ⋅ Fengqing Zhu
ExHall D Poster #213
Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer Poster Session 2
Ziyi Liu ⋅ Yangcen Liu
ExHall D Poster #319
HomoGen: Enhanced Video Inpainting via Homography Propagation and Diffusion Poster Session 5
Ding Ding ⋅ Yueming Pan ⋅ Ruoyu Feng ⋅ Qi Dai ⋅ Kai Qiu ⋅ Jianmin Bao ⋅ Chong Luo ⋅ Zhenzhong Chen
ExHall D Poster #180
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model Poster Session 4
Zhaochong An ⋅ Guolei Sun ⋅ Yun Liu ⋅ Runjia Li ⋅ Junlin Han ⋅ Ender Konukoglu ⋅ Serge Belongie
ExHall D Poster #113
Online Task-Free Continual Learning via Dynamic Expansionable Memory Distribution Poster Session 4
Fei Ye ⋅ Adrian Bors
ExHall D Poster #448
Seeing is Not Believing: Adversarial Natural Object Optimization for Hard-Label 3D Scene Attacks Poster Session 3
Daizong Liu ⋅ Wei Hu
ExHall D Poster #120
Once-Tuning-Multiple-Variants: Tuning Once and Expanded as Multiple Vision-Language Model Variants Poster Session 3
Chong Yu ⋅ Tao Chen ⋅ Zhongxue Gan
ExHall D Poster #389
Rethinking Epistemic and Aleatoric Uncertainty for Active Open-Set Annotation: An Energy-Based Approach Poster Session 2
Chen-Chen Zong ⋅ Sheng-Jun Huang
ExHall D Poster #455
Minimizing Labeled, Maximizing Unlabeled: An Image-Driven Approach for Video Instance Segmentation Poster Session 4
Fangyun Wei ⋅ Jinjing Zhao ⋅ Kun Yan ⋅ Chang Xu
ExHall D Poster #334
Do Visual Imaginations Improve Vision-and-Language Navigation Agents? Poster Session 1
Akhil Perincherry ⋅ Jacob Krantz ⋅ Stefan Lee
ExHall D Poster #350
MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds Poster Session 2
Jiahui Lei ⋅ Yijia Weng ⋅ Adam W Harley ⋅ Leonidas Guibas ⋅ Kostas Daniilidis
ExHall D Poster #70
MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking Poster Session 4
Haolin Qin ⋅ Tingfa Xu ⋅ Tianhao Li ⋅ Zhenxiang Chen ⋅ Tao Feng ⋅ Jianan Li
ExHall D Poster #101
IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera Poster Session 6
Jian Huang ⋅ Chengrui Dong ⋅ Xuanhua Chen ⋅ Peidong Liu
ExHall D Poster #75
FASTer: Focal token Acquiring-and-Scaling Transformer for Long-term 3D Object Detection Poster Session 4
Chenxu Dang ⋅ Pei An ⋅ Xinmin Zhang ⋅ ZaiPeng Duan ⋅ Xuzhong Hu ⋅ Jie Ma
ExHall D Poster #116
SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction Poster Session 2
ZaiPeng Duan ⋅ Xuzhong Hu ⋅ Pei An ⋅ Junfeng Ding ⋅ Jie Zhan ⋅ Chenxu Dang ⋅ Yunbiao Xu ⋅ Jie Ma
ExHall D Poster #132
Cheb-GR: Rethinking K-nearest Neighbor Search in Re-ranking for Person Re-identification Poster Session 4
Jinxi Yang ⋅ He Li ⋅ Bo Du ⋅ Mang Ye
ExHall D Poster #330
FIction: 4D Future Interaction Prediction from Video Poster Session 4
Kumar Ashutosh ⋅ Georgios Pavlakos ⋅ Kristen Grauman
ExHall D Poster #173
Boost Your Human Image Generation Model via Direct Preference Optimization Poster Session 5
Sanghyeon Na ⋅ Yonggyu Kim ⋅ Hyunjoon Lee
ExHall D Poster #238
SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving Poster Session 3
Georg Hess ⋅ Carl Lindström ⋅ Maryam Fatemi ⋅ Christoffer Petersson ⋅ Lennart Svensson
ExHall D Poster #129
Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes Poster Session 3
Aodi Li ⋅ Liansheng Zhuang ⋅ Xiao Long ⋅ MingHong Yao ⋅ Shafei Wang
ExHall D Poster #450
Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras Poster Session 6
Hoonhee Cho ⋅ Jae-Young Kang ⋅ Youngho Kim ⋅ Kuk-Jin Yoon
ExHall D Poster #100
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation Poster Session 2
Ziyang Luo ⋅ Haoning Wu ⋅ Dongxu Li ⋅ Jing Ma ⋅ Mohan Kankanhalli ⋅ Junnan Li
ExHall D Poster #295
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation Poster Session 5
Taeyoung Yun ⋅ Dinghuai Zhang ⋅ Jinkyoo Park ⋅ Ling Pan
ExHall D Poster #248
Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition Poster Session 2
Yang Chen ⋅ Jingcai Guo ⋅ Song Guo ⋅ Dacheng Tao
ExHall D Poster #320
GIF: Generative Inspiration for Face Recognition at Scale Poster Session 1
Mohammad Saadabadi Saadabadi ⋅ Sahar Rahimi Malakshan ⋅ Ali Dabouei ⋅ Srinjoy Das ⋅ Jeremy M. Dawson ⋅ Nasser M Nasrabadi
ExHall D Poster #320
Pos3R: 6D Pose Estimation for Unseen Objects Made Easy Poster Session 4
Weijian Deng ⋅ Dylan Campbell ⋅ Chunyi Sun ⋅ Jiahao Zhang ⋅ Shubham Kanitkar ⋅ Matthew Shaffer ⋅ Stephen Gould
ExHall D Poster #95
Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation Poster Session 4
Yu Qi ⋅ Yuanchen Ju ⋅ Tianming Wei ⋅ Chi Chu ⋅ Lawson L.S. Wong ⋅ Huazhe Xu
ExHall D Poster #152
Co-op: Correspondence-based Novel Object Pose Estimation Poster Session 3
Sungphill Moon ⋅ Hyeontae Son ⋅ Dongcheol Hur ⋅ Sangwook Kim
ExHall D Poster #94
Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning Poster Session 3
Juntae Lee ⋅ Munawar Hayat ⋅ Sungrack Yun
ExHall D Poster #448
ALIEN: Implicit Neural Representations for Human Motion Prediction under Arbitrary Latency Poster Session 1
Dong Wei ⋅ Xiaoning Sun ⋅ Xizhan Gao ⋅ Shengxiang Hu ⋅ Huaijiang Sun
ExHall D Poster #157
Seurat: From Moving Points to Depth Poster Session 2
Seokju Cho ⋅ Gabriel Huang ⋅ Seungryong Kim ⋅ Joon-Young Lee
ExHall D Poster #177
MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts Poster Session 4
Peijie Wang ⋅ Zhong-Zhi Li ⋅ Fei Yin ⋅ Dekang Ran ⋅ Cheng-Lin Liu
ExHall D Poster #356
DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering Poster Session 1
Yihao Wang ⋅ Marcus Klasson ⋅ Matias Turkulainen ⋅ Shuzhe Wang ⋅ Juho Kannala ⋅ Arno Solin
ExHall D Poster #52
EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation Poster Session 1
Daikun Liu ⋅ Lei Cheng ⋅ Teng Wang ⋅ Changyin Sun
ExHall D Poster #168
Dual Prompting Image Restoration with Diffusion Transformers Poster Session 3
Dehong Kong ⋅ Fan Li ⋅ Zhixin Wang ⋅ Jiaqi Xu ⋅ Renjing Pei ⋅ Wenbo Li ⋅ Wenqi Ren
ExHall D Poster #206
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability Poster Session 2
Weijie Zhou ⋅ Manli Tao ⋅ Chaoyang Zhao ⋅ Haiyun Guo ⋅ Honghui Dong ⋅ Ming Tang ⋅ Jinqiao Wang
ExHall D Poster #150
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection Poster Session 5
Hou-I Liu ⋅ Christine Wu ⋅ Jen-Hao Cheng ⋅ Wenhao Chai ⋅ Shian-yun Wang ⋅ Gaowen Liu ⋅ Hugo Latapie ⋅ Jhih-Ciang Wu ⋅ Jenq-Neng Hwang ⋅ Hong-Han Shuai ⋅ Wen-Huang Cheng
ExHall D Poster #116
Foveated Instance Segmentation Poster Session 5
Hongyi Zeng ⋅ Wenxuan Liu ⋅ Tianhua Xia ⋅ Jinhui Chen ⋅ Ziyun Li ⋅ Sai Qian Zhang
ExHall D Poster #331
Zero-Shot Head Swapping in Real-World Scenarios Poster Session 3
Sohyun Jeong ⋅ Taewoong Kang ⋅ Hyojin Jang ⋅ Jaegul Choo
ExHall D Poster #14
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors Poster Session 1
Wonbong Jang ⋅ Philippe Weinzaepfel ⋅ Vincent Leroy ⋅ Lourdes Agapito ⋅ Jerome Revaud
ExHall D Poster #84
Sampling Innovation-Based Adaptive Compressive Sensing Poster Session 1
Zhifu Tian ⋅ Tao Hu ⋅ Chaoyang Niu ⋅ Di Wu ⋅ Shu Wang
ExHall D Poster #209
Scalable Autoregressive Monocular Depth Estimation Poster Session 2
Jinhong Wang ⋅ Jintai Chen ⋅ Jian liu ⋅ Dongqi Tang ⋅ Wentong Li ⋅ Weiqiang Wang ⋅ Danny Chen ⋅ Jian Wu
ExHall D Poster #79
Parameterized Blur Kernel Prior Learning for Local Motion Deblurring Poster Session 5
Zhenxuan Fang ⋅ Fangfang Wu ⋅ Tao Huang ⋅ Le Dong ⋅ Weisheng Dong ⋅ Xin Li ⋅ Guangming Shi
ExHall D Poster #185
Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map Poster Session 2
Xinyuan Chang ⋅ Maixuan Xue ⋅ Xinran Liu ⋅ Zheng Pan ⋅ Xing Wei
ExHall D Poster #139
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation Poster Session 1
Datao Tang ⋅ Xiangyong Cao ⋅ Xuan Wu ⋅ Jialin Li ⋅ Jing Yao ⋅ Xueru Bai ⋅ Dongsheng Jiang ⋅ Yin Li ⋅ Deyu Meng
ExHall D Poster #328
IDEA-Bench: How Far are Generative Models from Professional Designing? Poster Session 4
Chen Liang ⋅ Lianghua Huang ⋅ Jingwu Fang ⋅ Huanzhang Dou ⋅ Wei Wang ⋅ Zhi-Fan Wu ⋅ Yupeng Shi ⋅ Junge Zhang ⋅ Xin Zhao ⋅ Yu Liu
ExHall D Poster #264
Enhancing Dataset Distillation via Non-Critical Region Refinement Poster Session 2
Minh-Tuan Tran ⋅ Trung Le ⋅ Xuan-May Le ⋅ Thanh-Toan Do ⋅ Dinh Phung
ExHall D Poster #442
Logits DeConfusion with CLIP for Few-Shot Learning Poster Session 5
Shuo Li ⋅ Fang Liu ⋅ Zehua Hao ⋅ Xinyi Wang ⋅ Lingling Li ⋅ Xu Liu ⋅ Puhua Chen ⋅ Wenping Ma
ExHall D Poster #417
When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning Poster Session 5
Yang Liu ⋅ Qianqian Xu ⋅ Peisong Wen ⋅ Siran Dai ⋅ Qingming Huang
ExHall D Poster #288
GraphMimic: Graph-to-Graphs Generative Modeling from Videos for Policy Learning Poster Session 1
Guangyan Chen ⋅ Te Cui ⋅ Meiling Wang ⋅ Yang Chengcai ⋅ Mengxiao Hu ⋅ Haoyang Lu ⋅ Yao Mu ⋅ Zicai Peng ⋅ Tianxing Zhou ⋅ XINRAN JIANG ⋅ Yi Yang ⋅ Yufeng Yue
ExHall D Poster #148
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching Poster Session 5
Hualie Jiang ⋅ Zhiqiang Lou ⋅ Laiyan Ding ⋅ Rui Xu ⋅ Minglang Tan ⋅ jerett ⋅ Rui Huang
ExHall D Poster #77
Minding Fuzzy Regions: A Data-driven Alternating Learning Paradigm for Stable Lesion Segmentation Poster Session 2
Lexin Fang ⋅ Yunyang Xu ⋅ Xiang Ma ⋅ Xuemei Li ⋅ Caiming Zhang
ExHall D Poster #481
LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting Poster Session 1
Xiaoyan Xing ⋅ Konrad Groh ⋅ Sezer Karaoglu ⋅ Theo Gevers ⋅ Anand Bhattad
ExHall D Poster #26
Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection Poster Session 6
wenqiao Li ⋅ Yao Gu ⋅ Xintao Chen ⋅ Xiaohao Xu ⋅ Ming Hu ⋅ Xiaonan Huang ⋅ Yingna Wu
ExHall D Poster #408
Type-R: Automatically Retouching Typos for Text-to-Image Generation Poster Session 1
Wataru Shimoda ⋅ Naoto Inoue ⋅ Daichi Haraguchi ⋅ Hayato Mitani ⋅ Seiichi Uchida ⋅ Kota Yamaguchi
ExHall D Poster #248
HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting Poster Session 6
Xinpeng Liu ⋅ Zeyi Huang ⋅ Fumio Okura ⋅ Yasuyuki Matsushita
ExHall D Poster #54
Multi-Label Prototype Visual Spatial Search for Weakly Supervised Semantic Segmentation Poster Session 6
Songsong Duan ⋅ Xi Yang ⋅ Nannan Wang
ExHall D Poster #392
High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model Poster Session 1
Mingtao Guo ⋅ Guanyu Xing ⋅ Yanli Liu
ExHall D Poster #6
Overcoming Shortcut Problem in VLM for Robust Out-of-Distribution Detection Poster Session 3
Zhuo Xu ⋅ Xiang Xiang ⋅ Yifan Liang
ExHall D Poster #455
Divide and Conquer: Heterogeneous Noise Integration for Diffusion-based Adversarial Purification Poster Session 6
Gaozheng Pei ⋅ Shaojie Lyu ⋅ Gong Chen ⋅ Ke Ma ⋅ Qianqian Xu ⋅ Yingfei Sun ⋅ Qingming Huang
ExHall D Poster #298
PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors Poster Session 6
Guangshun Wei ⋅ Yuan Feng ⋅ Long Ma ⋅ Chen Wang ⋅ Yuanfeng Zhou ⋅ Changjian Li
ExHall D Poster #104
One-Step Event-Driven High-Speed Autofocus Poster Session 2
Yuhan Bao ⋅ Shaohua Gao ⋅ Wenyong Li ⋅ Kaiwei Wang
ExHall D Poster #75
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation Poster Session 3
Zhuguanyu Wu ⋅ Shihe Wang ⋅ Jiayi Zhang ⋅ Jiaxin Chen ⋅ Yunhong Wang
ExHall D Poster #405
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers Poster Session 2
Zhuguanyu Wu ⋅ Jiayi Zhang ⋅ Jiaxin Chen ⋅ Jinyang Guo ⋅ Di Huang ⋅ Yunhong Wang
ExHall D Poster #411
Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows Poster Session 6
Shentong Mo ⋅ Yibing Song
ExHall D Poster #261
Effective Cloud Removal for Remote Sensing Images by an Improved Mean-Reverting Denoising Model with Elucidated Design Space Poster Session 4
Yi Liu ⋅ Wengen Li ⋅ Jihong Guan ⋅ Shuigeng Zhou ⋅ Yichao Zhang
ExHall D Poster #195
V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents Poster Session 1
Zhengrong Yue ⋅ Shaobin Zhuang ⋅ Kunchang Li ⋅ Yanbo Ding ⋅ Yali Wang
ExHall D Poster #290
UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery Poster Session 4
Dianmo Sheng ⋅ Dongdong Chen ⋅ Zhentao Tan ⋅ Qiankun Liu ⋅ Qi Chu ⋅ Tao Gong ⋅ Bin Liu ⋅ Jing Han ⋅ Wenbin Tu ⋅ Shengwei Xu ⋅ Nenghai Yu
ExHall D Poster #419
Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression Poster Session 3
Zhenqi Dai ⋅ Ting Liu ⋅ Yanning Zhang
ExHall D Poster #48
ControlFace: Harnessing Facial Parametric Control for Face Rigging Poster Session 2
Wooseok Jang ⋅ Youngjun Hong ⋅ Geonho Cha ⋅ Seungryong Kim
ExHall D Poster #16
FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance Poster Session 1
Dian Shao ⋅ Mingfei Shi ⋅ Shengda Xu ⋅ Haodong Chen ⋅ Yongle Huang ⋅ Binglu Wang
ExHall D Poster #161
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models Poster Session 6
Wentao Qu ⋅ Jing Wang ⋅ Yongshun Gong ⋅ Xiaoshui Huang ⋅ Liang Xiao
ExHall D Poster #112
RENO: Real-Time Neural Compression for 3D LiDAR Point Clouds Poster Session 5
Kang You ⋅ Tong Chen ⋅ Dandan Ding ⋅ M. Salman Asif ⋅ Zhan Ma
ExHall D Poster #107
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting Poster Session 4
Jiaxin Zhang ⋅ Junjun Jiang ⋅ Youyu Chen ⋅ Kui Jiang ⋅ Xianming Liu
ExHall D Poster #337
BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing Poster Session 4
Yunqi Gu ⋅ Ian Huang ⋅ Jihyeon Je ⋅ Guandao Yang ⋅ Leonidas Guibas
ExHall D Poster #267
LaVin-DiT: Large Vision Diffusion Transformer Poster Session 4
Zhaoqing Wang ⋅ Xiaobo Xia ⋅ Runnan Chen ⋅ Dongdong Yu ⋅ Changhu Wang ⋅ Mingming Gong ⋅ Tongliang Liu
ExHall D Poster #406
A Simple Data Augmentation for Feature Distribution Skewed Federated Learning Poster Session 5
Yunlu Yan ⋅ Huazhu Fu ⋅ Yuexiang Li ⋅ Jinheng Xie ⋅ Jun Ma ⋅ Guang Yang ⋅ Lei Zhu
ExHall D Poster #451
Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline Poster Session 2
Yuzhi Huang ⋅ Chenxin Li ⋅ Haitao Zhang ⋅ Zixu Lin ⋅ yunlong lin ⋅ Hengyu Liu ⋅ Wuyang Li ⋅ Xinyu Liu ⋅ Jiechao Gao ⋅ Yue Huang ⋅ Xinghao Ding ⋅ Yixuan Yuan
ExHall D Poster #317
Diffusion Self-Distillation for Zero-Shot Customized Image Generation Poster Session 4
Shengqu Cai ⋅ Eric Ryan Chan ⋅ Yunzhi Zhang ⋅ Leonidas Guibas ⋅ Jiajun Wu ⋅ Gordon Wetzstein
ExHall D Poster #254
Soft Self-labeling and Potts Relaxations for Weakly-supervised Segmentation Poster Session 4
Zhongwen Zhang ⋅ Yuri Boykov
ExHall D Poster #423
TopNet: Transformer-Efficient Occupancy Prediction Network for Octree-Structured Point Cloud Geometry Compression Poster Session 6
Xinjie Wang ⋅ Yifan Zhang ⋅ Ting Liu ⋅ Xinpu Liu ⋅ Ke Xu ⋅ Jianwei Wan ⋅ Yulan Guo ⋅ Hanyun Wang
ExHall D Poster #110
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction Poster Session 5
Rui Qian ⋅ Shuangrui Ding ⋅ Xiaoyi Dong ⋅ Pan Zhang ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Dahua Lin ⋅ Jiaqi Wang
ExHall D Poster #289
HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion Poster Session 2
Yifang Xu ⋅ BenXiang Zhai ⋅ Yunzhuo Sun ⋅ Ming Li ⋅ Yang Li ⋅ Sidan Du
ExHall D Poster #17
ZeroVO: Visual Odometry with Minimal Assumptions Poster Session 4
Lei Lai ⋅ Zekai Yin ⋅ Eshed Ohn-Bar
ExHall D Poster #122
Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation Poster Session 3
Zelin Peng ⋅ Zhengqin Xu ⋅ Zhilin Zeng ⋅ Yu Huang ⋅ Yaoming Wang ⋅ Wei Shen
ExHall D Poster #417
Domain Generalization in CLIP via Learning with Diverse Text Prompts Poster Session 2
Changsong Wen ⋅ Zelin Peng ⋅ Yu Huang ⋅ Xiaokang Yang ⋅ Wei Shen
ExHall D Poster #399
HVI: A New Color Space for Low-light Image Enhancement Poster Session 2
Qingsen Yan ⋅ Yixu Feng ⋅ Cheng Zhang ⋅ Guansong Pang ⋅ Kangbiao Shi ⋅ Peng Wu ⋅ Wei Dong ⋅ Jinqiu Sun ⋅ Yanning Zhang
ExHall D Poster #22
ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression Poster Session 2
Wei Jiang ⋅ Junru Li ⋅ Kai Zhang ⋅ Li zhang
ExHall D Poster #188
Learning Dynamic Collaborative Network for Semi-supervised 3D Vessel Segmentation Poster Session 2
Jiao Xu ⋅ Xin Chen ⋅ Lihe Zhang
ExHall D Poster #483
Weakly Supervised Contrastive Adversarial Training for Learning Robust Features from Semi-supervised Data Poster Session 5
Lilin Zhang ⋅ Chengpei Wu ⋅ Ning Yang
ExHall D Poster #448
Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model Poster Session 2
Shengjun Zhang ⋅ Jinzhao Li ⋅ Xin Fei ⋅ Hao Liu ⋅ Yueqi Duan
ExHall D Poster #62
ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models Poster Session 1
Junzhe Chen ⋅ Tianshu Zhang ⋅ Shiyu Huang ⋅ Yuwei Niu ⋅ Linfeng Zhang ⋅ Lijie Wen ⋅ Xuming Hu
ExHall D Poster #386
UMFN: Unified Multi-Domain Face Normalization for Joint Cross-domain Prototype Learning and Heterogeneous Face Recognition Poster Session 6
Meng Pang ⋅ Wenjun Zhang ⋅ Nanrun Zhou ⋅ Shengbo Chen ⋅ Hong Rao
ExHall D Poster #301
Graph-Embedded Structure-Aware Perceptual Hashing for Neural Network Protection and Piracy Detection Poster Session 4
Ruiheng Liu ⋅ Haozhe Chen ⋅ Boyao Zhao ⋅ Kejiang Chen ⋅ Weiming Zhang
ExHall D Poster #416
ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model Poster Session 6
Shunlin Lu ⋅ Jingbo Wang ⋅ Zeyu Lu ⋅ Ling-Hao Chen ⋅ Wenxun Dai ⋅ Junting Dong ⋅ Zhiyang Dou ⋅ Bo Dai ⋅ Ruimao Zhang
ExHall D Poster #162
PoseTraj: Pose-Aware Trajectory Control in Video Diffusion Poster Session 5
longbin ji ⋅ Lei Zhong ⋅ Pengfei Wei ⋅ Changjian Li
ExHall D Poster #163
UCM-VeID V2: A Richer Dataset and A Pre-training Method for UAV Cross-Modality Vehicle Re-Identification Poster Session 5
Xingyue Liu ⋅ Jiahao Qi ⋅ Chen Chen ⋅ Kangcheng Bin ⋅ Ping Zhong
ExHall D Poster #118
StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts Poster Session 6
Zhaoxing Gan ⋅ Mengtian Li ⋅ Ruhua Chen ⋅ Zhongxia JI ⋅ Sichen Guo ⋅ Huanling Hu ⋅ Guangnan Ye ⋅ Zuo Hu
ExHall D Poster #242
Less is More: Efficient Model Merging with Binary Task Switch Poster Session 3
Biqing Qi ⋅ Fangyuan Li ⋅ Zhen Wang ⋅ Junqi Gao ⋅ Dong Li ⋅ Peng Ye ⋅ Bowen Zhou
ExHall D Poster #442
Wav2Sem: Plug-and-Play Audio Semantic Decoupling for 3D Speech-Driven Facial Animation Poster Session 1
Hao Li ⋅ Ju Dai ⋅ Xin Zhao ⋅ Feng Zhou ⋅ Junjun Pan ⋅ Lei Li
ExHall D Poster #2
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer Poster Session 5
Jiahao Cui ⋅ Hui Li ⋅ Qingkun Su ⋅ Hanlin Shang ⋅ Kaihui Cheng ⋅ Yuqi Ma ⋅ Shan Mu ⋅ Hang Zhou ⋅ Jingdong Wang ⋅ Siyu Zhu
ExHall D Poster #4
Forensics Adapter: Adapting CLIP for Generalizable Face Forgery Detection Poster Session 4
Xinjie Cui ⋅ Yuezun Li ⋅ Ao Luo ⋅ Jiaran Zhou ⋅ Junyu Dong
ExHall D Poster #325
GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting Poster Session 4
Zixuan Chen ⋅ Guangcong Wang ⋅ Jiahao Zhu ⋅ Jianhuang Lai ⋅ Xiaohua Xie
ExHall D Poster #45
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Poster Session 6
Hao Li ⋅ Changyao TIAN ⋅ Jie Shao ⋅ Xizhou Zhu ⋅ Zhaokai Wang ⋅ Jinguo Zhu ⋅ Wenhan Dou ⋅ Xiaogang Wang ⋅ Hongsheng Li ⋅ Lewei Lu ⋅ Jifeng Dai
ExHall D Poster #347
Continual SFT Matches Multimodal RLHF with Negative Supervision Poster Session 3
Ke Zhu ⋅ Yu Wang ⋅ Yanpeng Sun ⋅ Qiang Chen ⋅ Jiang-Jiang Liu ⋅ gang zhang ⋅ Jingdong Wang
ExHall D Poster #380
Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation Poster Session 2
Kang Liu ⋅ Zhuoqi Ma ⋅ Xiaolu Kang ⋅ Yunan Li ⋅ Kun XIE ⋅ Zhicheng Jiao ⋅ Qiguang Miao
ExHall D Poster #474
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward Poster Session 3
Zhiwei Jia ⋅ Yuesong Nan ⋅ Huixi Zhao ⋅ Gengdai Liu
ExHall D Poster #216
FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video Poster Session 2
Jiawei Zhang ⋅ Zijian Wu ⋅ Zhiyang Liang ⋅ Yicheng Gong ⋅ Dongfang Hu ⋅ Yao Yao ⋅ Xun Cao ⋅ Hao Zhu
ExHall D Poster #8
LMO: Linear Mamba Operator for MRI Reconstruction Poster Session 1
Wei Li ⋅ jiawei jiang ⋅ Jie Wu ⋅ Kaihao Yu ⋅ Jianwei Zheng
ExHall D Poster #473
Mimir: Improving Video Diffusion Models for Precise Text Understanding Poster Session 5
Shuai Tan ⋅ Biao Gong ⋅ Yutong Feng ⋅ Kecheng Zheng ⋅ DanDan Zheng ⋅ Shuwei Shi ⋅ Yujun Shen ⋅ Jingdong Chen ⋅ Ming Yang
ExHall D Poster #283
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Poster Session 2
Yuhao Dong ⋅ Zuyan Liu ⋅ Hai-Long Sun ⋅ Jingkang Yang ⋅ Winston Hu ⋅ Yongming Rao ⋅ Ziwei Liu
ExHall D Poster #353
WildAvatar: Learning In-the-wild 3D Avatars from the Web Poster Session 4
Zihao Huang ⋅ Shoukang Hu ⋅ Guangcong Wang ⋅ Tianqi Liu ⋅ Yuhang Zang ⋅ Zhiguo Cao ⋅ Wei Li ⋅ Ziwei Liu
ExHall D Poster #10
MAGE: Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model Poster Session 3
Haoyuan Wang ⋅ Zhenwei Wang ⋅ Xiaoxiao Long ⋅ Cheng Lin ⋅ Gerhard Hancke ⋅ Rynson W.H. Lau
ExHall D Poster #32
Decoupled Motion Expression Video Segmentation Poster Session 3
Hao Fang ⋅ Runmin Cong ⋅ Xiankai Lu ⋅ Xiaofei Zhou ⋅ Sam Kwong ⋅ Wei Zhang
ExHall D Poster #303
Incremental Object Keypoint Learning Poster Session 5
Mingfu Liang ⋅ Jiahuan Zhou ⋅ Xu Zou ⋅ Ying Wu
ExHall D Poster #416
Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion Poster Session 6
Zhiqiang Yan ⋅ Zhengxue Wang ⋅ Kun Wang ⋅ Jun Li ⋅ Jian Yang
ExHall D Poster #76
StableAnimator: High-Quality Identity-Preserving Human Image Animation Poster Session 5
Shuyuan Tu ⋅ Zhen Xing ⋅ Xintong Han ⋅ Zhi-Qi Cheng ⋅ Qi Dai ⋅ Chong Luo ⋅ Zuxuan Wu
ExHall D Poster #5
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling Poster Session 2
Jian Yang ⋅ Dacheng Yin ⋅ Yizhou Zhou ⋅ Fengyun Rao ⋅ Wei Zhai ⋅ Yang Cao ⋅ Zheng-Jun Zha
ExHall D Poster #249
Boost the Inference with Co-training: A Depth-guided Mutual Learning Framework for Semi-supervised Medical Polyp Segmentation Poster Session 2
Yuxin Li ⋅ Zihao Zhu ⋅ Yuxiang Zhang ⋅ Yifan Chen ⋅ Zhibin Yu
ExHall D Poster #478
Understanding Multi-Task Activities from Single-Task Videos Poster Session 4
Yuhan Shen ⋅ Ehsan Elhamifar
ExHall D Poster #317
Interleaved-Modal Chain-of-Thought Poster Session 4
Jun Gao ⋅ Yongqi Li ⋅ Ziqiang Cao ⋅ Wenjie Li
ExHall D Poster #354
SLADE: Shielding against Dual Exploits in Large Vision-Language Models Poster Session 5
Md Zarif Hossain ⋅ AHMED IMTEAJ
ExHall D Poster #308
VideoDirector: Precise Video Editing via Text-to-Video Models Poster Session 1
Yukun Wang ⋅ Longguang Wang ⋅ Zhiyuan Ma ⋅ Qibin Hu ⋅ Kai Xu ⋅ Yulan Guo
ExHall D Poster #232
GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding Poster Session 1
Yuki Kawana ⋅ Shintaro Shiba ⋅ Quan Kong ⋅ Norimasa Kobori
ExHall D Poster #279
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding Poster Session 3
Zichen Liu ⋅ Kunlun Xu ⋅ Bing Su ⋅ Xu Zou ⋅ Yuxin Peng ⋅ Jiahuan Zhou
ExHall D Poster #299
Improving the Training of Data-Efficient GANs via Quality Aware Dynamic Discriminator Rejection Sampling Poster Session 6
Zhaoyu Zhang ⋅ Yang Hua ⋅ Guanxiong Sun ⋅ Hui Wang ⋅ Seán F. McLoone
ExHall D Poster #434
Believing is Seeing: Unobserved Object Detection using Generative Models Poster Session 4
Subhransu S. Bhattacharjee ⋅ Dylan Campbell ⋅ Rahul Shome
ExHall D Poster #340
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget Poster Session 6
Vikash Sehwag ⋅ Xianghao Kong ⋅ Jingtao Li ⋅ Michael Spranger ⋅ Lingjuan Lyu
ExHall D Poster #232
Guiding Human-Object Interactions with Rich Geometry and Relations Poster Session 5
Mengqing Xue ⋅ Yifei Liu ⋅ Ling Guo ⋅ Shaoli Huang ⋅ Changxing Ding
ExHall D Poster #157
TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion Poster Session 3
Yiran Wang ⋅ Jiaqi Li ⋅ Chaoyi Hong ⋅ Ruibo Li ⋅ Liusheng Sun ⋅ Xiao Song ⋅ Zhe Wang ⋅ Zhiguo Cao ⋅ Guosheng Lin
ExHall D Poster #110
Physical Plausibility-aware Trajectory Prediction via Locomotion Embodiment Poster Session 3
Hiromu Taketsugu ⋅ Takeru Oba ⋅ Takahiro Maeda ⋅ Shohei Nobuhara ⋅ Norimichi Ukita
ExHall D Poster #160
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition Poster Session 2
Jiawei Lin ⋅ Shizhao Sun ⋅ Danqing Huang ⋅ Ting Liu ⋅ Ji Li ⋅ Jiang Bian
ExHall D Poster #263
Masking meets Supervision: A Strong Learning Alliance Poster Session 4
Byeongho Heo ⋅ Taekyung Kim ⋅ Sangdoo Yun ⋅ Dongyoon Han
ExHall D Poster #442
DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation Poster Session 3
Wang Zhao ⋅ Yan-Pei Cao ⋅ Jiale Xu ⋅ Yue-Jiang Dong ⋅ Ying Shan
ExHall D Poster #39
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models Poster Session 1
Matt Deitke ⋅ Christopher Clark ⋅ Sangho Lee ⋅ Rohun Tripathi ⋅ Yue Yang ⋅ Jae Sung Park ⋅ Reza Salehi ⋅ Niklas Muennighoff ⋅ Kyle Lo ⋅ Luca Soldaini ⋅ Jiasen Lu ⋅ Taira Anderson ⋅ Erin Bransom ⋅ Kiana Ehsani ⋅ Huong Ngo ⋅ Yen-Sung Chen ⋅ Ajay Patel ⋅ Mark Yatskar ⋅ Chris Callison-Burch ⋅ Andrew Head ⋅ Rose Hendrix ⋅ Favyen Bastani ⋅ Eli VanderBilt ⋅ Nathan Lambert ⋅ Yvonne Chou ⋅ Arnavi Chheda-Kothary ⋅ Jenna Sparks ⋅ Sam Skjonsberg ⋅ Michael Schmitz ⋅ Aaron Sarnat ⋅ Byron Bischoff ⋅ Pete Walsh ⋅ Christopher Newell ⋅ Piper Wolters ⋅ Tanmay Gupta ⋅ Kuo-Hao Zeng ⋅ Jon Borchardt ⋅ Dirk Groeneveld ⋅ Crystal Nam ⋅ Sophie Lebrecht ⋅ Caitlin Wittlif ⋅ Carissa Schoenick ⋅ Oscar Michel ⋅ Ranjay Krishna ⋅ Luca Weihs ⋅ Noah A. Smith ⋅ Hannaneh Hajishirzi ⋅ Ross Girshick ⋅ Ali Farhadi ⋅ Aniruddha Kembhavi
ExHall D Poster #370
DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving Poster Session 4
Zhenhua Xu ⋅ Yan Bai ⋅ Yujia Zhang ⋅ Zhuoling Li ⋅ Fei Xia ⋅ Kwan-Yee K. Wong ⋅ Jianqiang Wang ⋅ Hengshuang Zhao
ExHall D Poster #138
High-Fidelity Lightweight Mesh Reconstruction from Point Clouds Poster Session 3
Chen Zhang ⋅ Wentao Wang ⋅ Ximeng Li ⋅ Xinyao Liao ⋅ Wanjuan Su ⋅ Wenbing Tao
ExHall D Poster #105
ORIDa: Object-centric Real-world Image Composition Dataset Poster Session 1
Jinwoo Kim ⋅ Sangmin Han ⋅ Jinho Jeong ⋅ Jiwoo Choi ⋅ Dongyoung Kim ⋅ Seon Joo Kim
ExHall D Poster #276
OSDFace: One-Step Diffusion Model for Face Restoration Poster Session 3
Jingkai Wang ⋅ Jue Gong ⋅ Lin Zhang ⋅ Zheng Chen ⋅ Xing Liu ⋅ Hong Gu ⋅ Yutong Liu ⋅ Yulun Zhang ⋅ Xiaokang Yang
ExHall D Poster #189
Task Singular Vectors: Reducing Task Interference in Model Merging Poster Session 4
Antonio Andrea Gargiulo ⋅ Donato Crisostomi ⋅ Maria Sofia Bucarelli ⋅ Simone Scardapane ⋅ Fabrizio Silvestri ⋅ Emanuele Rodolà
ExHall D Poster #278
Dragin3D: Image Editing by Dragging in 3D Space Poster Session 5
Weiran Guang ⋅ Xiaoguang Gu ⋅ Mengqi Huang ⋅ Zhendong Mao
ExHall D Poster #43
DIFFER: Disentangling Identity Features via Semantic Cues for Clothes-Changing Person Re-ID Poster Session 3
Xin Liang ⋅ Yogesh S. Rawat
ExHall D Poster #318
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning Poster Session 2
Di Zhang ⋅ Jingdi Lei ⋅ Junxian Li ⋅ Xunzhi Wang ⋅ Yujie Liu ⋅ Zonglin Yang ⋅ Jiatong LI ⋅ Weida Wang ⋅ Suorong Yang ⋅ Jianbo Wu ⋅ Peng Ye ⋅ Wanli Ouyang ⋅ Dongzhan Zhou
ExHall D Poster #352
Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather Poster Session 1
Longyu Yang ⋅ Ping Hu ⋅ Shangbo Yuan ⋅ Lu Zhang ⋅ Jun Liu ⋅ Heng Tao Shen ⋅ Xiaofeng Zhu
ExHall D Poster #117
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model Poster Session 1
Yingmao Miao ⋅ Zhanpeng Huang ⋅ Rui Han ⋅ Zibin Wang ⋅ Chenhao Lin ⋅ Chao Shen
ExHall D Poster #18
Make It Count: Text-to-Image Generation with an Accurate Number of Objects Poster Session 3
Lital Binyamin ⋅ Yoad Tewel ⋅ Hilit Segev ⋅ Eran Hirsch ⋅ Royi Rassin ⋅ Gal Chechik
ExHall D Poster #247
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios Poster Session 1
Ziming Huang ⋅ Xurui Li ⋅ Haotian Liu ⋅ Feng Xue ⋅ Yuzhe Wang ⋅ Yu Zhou
ExHall D Poster #439
Task-Specific Gradient Adaptation for Few-Shot One-Class Classification Poster Session 6
Yunlong Li ⋅ Xiabi Liu ⋅ Liyuan Pan ⋅ Yuchen Ren
ExHall D Poster #422
ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration Poster Session 2
Johan Edstedt ⋅ André Mateus ⋅ Alberto Jaenal
ExHall D Poster #115
RoadSocial: A Diverse VideoQA Dataset and Benchmark for Road Event Understanding from Social Video Narratives Poster Session 4
Chirag Parikh ⋅ Deepti Rawat ⋅ Rakshitha R. T. ⋅ Tathagata Ghosh ⋅ Ravi Kiran Sarvadevabhatla
ExHall D Poster #306
Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild Poster Session 1
Damien Teney ⋅ Liangze Jiang ⋅ Florin Gogianu ⋅ Ehsan Abbasnejad
ExHall D Poster #396
MangaNinja: Line Art Colorization with Precise Reference Following Poster Session 2
Zhiheng Liu ⋅ Ka Leong Cheng ⋅ Xi Chen ⋅ Jie Xiao ⋅ Hao Ouyang ⋅ Kai Zhu ⋅ Yu Liu ⋅ Yujun Shen ⋅ Qifeng Chen ⋅ Ping Luo
ExHall D Poster #21
ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation Poster Session 3
Ling-An Zeng ⋅ Guohong Huang ⋅ Yi-Lin Wei ⋅ Shengbo Gu ⋅ Yu-Ming Tang ⋅ Jingke Meng ⋅ Wei-Shi Zheng
ExHall D Poster #163
CLOC: Contrastive Learning for Ordinal Classification with Multi-Margin N-pair Loss Poster Session 3
Dileepa Pitawela ⋅ Gustavo Carneiro ⋅ Hsiang-Ting Chen
ExHall D Poster #468
Universal Actions for Enhanced Embodied Foundation Models Poster Session 5
Jinliang Zheng ⋅ Jianxiong Li ⋅ Dongxiu Liu ⋅ Yinan Zheng ⋅ Zhihao Wang ⋅ Zhonghong Ou ⋅ Yu Liu ⋅ Jingjing Liu ⋅ Ya-Qin Zhang ⋅ Xianyuan Zhan
ExHall D Poster #138
ObjectMover: Generative Object Movement with Video Prior Poster Session 4
Xin Yu ⋅ Tianyu Wang ⋅ Soo Ye Kim ⋅ Paul Guerrero ⋅ Xi Chen ⋅ Qing Liu ⋅ Zhe Lin ⋅ Xiaojuan Qi
ExHall D Poster #179
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking Poster Session 5
Junxi Chen ⋅ Junhao Dong ⋅ Xiaohua Xie
ExHall D Poster #265
Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues Poster Session 2
Youngjoon Jang ⋅ Haran Raajesh ⋅ Liliane Momeni ⋅ Gul Varol ⋅ Andrew Zisserman
ExHall D Poster #322
Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis Poster Session 5
Hua Yu ⋅ Weiming Liu ⋅ Gui Xu ⋅ Yaqing Hou ⋅ Yew-Soon Ong ⋅ Qiang Zhang
ExHall D Poster #158
GPAvatar: High-fidelity Head Avatars by Learning Efficient Gaussian Projections Poster Session 1
Weiqi Feng ⋅ Dong Han ⋅ Zekang Zhou ⋅ Shunkai Li ⋅ Xiaoqiang Liu ⋅ Pengfei Wan ⋅ Di ZHANG ⋅ Miao Wang
ExHall D Poster #8
PIAD: Pose and Illumination agnostic Anomaly Detection Poster Session 1
Kaichen Yang ⋅ Junjie Cao ⋅ Zeyu Bai ⋅ Zhixun Su ⋅ Andrea Tagliasacchi
ExHall D Poster #437
VISTREAM: Improving Computation Efficiency of Visual Streaming Perception via Law-of-Charge-Conservation Inspired Spiking Neural Network Poster Session 2
Kang You ⋅ Ziling Wei ⋅ Jing Yan ⋅ Boning Zhang ⋅ Qinghai Guo ⋅ Yaoyu Zhang ⋅ Zhezhi He
ExHall D Poster #327
PMNI: Pose-free Multi-view Normal Integration for Reflective and Textureless Surface Reconstruction Poster Session 6
Mingzhi Pei ⋅ Xu Cao ⋅ Xiangyi Wang ⋅ Heng Guo ⋅ Zhanyu Ma
ExHall D Poster #66
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Poster Session 4
Siyuan Li ⋅ Luyuan Zhang ⋅ Zedong Wang ⋅ Juanxi Tian ⋅ Cheng Tan ⋅ Zicheng Liu ⋅ Chang Yu ⋅ Qingsong Xie ⋅ Haonan Lu ⋅ Haoqian Wang ⋅ Zhen Lei
ExHall D Poster #373
Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning Poster Session 5
Zichen Tian ⋅ Yaoyao Liu ⋅ Qianru Sun
ExHall D Poster #188
EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision Poster Session 6
Yiming Zhao ⋅ Taein Kwon ⋅ Paul Streli ⋅ Marc Pollefeys ⋅ Christian Holz
ExHall D Poster #149
Mind the Gap: Detecting Black-box Adversarial Attacks in the Making through Query Update Analysis Poster Session 2
Jeonghwan Park ⋅ Niall McLaughlin ⋅ Ihsen Alouani
ExHall D Poster #463
Consistent Normal Orientation for 3D Point Clouds via Least Squares on Delaunay Graph Poster Session 4
Rao Fu ⋅ Jianmin Zheng ⋅ Liang Yu
ExHall D Poster #107
ICP: Immediate Compensation Pruning for Mid-to-high Sparsity Poster Session 2
Xin Luo ⋅ Fu Xueming ⋅ Zihang Jiang ⋅ S Kevin Zhou
ExHall D Poster #392
Optimizing for the Shortest Path in Denoising Diffusion Model Poster Session 4
Ping Chen ⋅ Xingpeng Zhang ⋅ Zhaoxiang Liu ⋅ Huan Hu ⋅ Xiang Liu ⋅ Kai Wang ⋅ Min Wang ⋅ Yanlin Qian ⋅ Shiguo Lian
ExHall D Poster #211
Teaching Large Language Models to Regress Accurate Image Quality Scores Using Score Distribution Poster Session 3
Zhiyuan You ⋅ Xin Cai ⋅ Jinjin Gu ⋅ Tianfan Xue ⋅ Chao Dong
ExHall D Poster #366
v-CLR: View-Consistent Learning for Open-World Instance Segmentation Poster Session 4
Chang-Bin Zhang ⋅ Jinhong Ni ⋅ Yujie Zhong ⋅ Kai Han
ExHall D Poster #429
Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference Poster Session 2
Hao Yin ⋅ Guangzong Si ⋅ Zilei Wang
ExHall D Poster #382
Prototype-Based Image Prompting for Weakly Supervised Histopathological Image Segmentation Poster Session 6
Qingchen Tang ⋅ Lei Fan ⋅ Maurice Pagnucco ⋅ Yang Song
ExHall D Poster #395
HOP: Heterogeneous Topology-based Multimodal Entanglement for Co-Speech Gesture Generation Poster Session 1
Hongye Cheng ⋅ Tianyu Wang ⋅ guangsi shi ⋅ Zexing Zhao ⋅ Yanwei Fu
ExHall D Poster #69
Minority-Focused Text-to-Image Generation via Prompt Optimization Poster Session 5
Soobin Um ⋅ Jong Chul Ye
ExHall D Poster #243
Less Attention is More: Prompt Transformer for Generalized Category Discovery Poster Session 6
Wei Zhang ⋅ Baopeng Zhang ⋅ Zhu Teng ⋅ Wenxin Luo ⋅ Junnan Zou ⋅ Jianping Fan
ExHall D Poster #400
Imputation-free and Alignment-free: Incomplete Multi-view Clustering Driven by Consensus Semantic Learning Poster Session 1
yuzhuo dai ⋅ Jiaqi Jin ⋅ Zhibin Dong ⋅ Siwei Wang ⋅ Xinwang Liu ⋅ En Zhu ⋅ Xihong Yang ⋅ Xinbiao Gan ⋅ Yu Feng
ExHall D Poster #469
Sensitivity-Aware Efficient Fine-Tuning via Compact Dynamic-Rank Adaptation Poster Session 2
Tianran Chen ⋅ Jiarui Chen ⋅ Baoquan Zhang ⋅ Zhehao Yu ⋅ Shidong Chen ⋅ Rui Ye ⋅ Xutao Li ⋅ Yunming Ye
ExHall D Poster #408
MoDec-GS: Global-to-Local Motion Decomposition and Temporal Interval Adjustment for Compact Dynamic 3D Gaussian Splatting Poster Session 3
Sangwoon Kwak ⋅ Joonsoo Kim ⋅ Jun Young Jeong ⋅ Won-Sik Cheong ⋅ Jihyong Oh ⋅ Munchurl Kim
ExHall D Poster #65
CADRef: Robust Out-of-Distribution Detection via Class-Aware Decoupled Relative Feature Leveraging Poster Session 1
Zhiwei Ling ⋅ Yachen Chang ⋅ Hailiang Zhao ⋅ Xinkui Zhao ⋅ Kingsum Chow ⋅ Shuiguang Deng
ExHall D Poster #459
DaCapo: Score Distillation as Stacked Bridge for Fast and High-quality 3D Editing Poster Session 4
Yufei Huang ⋅ Bangyan Liao ⋅ Yuqi Hu ⋅ Haitao Lin ⋅ Lirong Wu ⋅ Siyuan Li ⋅ Cheng Tan ⋅ Zicheng Liu ⋅ Yunfan Liu ⋅ Zelin Zang ⋅ Chang Yu ⋅ Zhen Lei
ExHall D Poster #43
Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances Poster Session 4
Yi Yu ⋅ Botao Ren ⋅ Peiyuan Zhang ⋅ Mingxin Liu ⋅ Junwei Luo ⋅ Shaofeng Zhang ⋅ Feipeng Da ⋅ Junchi Yan ⋅ Xue Yang
ExHall D Poster #332
A Selective Re-learning Mechanism for Hyperspectral Fusion Imaging Poster Session 2
Yuanye Liu ⋅ jinyang liu ⋅ Renwei Dian ⋅ Shutao Li
ExHall D Poster #198
Autoregressive Sequential Pretraining for Visual Tracking Poster Session 2
Shiyi Liang ⋅ Yifan Bai ⋅ Yihong Gong ⋅ Xing Wei
ExHall D Poster #181
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images Poster Session 5
Kazi Sajeed Mehrab ⋅ M. Maruf ⋅ Arka Daw ⋅ Abhilash Neog ⋅ Harish Babu Manogaran ⋅ Mridul Khurana ⋅ Zhenyang Feng ⋅ Bahadir Altintas ⋅ Yasin Bakis ⋅ Elizabeth Campolongo ⋅ Matthew Thompson ⋅ Xiaojun Wang ⋅ Hilmar Lapp ⋅ Tanya Berger-Wolf ⋅ Paula Mabee ⋅ Henry Bart ⋅ Wei-Lun Chao ⋅ Wasla Dahdul ⋅ Anuj Karpatne
ExHall D Poster #311
Number it: Temporal Grounding Videos like Flipping Manga Poster Session 3
Yongliang Wu ⋅ Xinting Hu ⋅ Yuyang Sun ⋅ Yizhou Zhou ⋅ Wenbo Zhu ⋅ Fengyun Rao ⋅ Bernt Schiele ⋅ Xu Yang
ExHall D Poster #297
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion Poster Session 4
Xiaomeng Chu ⋅ Jiajun Deng ⋅ Guoliang You ⋅ Yifan Duan ⋅ Houqiang Li ⋅ Yanyong Zhang
ExHall D Poster #121
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Poster Session 6
Tiehan Fan ⋅ Kepan Nan ⋅ Rui Xie ⋅ Penghao Zhou ⋅ Zhenheng Yang ⋅ Chaoyou Fu ⋅ Xiang Li ⋅ Jian Yang ⋅ Ying Tai
ExHall D Poster #270
AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers Poster Session 3
Jiazhi Guan ⋅ Kaisiyuan Wang ⋅ Zhiliang Xu ⋅ Quanwei Yang ⋅ Yasheng SUN ⋅ Shengyi He ⋅ Borong Liang ⋅ Yukang Cao ⋅ Yingying Li ⋅ Haocheng Feng ⋅ Errui Ding ⋅ Jingdong Wang ⋅ Youjian Zhao ⋅ Hang Zhou ⋅ Ziwei Liu
ExHall D Poster #3
Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment Poster Session 6
Ziteng Cui ⋅ Xuangeng Chu ⋅ Tatsuya Harada
ExHall D Poster #27
Pathways on the Image Manifold: Image Editing via Video Generation Poster Session 2
Noam Rotstein ⋅ Gal Yona ⋅ Daniel Silver ⋅ Roy Velich ⋅ David Bensaid ⋅ Ron Kimmel
ExHall D Poster #238
EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering Poster Session 6
Toshiya Yura ⋅ Ashkan Mirzaei ⋅ Igor Gilitschenski
ExHall D Poster #70
3D Student Splatting and Scooping Poster Session 5
Jialin Zhu ⋅ Jiangbei Yue ⋅ Feixiang He ⋅ He Wang
ExHall D Poster #337
LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning Poster Session 6
Peng Wu ⋅ Xiankai Lu ⋅ Hao Hu ⋅ Yongqin Xian ⋅ Jianbing Shen ⋅ Wenguan Wang
ExHall D Poster #398
A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs Poster Session 4
Wangbo Zhao ⋅ Yizeng Han ⋅ Jiasheng Tang ⋅ Zhikai Li ⋅ Yibing Song ⋅ Kai Wang ⋅ Zhangyang Wang ⋅ Yang You
ExHall D Poster #382
No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather Poster Session 2
Junsung Park ⋅ HwiJeong Lee ⋅ Inha Kang ⋅ Hyunjung Shim
ExHall D Poster #126
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images Poster Session 3
Jie Mei ⋅ Chenyu Lin ⋅ Yu Qiu ⋅ Yaonan Wang ⋅ Hui Zhang ⋅ Ziyang Wang ⋅ Dong Dai
ExHall D Poster #479
MEET: Towards Memory-Efficient Temporal Sparse Deep Neural Networks Poster Session 6
Zeqi Zhu ⋅ Ibrahim Batuhan Akkaya ⋅ Luc Waeijen ⋅ Egor Bondarev ⋅ Arash Pourtaherian ⋅ Orlando Moreira
ExHall D Poster #302
Probabilistic Prompt Distribution Learning for Animal Pose Estimation Poster Session 6
Jiyong Rao ⋅ Brian Nlong Zhao ⋅ Yu Wang
ExHall D Poster #314
Reconstructing Humans with a Biomechanically Accurate Skeleton Poster Session 2
Yan Xia ⋅ Xiaowei Zhou ⋅ Etienne Vouga ⋅ Qixing Huang ⋅ Georgios Pavlakos
ExHall D Poster #91
AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Poster Session 2
Yuanbin Man ⋅ Ying Huang ⋅ Chengming Zhang ⋅ Bingzhe Li ⋅ Wei Niu ⋅ Miao Yin
ExHall D Poster #301
VGGT: Visual Geometry Grounded Transformer Poster Session 2
Jianyuan Wang ⋅ Minghao Chen ⋅ Nikita Karaev ⋅ Andrea Vedaldi ⋅ Christian Rupprecht ⋅ David Novotny
ExHall D Poster #86
Enhancing Facial Privacy Protection via Weakening Diffusion Purification Poster Session 2
Ali Salar ⋅ Qing Liu ⋅ Yingli Tian ⋅ Guoying Zhao
ExHall D Poster #273
Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space Poster Session 1
Zelin Peng ⋅ Zhengqin Xu ⋅ Zhilin Zeng ⋅ Changsong Wen ⋅ Yu Huang ⋅ Menglin Yang ⋅ feilong tang ⋅ Wei Shen
ExHall D Poster #421
TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting Poster Session 1
Bojun Xiong ⋅ Jialun Liu ⋅ JiaKui Hu ⋅ Chenming Wu ⋅ Jinbo Wu ⋅ Xing Liu ⋅ Chen Zhao ⋅ Errui Ding ⋅ Zhouhui Lian
ExHall D Poster #36
OSV: One Step is Enough for High-Quality Image to Video Generation Poster Session 3
Xiaofeng Mao ⋅ Zhengkai Jiang ⋅ Fu-Yun Wang ⋅ Jiangning Zhang ⋅ Hao Chen ⋅ Mingmin Chi ⋅ Yabiao Wang ⋅ Wenhan Luo
ExHall D Poster #185
DyMO: Training-Free Diffusion Model Alignment with Dynamic Multi-Objective Scheduling Poster Session 3
Xin Xie ⋅ Dong Gong
ExHall D Poster #245
HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation Poster Session 1
Yiming Liang ⋅ Tianhan Xu ⋅ Yuta Kikuchi
ExHall D Poster #67
Enhanced Visual-Semantic Interaction with Tailored Prompts for Pedestrian Attribute Recognition Poster Session 2
Junyi Wu ⋅ Yan Huang ⋅ Min Gao ⋅ Yuzhen Niu ⋅ Yuzhong Chen ⋅ Qiang Wu
ExHall D Poster #400
HD-EPIC: A Highly-Detailed Egocentric Video Dataset Poster Session 5
Toby Perrett ⋅ Ahmad Darkhalil ⋅ Saptarshi Sinha ⋅ Omar Emara ⋅ Sam Pollard ⋅ Kranti Kumar Parida ⋅ Kaiting Liu ⋅ Prajwal Gatti ⋅ Siddhant Bansal ⋅ Kevin Flanagan ⋅ Jacob Chalk ⋅ Zhifan Zhu ⋅ Rhodri Guerrier ⋅ Fahd Abdelazim ⋅ Bin Zhu ⋅ Davide Moltisanti ⋅ Michael Wray ⋅ Hazel Doughty ⋅ Dima Damen
ExHall D Poster #276
PolarFree: Polarization-based Reflection-Free Imaging Poster Session 3
Mingde Yao ⋅ Menglu Wang ⋅ King Man Tam ⋅ Lingen Li ⋅ Tianfan Xue ⋅ Jinwei Gu
ExHall D Poster #22
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis Poster Session 4
Jiapeng Zhu ⋅ Ceyuan Yang ⋅ Kecheng Zheng ⋅ Yinghao Xu ⋅ Zifan Shi ⋅ Yifei Zhang ⋅ Qifeng Chen ⋅ Yujun Shen
ExHall D Poster #250
H-MoRe: Learning Human-centric Motion Representation for Action Analysis Poster Session 5
Zhanbo Huang ⋅ Xiaoming Liu ⋅ Yu Kong
ExHall D Poster #156
LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion Poster Session 4
Muchen Li ⋅ Sammy Christen ⋅ Chengde Wan ⋅ Yujun Cai ⋅ Renjie Liao ⋅ Leonid Sigal ⋅ Shugao Ma
ExHall D Poster #155
Practical Solutions to the Relative Pose of Three Calibrated Cameras Poster Session 5
Charalambos Tzamos ⋅ Viktor Kocur ⋅ Yaqing Ding ⋅ Daniel Barath ⋅ Zuzana Berger Haladova ⋅ Torsten Sattler ⋅ Zuzana Kukelova
ExHall D Poster #82
GenAssets: Generating in-the-wild 3D Assets in Latent Space Poster Session 5
Ze Yang ⋅ Jingkang Wang ⋅ Haowei Zhang ⋅ Sivabalan Manivasagam ⋅ Yun Chen ⋅ Raquel Urtasun
ExHall D Poster #128
SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation Poster Session 2
Hritam Basak ⋅ Zhaozheng Yin
ExHall D Poster #423
3D-GSW: 3D Gaussian Splatting for Robust Watermarking Poster Session 2
Youngdong Jang ⋅ Hyunje Park ⋅ Feng Yang ⋅ Heeju Ko ⋅ Euijin Choo ⋅ Sangpil Kim
ExHall D Poster #47
Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch Poster Session 6
Aneeshan Sain ⋅ Subhajit Maity ⋅ Pinaki Nath Chowdhury ⋅ Subhadeep Koley ⋅ Ayan Kumar Bhunia ⋅ Yi-Zhe Song
ExHall D Poster #211
Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning Poster Session 4
Maosen Zhao ⋅ Pengtao Chen ⋅ Chong Yu ⋅ Yan Wen ⋅ Xudong Tan ⋅ Tao Chen
ExHall D Poster #222
Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves Poster Session 3
Shihan Wu ⋅ Ji Zhang ⋅ Pengpeng Zeng ⋅ Lianli Gao ⋅ Jingkuan Song ⋅ Heng Tao Shen
ExHall D Poster #390
DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Post-Capture Refocusing, Defocus Rendering and Blur Removal Poster Session 5
Yujie Wang ⋅ Praneeth Chakravarthula ⋅ Baoquan Chen
ExHall D Poster #24
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers Poster Session 5
Daiqing Qi ⋅ Handong Zhao ⋅ Jing Shi ⋅ Simon Jenni ⋅ Yifei Fan ⋅ Franck Dernoncourt ⋅ Scott Cohen ⋅ Sheng Li
ExHall D Poster #361
Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation Poster Session 6
Shuling Zhao ⋅ Fa-Ting Hong ⋅ Xiaoshui Huang ⋅ Dan Xu
ExHall D Poster #3
Coherent 3D Portrait Video Reconstruction via Triplane Fusion Poster Session 3
Shengze Wang ⋅ Xueting Li ⋅ Chao Liu ⋅ Matthew Chan ⋅ Michael Stengel ⋅ Henry Fuchs ⋅ Shalini De Mello ⋅ Koki Nagano
ExHall D Poster #6
BLADE: Single-view Body Mesh Estimation through Accurate Depth Estimation Poster Session 5
Shengze Wang ⋅ Jiefeng Li ⋅ Tianye Li ⋅ Ye Yuan ⋅ Henry Fuchs ⋅ Koki Nagano ⋅ Shalini De Mello ⋅ Michael Stengel
ExHall D Poster #90
GraphI2P: Image-to-Point Cloud Registration with Exploring Pattern of Correspondence via Graph Learning Poster Session 5
Lin Bie ⋅ Shouan Pan ⋅ Siqi Li ⋅ Yining Zhao ⋅ Yue Gao
ExHall D Poster #106
Can't Slow Me Down: Learning Robust and Hardware-Adaptive Object Detectors against Latency Attacks for Edge Devices Poster Session 4
Tianyi Wang ⋅ Zichen Wang ⋅ Cong Wang ⋅ Yuanchao Shu ⋅ Ruilong Deng ⋅ Peng Cheng ⋅ Jiming Chen
ExHall D Poster #327
SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation Poster Session 4
Jihuai Zhao ⋅ Junbao Zhuo ⋅ Jiansheng Chen ⋅ Huimin Ma
ExHall D Poster #336
Bridging Gait Recognition and Large Language Models Sequence Modeling Poster Session 1
Shaopeng Yang ⋅ Jilong Wang ⋅ Saihui Hou ⋅ Xu Liu ⋅ Chunshui Cao ⋅ Liang Wang ⋅ Yongzhen Huang
ExHall D Poster #314
CDI: Copyrighted Data Identification in Diffusion Models Poster Session 4
Jan Dubiński ⋅ Antoni Kowalczuk ⋅ Franziska Boenisch ⋅ Adam Dziedzic
ExHall D Poster #276
Robust Multimodal Survival Prediction with Conditional Latent Differentiation Variational AutoEncoder Poster Session 2
Junjie Zhou ⋅ Jiao Tang ⋅ Yingli Zuo ⋅ Peng Wan ⋅ Daoqiang Zhang ⋅ WEI SHAO
ExHall D Poster #477
Sim-to-Real Causal Transfer: A Metric Learning Approach to Causally-Aware Interaction Representations Poster Session 4
Ahmad Rahimi ⋅ Po-Chien Luan ⋅ Yuejiang Liu ⋅ Frano Rajič ⋅ Alex Alahi
ExHall D Poster #139
Zero-Shot Blind-spot Image Denoising via Implicit Neural Sampling Poster Session 2
Yuhui Quan ⋅ Tianxiang Zheng ⋅ Zhiyuan Ma ⋅ Hui Ji
ExHall D Poster #204
Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models Poster Session 6
Namhyuk Ahn ⋅ KiYoon Yoo ⋅ Wonhyuk Ahn ⋅ Daesik Kim ⋅ Seung-Hun Nam
ExHall D Poster #251
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity Poster Session 3
Huaxin Zhang ⋅ Xiaohao Xu ⋅ Xiang Wang ⋅ Jialong Zuo ⋅ Xiaonan Huang ⋅ Changxin Gao ⋅ Shanjun Zhang ⋅ Li Yu ⋅ Nong Sang
ExHall D Poster #305
Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models Poster Session 4
Jiaming Zhang ⋅ Junhong Ye ⋅ Xingjun Ma ⋅ Yige Li ⋅ Yunfan Yang ⋅ Yunhao Chen ⋅ Jitao Sang ⋅ Dit-Yan Yeung
ExHall D Poster #390
GENIUS: A Generative Framework for Universal Multimodal Search Poster Session 4
Sungyeon Kim ⋅ Xinliang Zhu ⋅ Xiaofan Lin ⋅ Muhammet Bastan ⋅ Douglas Gray ⋅ Suha Kwak
ExHall D Poster #367
Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers Poster Session 3
Quentin Guimard ⋅ Moreno D'Incà ⋅ Massimiliano Mancini ⋅ Elisa Ricci
ExHall D Poster #431
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models Poster Session 4
Chi-Pin Huang ⋅ Yen-Siang Wu ⋅ Hung-Kai Chung ⋅ Kai-Po Chang ⋅ Fu-En Yang ⋅ Yu-Chiang Frank Wang
ExHall D Poster #172
EBS-EKF: Accurate and High Frequency Event-based Star Tracking Poster Session 2
Albert Reed ⋅ Connor Hashemi ⋅ Dennis Melamed ⋅ Nitesh Menon ⋅ Keigo Hirakawa ⋅ Scott McCloskey
ExHall D Poster #108
GliaNet: Adaptive Neural Network Structure Learning with Glia-Driven Poster Session 5
Mengqiao Han ⋅ Liyuan Pan ⋅ Xiabi Liu
ExHall D Poster #401
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation Poster Session 3
Yongkang Li ⋅ Tianheng Cheng ⋅ Bin Feng ⋅ Wenyu Liu ⋅ Xinggang Wang
ExHall D Poster #416
MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving Poster Session 3
Zhi-Yuan Zhang ⋅ Xiaofan Li ⋅ Zhihao Xu ⋅ Wenjie Peng ⋅ Zijian Zhou ⋅ Miaojing Shi ⋅ Shuangping Huang
ExHall D Poster #139
Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation Poster Session 2
Jiho Choi ⋅ Seonho Lee ⋅ Minhyun Lee ⋅ Seungho Lee ⋅ Hyunjung Shim
ExHall D Poster #420
NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training Poster Session 2
Dar-Yen Chen ⋅ Hmrishav Bandyopadhyay ⋅ Kai Zou ⋅ Yi-Zhe Song
ExHall D Poster #218
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Poster Session 2
Yifan Pu ⋅ Yiming Zhao ⋅ Zhicong Tang ⋅ Ruihong Yin ⋅ Haoxing Ye ⋅ Yuhui Yuan ⋅ Dong Chen ⋅ Jianmin Bao ⋅ Sirui Zhang ⋅ Yanbin Wang ⋅ Lin Liang ⋅ Lijuan Wang ⋅ Ji Li ⋅ Xiu Li ⋅ Zhouhui Lian ⋅ Gao Huang ⋅ Baining Guo
ExHall D Poster #247
Rotation-Equivariant Self-Supervised Method in Image Denoising Poster Session 3
Hanze Liu ⋅ Jiahong Fu ⋅ Qi Xie ⋅ Deyu Meng
ExHall D Poster #198
GLane3D: Detecting Lanes with Graph of 3D Keypoints Poster Session 6
Halil İbrahim Öztürk ⋅ Muhammet Esat Kalfaoglu ⋅ Ozsel Kilinc
ExHall D Poster #129
Minimal Interaction Separated Tuning: A New Paradigm for Visual Adaptation Poster Session 5
Ningyuan Tang ⋅ Minghao Fu ⋅ Jianxin Wu
ExHall D Poster #398
Hardware-Rasterized Ray-Based Gaussian Splatting Poster Session 1
Samuel Rota Bulò ⋅ Lorenzo Porzi ⋅ Nemanja Bartolovic ⋅ Peter Kontschieder
ExHall D Poster #30
Generating Multimodal Driving Scenes via Next-Scene Prediction Poster Session 2
Yanhao Wu ⋅ Haoyang Zhang ⋅ Tianwei Lin ⋅ Alan Huang ⋅ Shujie Luo ⋅ Rui Wu ⋅ Congpei Qiu ⋅ Wei Ke ⋅ Tong Zhang
ExHall D Poster #141
BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting Poster Session 4
Jeongwan On ⋅ Kyeonghwan Gwak ⋅ Gunyoung Kang ⋅ Junuk Cha ⋅ Soohyun Hwang ⋅ Hyein Hwang ⋅ Seungryul Baek
ExHall D Poster #157
Language Guided Concept Bottleneck Models for Interpretable Continual Learning Poster Session 3
Lu Yu ⋅ HaoYu Han ⋅ Zhe Tao ⋅ Hantao Yao ⋅ Changsheng Xu
ExHall D Poster #414
Domain Adaptive Diabetic Retinopathy Grading with Model Absence and Flowing Data Poster Session 6
Wenxin Su ⋅ Song Tang ⋅ Xiaofeng Liu ⋅ Xiaojing Yi ⋅ Mao Ye ⋅ Chunxiao Zu ⋅ Jiahao Li ⋅ Xiatian Zhu
ExHall D Poster #207
Temporal Separation with Entropy Regularization for Knowledge Distillation in Spiking Neural Networks Poster Session 2
Kairong Yu ⋅ Chengting Yu ⋅ Tianqing Zhang ⋅ Xiaochen Zhao ⋅ Shu Yang ⋅ Hongwei Wang ⋅ Qiang Zhang ⋅ Qi Xu
ExHall D Poster #328
LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models Poster Session 6
Jian Liang ⋅ Wenke Huang ⋅ Guancheng Wan ⋅ Qu Yang ⋅ Mang Ye
ExHall D Poster #329
IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner Poster Session 2
Yuyang Huang ⋅ Yabo Chen ⋅ Li Ding ⋅ Xiaopeng Zhang ⋅ Wenrui Dai ⋅ Junni Zou ⋅ Hongkai Xiong ⋅ Qi Tian
ExHall D Poster #182
EigenGS Representation: From Eigenspace to Gaussian Image Space Poster Session 3
LO-WEI TAI ⋅ Ching-En Li ⋅ Cheng-Lin Chen ⋅ Chih-Jung Tsai ⋅ Hwann-Tzong Chen ⋅ Tyng-Luh Liu
ExHall D Poster #271
SmartCLIP: Modular Vision-language Alignment with Identification Guarantees Poster Session 6
Shaoan Xie ⋅ Lingjing Kong ⋅ Yujia Zheng ⋅ Yu Yao ⋅ Zeyu Tang ⋅ Eric P. Xing ⋅ Guangyi Chen ⋅ Kun Zhang
ExHall D Poster #348
UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection Poster Session 1
Xin Jin ⋅ Haisheng Su ⋅ Kai Liu ⋅ CONG MA ⋅ Wei Wu ⋅ Fei HUI ⋅ Junchi Yan
ExHall D Poster #115
MaSS13K: A Matting-level Semantic Segmentation Benchmark Poster Session 3
Chenxi Xie ⋅ Minghan LI ⋅ Hui Zeng ⋅ Jun Luo ⋅ Lei Zhang
ExHall D Poster #325
EntropyMark: Towards More Harmless Backdoor Watermark via Entropy-based Constraint for Open-source Dataset Copyright Protection Poster Session 6
Ming Sun ⋅ Rui Wang ⋅ Zixuan Zhu ⋅ Lihua Jing ⋅ Yuanfang Guo
ExHall D Poster #435
Rethinking the Adversarial Robustness of Multi-Exit Neural Networks in an Attack-Defense Game Poster Session 2
Keyizhi Xu ⋅ Chi Zhang ⋅ Zhan Chen ⋅ Zhongyuan Wang ⋅ Chunxia Xiao ⋅ Chao Liang
ExHall D Poster #466
Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers Poster Session 6
Lei Chen ⋅ Yuan Meng ⋅ Chen Tang ⋅ Xinzhu Ma ⋅ Jingyan Jiang ⋅ Xin Wang ⋅ Zhi Wang ⋅ Wenwu Zhu
ExHall D Poster #204
Video-Guided Foley Sound Generation with Multimodal Controls Poster Session 4
Ziyang Chen ⋅ Prem Seetharaman ⋅ Bryan Russell ⋅ Oriol Nieto ⋅ David Bourgin ⋅ Andrew Owens ⋅ Justin Salamon
ExHall D Poster #285
SACB-Net: Spatial-awareness Convolutions for Medical Image Registration Poster Session 1
Xinxing Cheng ⋅ Tianyang Zhang ⋅ Wenqi Lu ⋅ Qingjie Meng ⋅ Alejandro F Frangi ⋅ Jinming Duan
ExHall D Poster #484
DCEvo: Discriminative Cross-Dimensional Evolutionary Learning for Infrared and Visible Image Fusion Poster Session 1
Jinyuan Liu ⋅ Bowei Zhang ⋅ Qingyun Mei ⋅ Xingyuan Li ⋅ Yang Zou ⋅ Zhiying Jiang ⋅ Long Ma ⋅ Risheng Liu ⋅ Xin Fan
ExHall D Poster #193
TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution Poster Session 5
linwei dong ⋅ Qingnan Fan ⋅ Yihong Guo ⋅ Zhonghao Wang ⋅ Qi Zhang ⋅ Jinwei Chen ⋅ Yawei Luo ⋅ Changqing Zou
ExHall D Poster #202
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass Poster Session 5
Jianing "Jed" Yang ⋅ Alexander Sax ⋅ Kevin Liang ⋅ Mikael Henaff ⋅ Hao Tang ⋅ Ang Cao ⋅ Joyce Chai ⋅ Franziska Meier ⋅ Matt Feiszli
ExHall D Poster #83
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning Poster Session 6
Aniket Rajiv Didolkar ⋅ Andrii Zadaianchuk ⋅ Rabiul Awal ⋅ Maximilian Seitzer ⋅ Efstratios Gavves ⋅ Aishwarya Agrawal
ExHall D Poster #322
PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection Poster Session 1
Jianan Ye ⋅ Weiguang Zhao ⋅ Xi Yang ⋅ Guangliang Cheng ⋅ Kaizhu Huang
ExHall D Poster #110
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation Poster Session 4
Gyeongrok Oh ⋅ Sung June Kim ⋅ Heeju Ko ⋅ Hyunggun Chi ⋅ Jinkyu Kim ⋅ Dongwook Lee ⋅ Daehyun Ji ⋅ Sungjoon Choi ⋅ Sujin Jang ⋅ Sangpil Kim
ExHall D Poster #126
Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves? Poster Session 3
Yuan-Hong Liao ⋅ Rafid Mahmood ⋅ Sanja Fidler ⋅ David Acuna
ExHall D Poster #385
SET: Spectral Enhancement for Tiny Object Detection Poster Session 1
Huixin Sun ⋅ Runqi Wang ⋅ Yanjing Li ⋅ Linlin Yang ⋅ Shaohui Lin ⋅ Xianbin Cao ⋅ Baochang Zhang
ExHall D Poster #435
g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks Poster Session 3
Zihan Wang ⋅ Gim Hee Lee
ExHall D Poster #339
Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance Poster Session 3
Dimitrios Gerogiannis ⋅ Foivos Paraperas Papantoniou ⋅ Rolandos Alexandros Potamias ⋅ Alexandros Lattas ⋅ Stefanos Zafeiriou
ExHall D Poster #11
UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning Poster Session 6
Weiqi Yan ⋅ Lvhai Chen ⋅ Huaijia Kou ⋅ Shengchuan Zhang ⋅ Yan Zhang ⋅ Liujuan Cao
ExHall D Poster #404
Geometry in Style: 3D Stylization via Surface Normal Deformation Poster Session 6
Nam Anh Dinh ⋅ Itai Lang ⋅ Hyunwoo Kim ⋅ Oded Stein ⋅ Rana Hanocka
ExHall D Poster #219
Multi-modal Vision Pre-training for Medical Image Analysis Poster Session 1
Shaohao Rui ⋅ Lingzhi Chen ⋅ Zhenyu Tang ⋅ Lilong Wang ⋅ Mianxin Liu ⋅ Shaoting Zhang ⋅ Xiaosong Wang
ExHall D Poster #478
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation Poster Session 4
Yunxiang Fu ⋅ Meng Lou ⋅ Yizhou Yu
ExHall D Poster #313
R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization Poster Session 3
Xudong Jiang ⋅ Fangjinhua Wang ⋅ Silvano Galliani ⋅ Christoph Vogel ⋅ Marc Pollefeys
ExHall D Poster #85
DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution Poster Session 5
Yuzhong Zhao ⋅ Feng Liu ⋅ Yue Liu ⋅ Mingxiang Liao ⋅ Chen GONG ⋅ Qixiang Ye ⋅ Fang Wan
ExHall D Poster #355
OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities Poster Session 4
Suyoung Lee ⋅ JAEYOUNG CHUNG ⋅ Kihoon Kim ⋅ Jaeyoo Huh ⋅ Gunhee Lee ⋅ Minsoo Lee ⋅ Kyoung Mu Lee
ExHall D Poster #49
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Poster Session 6
Zhongwei Ren ⋅ Yunchao Wei ⋅ Xun Guo ⋅ Yao Zhao ⋅ Bingyi Kang ⋅ Jiashi Feng ⋅ Xiaojie Jin
ExHall D Poster #276
Optical-Flow Guided Prompt Optimization for Coherent Video Generation Poster Session 2
Hyelin Nam ⋅ Jaemin Kim ⋅ Dohun Lee ⋅ Jong Chul Ye
ExHall D Poster #236
MOS: Modeling Object-Scene Associations in Generalized Category Discovery Poster Session 3
Zhengyuan Peng ⋅ Jinpeng Ma ⋅ Zhimin Sun ⋅ Ran Yi ⋅ Haichuan Song ⋅ Xin Tan ⋅ Lizhuang Ma
ExHall D Poster #428
Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2D Poster Session 5
Jiawei Tan ⋅ Hongxing Wang ⋅ Junwu Weng ⋅ Jiaxin Li ⋅ Zhilong Ou ⋅ Kang Dang
ExHall D Poster #302
GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing Poster Session 6
Tong Wang ⋅ Ting Liu ⋅ Xiaochao Qu ⋅ WU CHENGJING ⋅ Luoqi Liu ⋅ Xiaolin Hu
ExHall D Poster #225
Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories Poster Session 4
Susung Hong ⋅ Johanna Suvi Karras ⋅ Ricardo Martin ⋅ Ira Kemelmacher-Shlizerman
ExHall D Poster #42
Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning Poster Session 5
Lei-Lei Ma ⋅ Shuo Xu ⋅ Ming-Kun Xie ⋅ Lei Wang ⋅ Dengdi Sun ⋅ Haifeng Zhao
ExHall D Poster #419
LoRACLR: Contrastive Adaptation for Customization of Diffusion Models Poster Session 3
Enis Simsar ⋅ Thomas Hofmann ⋅ Federico Tombari ⋅ Pinar Yanardag
ExHall D Poster #242
ArtFormer: Controllable Generation of Diverse 3D Articulated Objects Poster Session 1
Jiayi Su ⋅ Youhe Feng ⋅ Zheng Li ⋅ Jinhua Song ⋅ Yangfan He ⋅ Botao Ren ⋅ Botian Xu
ExHall D Poster #160
Opportunistic Single-Photon Time of Flight Poster Session 4
Sotiris Nousias ⋅ Mian Wei ⋅ Howard Xiao ⋅ Maxx Wu ⋅ Shahmeer Athar ⋅ Kevin J Wang ⋅ Anagh Malik ⋅ David A. Barmherzig ⋅ David B. Lindell ⋅ Kiriakos Kutulakos
ExHall D Poster #68
Finsler Multi-Dimensional Scaling: Manifold Learning for Asymmetric Dimensionality Reduction and Embedding Poster Session 5
Thomas Dagès ⋅ Simon Weber ⋅ Ya-Wei Eileen Lin ⋅ Ronen Talmon ⋅ Daniel Cremers ⋅ Michael Lindenbaum ⋅ Alfred M. Bruckstein ⋅ Ron Kimmel
ExHall D Poster #462
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought Poster Session 3
Yunze Man ⋅ De-An Huang ⋅ Guilin Liu ⋅ Shiwei Sheng ⋅ Shilong Liu ⋅ Liangyan Gui ⋅ Jan Kautz ⋅ Yu-Xiong Wang ⋅ Zhiding Yu
ExHall D Poster #346
Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations Poster Session 3
Jungin Park ⋅ Jiyoung Lee ⋅ Kwanghoon Sohn
ExHall D Poster #288
SFDM: Robust Decomposition of Geometry and Reflectance for Realistic Face Rendering from Sparse-view Images Poster Session 6
Daisheng Jin ⋅ Jiangbei Hu ⋅ Baixin Xu ⋅ Yuxin Dai ⋅ Chen Qian ⋅ Ying He
ExHall D Poster #21
DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery Poster Session 1
Jing Gao ⋅ Ce Zheng ⋅ Laszlo Jeni ⋅ Zackory Erickson
ExHall D Poster #154
Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation Poster Session 6
Qiang Zhang ⋅ Mengsheng Zhao ⋅ Jiawei Liu ⋅ Fanrui Zhang ⋅ Yongchao Xu ⋅ Zheng-Jun Zha
ExHall D Poster #419
A Regularization-Guided Equivariant Approach for Image Restoration Poster Session 1
Yulu Bai ⋅ Jiahong Fu ⋅ Qi Xie ⋅ Deyu Meng
ExHall D Poster #201
DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification Poster Session 5
Darryl Ho ⋅ Samuel Madden
ExHall D Poster #287
All-Optical Nonlinear Diffractive Deep Network for Ultrafast Image Denoising Poster Session 6
Xiaoling Zhou ⋅ Zhemg Lee ⋅ Wei Ye ⋅ Rui Xie ⋅ Wenbo Zhang ⋅ Guanju Peng ⋅ Zongze Li ⋅ Shikun Zhang
ExHall D Poster #196
CoSER: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation Poster Session 1
Bonan Li ⋅ Zicheng Zhang ⋅ Xingyi Yang ⋅ Xinchao Wang
ExHall D Poster #260
HybridMQA: Exploring Geometry-Texture Interactions for Colored Mesh Quality Assessment Poster Session 5
Armin Shafiee Sarvestani ⋅ Sheyang Tang ⋅ Zhou Wang
ExHall D Poster #35
Self-Learning Hyperspectral and Multispectral Image Fusion via Adaptive Residual Guided Subspace Diffusion Model Poster Session 4
Jian Zhu ⋅ He Wang ⋅ Yang Xu ⋅ Zebin Wu ⋅ Zhihui Wei
ExHall D Poster #196
SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model Poster Session 5
Yucheng Mao ⋅ Boyang Wang ⋅ Nilesh Kulkarni ⋅ Jeong Joon Park
ExHall D Poster #54
StickMotion: Generating 3D Human Motions by Drawing a Stickman Poster Session 3
Tao Wang ⋅ Zhihua Wu ⋅ Qiaozhi He ⋅ Jiaming Chu ⋅ Ling Qian ⋅ Yu Cheng ⋅ Junliang Xing ⋅ Jian Zhao ⋅ Lei Jin
ExHall D Poster #164
Enduring, Efficient and Robust Trajectory Prediction Attack in Autonomous Driving via Optimization-Driven Multi-Frame Perturbation Framework Poster Session 4
Yi Yu ⋅ Weizhen Han ⋅ Libing Wu ⋅ Bingyi Liu ⋅ Enshu Wang ⋅ Zhuangzhuang Zhang
ExHall D Poster #135
GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks Poster Session 1
Haoqiang Kang ⋅ Enna Sachdeva ⋅ Piyush Gupta ⋅ Sangjae Bae ⋅ Kwonjoon Lee
ExHall D Poster #347
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics Poster Session 4
Chan Hee Song ⋅ Valts Blukis ⋅ Jonathan Tremblay ⋅ Stephen Tyree ⋅ Yu Su ⋅ Stan Birchfield
ExHall D Poster #146
Segment Any-Quality Images with Generative Latent Space Enhancement Poster Session 1
Guangqian Guo ⋅ Yong Guo ⋅ Xuehui Yu ⋅ Wenbo Li ⋅ Yaoxing Wang ⋅ Shan Gao
ExHall D Poster #207
Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents Poster Session 5
Jun Chen ⋅ Dannong Xu ⋅ Junjie Fei ⋅ Chun-Mei Feng ⋅ Mohamed Elhoseiny
ExHall D Poster #362
Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages Poster Session 6
Matteo Farina ⋅ Massimiliano Mancini ⋅ Giovanni Iacca ⋅ Elisa Ricci
ExHall D Poster #367
TAGA: Self-supervised Learning for Template-free Animatable Gaussian Articulated Model Poster Session 5
Zhichao Zhai ⋅ Guikun Chen ⋅ Wenguan Wang ⋅ Dong Zheng ⋅ Jun Xiao
ExHall D Poster #11
MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing Poster Session 1
Shuo Wang ⋅ Wanting Li ⋅ Yongcai Wang ⋅ Zhaoxin Fan ⋅ Zhe Huang ⋅ xudong cai ⋅ Jian Zhao ⋅ Deying Li
ExHall D Poster #101
Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization Poster Session 4
Sihao Liu ⋅ Yibo Yang ⋅ Xiaojie Li ⋅ David A. Clifton ⋅ Bernard Ghanem
ExHall D Poster #447
Explaining in Diffusion: Explaining a Classifier with Diffusion Semantics Poster Session 3
Tahira Kazimi ⋅ Ritika Allada ⋅ Pinar Yanardag
ExHall D Poster #397
Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes Poster Session 6
Lihan Jiang ⋅ Kerui Ren ⋅ Mulin Yu ⋅ Linning Xu ⋅ Junting Dong ⋅ Tao Lu ⋅ Feng Zhao ⋅ Dahua Lin ⋅ Bo Dai
ExHall D Poster #62
Attention Distillation: A Unified Approach to Visual Characteristics Transfer Poster Session 4
Yang Zhou ⋅ Xu Gao ⋅ Zichong Chen ⋅ Hui Huang
ExHall D Poster #236
Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting Poster Session 4
Jinbo Yan ⋅ Rui Peng ⋅ Zhiyan Wang ⋅ Luyang Tang ⋅ Jiayu Yang ⋅ Jie Liang ⋅ Jiahao Wu ⋅ Ronggang Wang
ExHall D Poster #65
DreamRelation: Bridging Customization and Relation Generation Poster Session 4
Qingyu Shi ⋅ Lu Qi ⋅ Jianzong Wu ⋅ Jinbin Bai ⋅ Jingbo Wang ⋅ Yunhai Tong ⋅ Xiangtai Li
ExHall D Poster #251
IndoorGS: Geometric Cues Guided Gaussian Splatting for Indoor Scene Reconstruction Poster Session 1
Cong Ruan ⋅ Yuesong Wang ⋅ Bin Zhang ⋅ Lili Ju ⋅ Tao Guan
ExHall D Poster #63
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Poster Session 4
Ashmal Vayani ⋅ Dinura Dissanayake ⋅ Hasindri Watawana ⋅ Noor Ahsan ⋅ Nevasini Sasikumar ⋅ Omkar Thawakar ⋅ Henok Biadglign Ademtew ⋅ Yahya Hmaiti ⋅ Amandeep Kumar ⋅ Kartik Kuckreja ⋅ Mykola Maslych ⋅ Wafa Al Ghallabi ⋅ Mihail Minkov Mihaylov ⋅ Chao Qin ⋅ Abdelrahman Shaker ⋅ Mike Zhang ⋅ Mahardika Krisna Ihsani ⋅ Amiel Gian Esplana ⋅ Monil Gokani ⋅ Shachar Mirkin ⋅ Harsh Singh ⋅ Ashay Srivastava ⋅ Endre Hamerlik ⋅ Fathinah Asma Izzati ⋅ Fadillah Adamsyah Maani ⋅ Sebastian Cavada ⋅ Jenny Chim ⋅ Rohit Gupta ⋅ Sanjay Manjunath ⋅ Kamila Zhumakhanova ⋅ Feno Heriniaina Rabevohitra ⋅ Azril Hafizi Amirudin ⋅ Muhammad Ridzuan ⋅ Daniya Najiha Abdul Kareem ⋅ Ketan Pravin More ⋅ Kunyang Li ⋅ Pramesh Shakya ⋅ Muhammad Saad ⋅ Amirpouya Ghasemaghaei ⋅ Amirbek Djanibekov ⋅ Dilshod Azizov ⋅ Branislava Jankovic ⋅ Naman Bhatia ⋅ Alvaro Cabrera Berobide ⋅ Johan Obando-Ceron ⋅ Olympiah Otieno ⋅ Fabian Farestam ⋅ Muztoba Rabbani ⋅ Sanoojan Baliah ⋅ Santosh Sanjeev ⋅ Abduragim Shtanchaev ⋅ Maheen Fatima ⋅ Thao Nguyen ⋅ Amrin Kareem ⋅ Toluwani Aremu ⋅ Nathan Augusto Zacarias Xavier ⋅ Amit Bhatkal ⋅ Hawau Olamide Toyin ⋅ Aman Chadha ⋅ Hisham Cholakkal ⋅ Rao Anwer ⋅ Michael Felsberg ⋅ Jorma Laaksonen ⋅ Thamar Solorio ⋅ Monojit Choudhury ⋅ Ivan Laptev ⋅ Mubarak Shah ⋅ Salman Khan ⋅ Fahad Shahbaz Khan
ExHall D Poster #358
Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis Poster Session 1
Hongyu Sun ⋅ Qiuhong Ke ⋅ Ming Cheng ⋅ Yongcai Wang ⋅ Deying Li ⋅ Chenhui Gou ⋅ Jianfei Cai
ExHall D Poster #102
MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation Poster Session 1
Zhenyu Wu ⋅ Yuheng Zhou ⋅ Xiuwei Xu ⋅ Ziwei Wang ⋅ Haibin Yan
ExHall D Poster #144
Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction Poster Session 3
Li Fang ⋅ Hao Zhu ⋅ Longlong Chen ⋅ Fei Hu ⋅ Long Ye ⋅ Zhan Ma
ExHall D Poster #54
Text Augmented Correlation Transformer For Few-shot Classification & Segmentation Poster Session 5
Srinivasa Rao Nandam ⋅ Sara Atito ⋅ Zhenhua Feng ⋅ Josef Kittler ⋅ Muhammad Awais
ExHall D Poster #412
Ref-GS: Directional Factorization for 2D Gaussian Splatting Poster Session 6
Youjia Zhang ⋅ Anpei Chen ⋅ Yumin Wan ⋅ Zikai Song ⋅ Junqing Yu ⋅ Yawei Luo ⋅ Wei Yang
ExHall D Poster #30
Pose Priors from Language Models Poster Session 2
Sanjay Subramanian ⋅ Evonne Ng ⋅ Lea Müller ⋅ Dan Klein ⋅ Shiry Ginosar ⋅ Trevor Darrell
ExHall D Poster #169
Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction Poster Session 6
Seungtae Nam ⋅ Xiangyu Sun ⋅ Gyeongjin Kang ⋅ Younggeun Lee ⋅ Seungjun Oh ⋅ Eunbyung Park
ExHall D Poster #50
HyperGS: Hyperspectral 3D Gaussian Splatting Poster Session 2
Christopher Thirgood ⋅ Oscar Mendez ⋅ Erin Chao Ling ⋅ Jonathan Storey ⋅ Simon Hadfield
ExHall D Poster #50
Towards Optimizing Large-Scale Multi-Graph Matching in Bioimaging Poster Session 3
Max Kahl ⋅ Sebastian Stricker ⋅ Lisa Hutschenreiter ⋅ Florian Bernard ⋅ Carsten Rother ⋅ Bogdan Savchynskyy
ExHall D Poster #89
SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors Poster Session 2
Yufan Wu ⋅ Xuanhong Chen ⋅ Wen Li ⋅ Shunran Jia ⋅ Hualiang Wei ⋅ Kairui Feng ⋅ Jialiang CHEN ⋅ Yuhan Li ⋅ Ang He ⋅ Weimin Zhang ⋅ Bingbing Ni ⋅ Wenjun Zhang
ExHall D Poster #12
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing Poster Session 2
Seokhyeon Hong ⋅ Chaelin Kim ⋅ Serin Yoon ⋅ Junghyun Nam ⋅ Sihun Cha ⋅ Junyong Noh
ExHall D Poster #172
Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding Poster Session 3
Andong Deng ⋅ Zhongpai Gao ⋅ Anwesa Choudhuri ⋅ Benjamin Planche ⋅ Meng Zheng ⋅ Bin Wang ⋅ Terrence Chen ⋅ Chen Chen ⋅ Ziyan Wu
ExHall D Poster #298
Towards Effective and Sparse Adversarial Attack on Spiking Neural Networks via Breaking Invisible Surrogate Gradients Poster Session 1
Li Lun ⋅ Kunyu Feng ⋅ Qinglong Ni ⋅ Ling Liang ⋅ Yuan Wang ⋅ Ying Li ⋅ dunshan yu ⋅ Xiaoxin CUI
ExHall D Poster #321
Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation Poster Session 2
Gianni Franchi ⋅ Nacim Belkhir ⋅ Dat NGUYEN ⋅ Guoxuan Xia ⋅ Andrea Pilzer
ExHall D Poster #257
CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images Poster Session 4
Jungho Lee ⋅ Suhwan Cho ⋅ Taeoh Kim ⋅ Ho-Deok Jang ⋅ Minhyeok Lee ⋅ Geonho Cha ⋅ Dongyoon Wee ⋅ Dogyoon Lee ⋅ Sangyoun Lee
ExHall D Poster #24
PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention Poster Session 4
Weicheng Wang ⋅ Guoli Jia ⋅ Zhongqi Zhang ⋅ Liang Lin ⋅ Jufeng Yang
ExHall D Poster #239
Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models Poster Session 4
Shuyang Hao ⋅ Bryan Hooi ⋅ Jun Liu ⋅ Kai-Wei Chang ⋅ Zi Huang ⋅ Yujun Cai
ExHall D Poster #389
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding Poster Session 2
Hongyu Li ⋅ Jinyu Chen ⋅ Ziyu Wei ⋅ Shaofei Huang ⋅ Tianrui Hui ⋅ Jialin Gao ⋅ Xiaoming Wei ⋅ Si Liu
ExHall D Poster #307
Towards Million-Scale Adversarial Robustness Evaluation With Stronger Individual Attacks Poster Session 6
Yong Xie ⋅ Weijie Zheng ⋅ Hanxun Huang ⋅ Guangnan Ye ⋅ Xingjun Ma
ExHall D Poster #436
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark Poster Session 2
Xin Zhang ⋅ Xue Yang ⋅ Yuxuan Li ⋅ Jian Yang ⋅ Ming-Ming Cheng ⋅ Xiang Li
ExHall D Poster #196
Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection Poster Session 5
Qi Chen ⋅ Hu Ding
ExHall D Poster #449
DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry Poster Session 5
Jing Li ⋅ Yihang Fu ⋅ Falai Chen
ExHall D Poster #37
Revisiting MAE Pre-training for 3D Medical Image Segmentation Poster Session 1
Tassilo Wald ⋅ Constantin Ulrich ⋅ Stanislav Lukyanenko ⋅ Andrei Goncharov ⋅ Alberto Paderno ⋅ Maximilian Miller ⋅ Leander Maerkisch ⋅ Paul F Jaeger ⋅ Klaus Maier-Hein
ExHall D Poster #480
Task-driven Image Fusion with Learnable Fusion Loss Poster Session 2
Haowen Bai ⋅ Jiangshe Zhang ⋅ Zixiang Zhao ⋅ Yichen Wu ⋅ Lilun Deng ⋅ Yukun Cui ⋅ Tao Feng ⋅ Shuang Xu
ExHall D Poster #200
Compositional Targeted Multi-Label Universal Perturbations Poster Session 4
Hassan Mahmood ⋅ Ehsan Elhamifar
ExHall D Poster #454
PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies Poster Session 4
Mojtaba Nafez ⋅ Amirhossein Koochakian ⋅ Arad Maleki ⋅ Jafar Habibi ⋅ Mohammad Rohban
ExHall D Poster #436
Distilling Monocular Foundation Model for Fine-grained Depth Completion Poster Session 5
Yingping Liang ⋅ Yutao Hu ⋅ Wenqi Shao ⋅ Ying Fu
ExHall D Poster #115
Diffusion Renderer: Neural Inverse and Forward Rendering with Video Diffusion Models Poster Session 6
Ruofan Liang ⋅ Žan Gojčič ⋅ Huan Ling ⋅ Jacob Munkberg ⋅ Jon Hasselgren ⋅ Chih-Hao Lin ⋅ Jun Gao ⋅ Alexander Keller ⋅ Nandita Vijaykumar ⋅ Sanja Fidler ⋅ Zian Wang
ExHall D Poster #29
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant Poster Session 1
Yikun Liu ⋅ Yajie Zhang ⋅ jiayin cai ⋅ Xiaolong Jiang ⋅ Yao Hu ⋅ Jiangchao Yao ⋅ Yanfeng Wang ⋅ Weidi Xie
ExHall D Poster #366
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning Poster Session 5
Gaojian Wang ⋅ Feng Lin ⋅ Tong Wu ⋅ Zhenguang Liu ⋅ Zhongjie Ba ⋅ Kui Ren
ExHall D Poster #319
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion Poster Session 4
Zhenglin Zhou ⋅ Fan Ma ⋅ Hehe Fan ⋅ Tat-seng Chua
ExHall D Poster #8
On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events Poster Session 4
Jesse Hagenaars ⋅ Yilun Wu ⋅ Federico Paredes Valles ⋅ Stein Stroobants ⋅ Guido De Croon
ExHall D Poster #124
LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds Poster Session 1
Zihui Zhang ⋅ Weisheng Dai ⋅ Hongtao Wen ⋅ Bo Yang
ExHall D Poster #112
One is Plenty: A Polymorphic Feature Interpreter for Immutable Heterogeneous Collaborative Perception Poster Session 1
Yuchen Xia ⋅ Quan Yuan ⋅ Guiyang Luo ⋅ Xiaoyuan Fu ⋅ Yang Li ⋅ Xuanhan Zhu ⋅ Tianyou Luo ⋅ Siheng Chen ⋅ Jinglin Li
ExHall D Poster #133
Dynamic Updates for Language Adaptation in Visual-Language Tracking Poster Session 4
Xiaohai Li ⋅ Bineng Zhong ⋅ Qihua Liang ⋅ Zhiyi Mo ⋅ Jian Nong ⋅ Shuxiang Song
ExHall D Poster #321
CustAny: Customizing Anything from A Single Example Poster Session 5
Lingjie Kong ⋅ Kai WU ⋅ Chengming Xu ⋅ Xiaobin Hu ⋅ Wenhui Han ⋅ Jinlong Peng ⋅ Donghao Luo ⋅ Mengtian Li ⋅ Jiangning Zhang ⋅ Chengjie Wang ⋅ Yanwei Fu
ExHall D Poster #246
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation Poster Session 2
Wei Chen ⋅ Lin Li ⋅ Yongqi Yang ⋅ Bin Wen ⋅ Fan Yang ⋅ Tingting Gao ⋅ Yu Wu ⋅ Long Chen
ExHall D Poster #258
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision Poster Session 2
Ruicheng Wang ⋅ Sicheng Xu ⋅ Cassie Lee Dai ⋅ Jianfeng XIANG ⋅ Yu Deng ⋅ Xin Tong ⋅ Jiaolong Yang
ExHall D Poster #113
ScaleLSD: Scalable Deep Line Segment Detection Streamlined Poster Session 2
Zeran Ke ⋅ Bin Tan ⋅ Xianwei Zheng ⋅ Yujun Shen ⋅ Tianfu Wu ⋅ Nan Xue
ExHall D Poster #89
Learning with Noisy Triplet Correspondence for Composed Image Retrieval Poster Session 4
Shuxian Li ⋅ Changhao He ⋅ Xiting Liu ⋅ Joey Tianyi Zhou ⋅ Xi Peng ⋅ Peng Hu
ExHall D Poster #364
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression Poster Session 3
Bo Tong ⋅ Bokai Lai ⋅ Yiyi Zhou ⋅ Luo ⋅ Yunhang Shen ⋅ Ke Li ⋅ Xiaoshuai Sun ⋅ Rongrong Ji
ExHall D Poster #375
DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting Poster Session 5
Seungjun Lee ⋅ Gim Hee Lee
ExHall D Poster #65
DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction Poster Session 2
Ben Kaye ⋅ Tomas Jakab ⋅ Shangzhe Wu ⋅ Christian Rupprecht ⋅ Andrea Vedaldi
ExHall D Poster #100
CGMatch: A Different Perspective of Semi-supervised Learning Poster Session 3
Bo Cheng ⋅ Jueqing Lu ⋅ Yuan Tian ⋅ Haifeng Zhao ⋅ Yi Chang ⋅ Lan Du
ExHall D Poster #453
ChatHuman: Chatting about 3D Humans with Tools Poster Session 2
Jing Lin ⋅ Yao Feng ⋅ Weiyang Liu ⋅ Michael J. Black
ExHall D Poster #265
Leveraging Temporal Cues for Semi-Supervised Multi-View 3D Object Detection Poster Session 6
Jinhyung Park ⋅ Navyata Sanghvi ⋅ Hiroki Adachi ⋅ Yoshihisa Shibata ⋅ Shawn Hunt ⋅ Shinya Tanaka ⋅ Hironobu Fujiyoshi ⋅ Kris Kitani
ExHall D Poster #119
Reconciling Stochastic and Deterministic Strategies for Zero-shot Image Restoration using Diffusion Model in Dual Poster Session 5
Chong Wang ⋅ Lanqing Guo ⋅ Zixuan Fu ⋅ SIYUAN YANG ⋅ Hao Cheng ⋅ Alex C. Kot ⋅ Bihan Wen
ExHall D Poster #205
Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport Poster Session 3
Mengnan Liu ⋅ Le Wang ⋅ Sanping Zhou ⋅ Kun Xia ⋅ Xiaolong Sun ⋅ Gang Hua
ExHall D Poster #307
Hierarchical Flow Diffusion for Efficient Frame Interpolation Poster Session 5
Yang Hai ⋅ Guo Wang ⋅ Tan Su ⋅ jerett ⋅ Yinlin Hu
ExHall D Poster #179
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval Poster Session 2
Davide Caffagni ⋅ Sara Sarto ⋅ Marcella Cornia ⋅ Lorenzo Baraldi ⋅ Rita Cucchiara
ExHall D Poster #373
Camouflage Anything: Learning to Hide using Controlled Out-painting and Representation Engineering Poster Session 1
Biplab Das ⋅ Viswanath Gopalakrishnan
ExHall D Poster #327
Test-Time Fine-Tuning of Image Compression Models for Multi-Task Adaptability Poster Session 1
Unki Park ⋅ Seongmoon Jeong ⋅ Jang Youngchan ⋅ Gyeong-Moon Park ⋅ Jong Hwan Ko
ExHall D Poster #409
BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation Poster Session 6
Yulu Pan ⋅ Ce Zhang ⋅ Gedas Bertasius
ExHall D Poster #267
AniDoc: Animation Creation Made Easier Poster Session 4
Yihao Meng ⋅ Hao Ouyang ⋅ Hanlin Wang ⋅ Qiuyu Wang ⋅ Wen Wang ⋅ Ka Leong Cheng ⋅ Zhiheng Liu ⋅ Yujun Shen ⋅ Huamin Qu
ExHall D Poster #227
TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting Poster Session 3
Jianchuan Chen ⋅ Jingchuan Hu ⋅ Gaige Wang ⋅ Zhonghua Jiang ⋅ Tiansong Zhou ⋅ Zhiwen Chen ⋅ Chengfei Lv
ExHall D Poster #7
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video Poster Session 1
David Yifan Yao ⋅ Albert J. Zhai ⋅ Shenlong Wang
ExHall D Poster #88
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models Poster Session 4
Yongting Zhang ⋅ Lu Chen ⋅ Guodong Zheng ⋅ Yifeng Gao ⋅ Rui Zheng ⋅ Jinlan Fu ⋅ Zhenfei Yin ⋅ Senjie Jin ⋅ Yu Qiao ⋅ Xuanjing Huang ⋅ Feng Zhao ⋅ Tao Gui ⋅ Jing Shao
ExHall D Poster #387
UniAP: Unifying Inter- and Intra-Layer Automatic Parallelism by Mixed Integer Quadratic Programming Poster Session 5
Hao Lin ⋅ Ke Wu ⋅ Jie Li ⋅ Jun Li ⋅ Wu-Jun Li
ExHall D Poster #214
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model Poster Session 4
Xiang Gao ⋅ Shuai Yang ⋅ Jiaying Liu
ExHall D Poster #233
CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image Poster Session 3
Jingshun Huang ⋅ Haitao Lin ⋅ Tianyu Wang ⋅ Yanwei Fu ⋅ Xiangyang Xue ⋅ Yi Zhu
ExHall D Poster #97
Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models? Poster Session 4
Yanbo Wang ⋅ Jiyang Guan ⋅ Jian Liang ⋅ Ran He
ExHall D Poster #388
Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater Poster Session 1
Xueyu Liu ⋅ Rui Wang ⋅ Yexin Lai ⋅ Guangze Shi ⋅ Feixue Shao ⋅ Fang Hao ⋅ Jianan Zhang ⋅ Jia Shen ⋅ Yongfei Wu ⋅ Wen Zheng
ExHall D Poster #400
Harnessing Global-Local Collaborative Adversarial Perturbation for Anti-Customization Poster Session 3
Long Xu ⋅ Jiakai Wang ⋅ Haojie Hao ⋅ Haotong Qin ⋅ Jiejie Zhao ⋅ Xianglong Liu
ExHall D Poster #264
PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction Poster Session 4
Sinisa Stekovic ⋅ Arslan Artykov ⋅ Stefan Ainetter ⋅ Mattia Durso ⋅ Friedrich Fraundorfer
ExHall D Poster #41
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection Poster Session 4
Divya Velayudhan ⋅ Abdelfatah Ahmed ⋅ Mohamad Alansari ⋅ Neha Gour ⋅ Abderaouf Behouch ⋅ Taimur Hassan ⋅ Syed Talal Wasim ⋅ Nabil Maalej ⋅ Muzammal Naseer ⋅ Jürgen Gall ⋅ Mohammed Bennamoun ⋅ Ernesto Damiani ⋅ Naoufel Werghi
ExHall D Poster #472
SCAP: Transductive Test-Time Adaptation via Supportive Clique-based Attribute Prompting Poster Session 6
Chenyu Zhang ⋅ Kunlun Xu ⋅ Zichen Liu ⋅ Yuxin Peng ⋅ Jiahuan Zhou
ExHall D Poster #371
Contextual AD Narration with Interleaved Multimodal Sequence Poster Session 2
Hanlin Wang ⋅ Zhan Tong ⋅ Kecheng Zheng ⋅ Yujun Shen ⋅ Limin Wang
ExHall D Poster #287
MNE-SLAM: Multi-Agent Neural SLAM for Mobile Robots Poster Session 1
Tianchen Deng ⋅ Guole Shen ⋅ Chen Xun ⋅ Shenghai Yuan ⋅ Tongxing Jin ⋅ Hongming Shen ⋅ Yanbo Wang ⋅ Jingchuan Wang ⋅ Hesheng Wang ⋅ Danwei Wang ⋅ Weidong Chen
ExHall D Poster #123
RigGS: Rigging of 3D Gaussians for Modeling Articulated Objects in Videos Poster Session 2
Yuxin Yao ⋅ Zhi Deng ⋅ Junhui Hou
ExHall D Poster #14
TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering Poster Session 1
Chun Gu ⋅ Xiaofei Wei ⋅ Li Zhang ⋅ Xiatian Zhu
ExHall D Poster #31
FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering Poster Session 1
Chengyue Huang ⋅ Brisa Maneechotesuwan ⋅ Shivang Chopra ⋅ Zsolt Kira
ExHall D Poster #356
Generative Inbetweening through Frame-wise Conditions-Driven Video Generation Poster Session 6
Tianyi Zhu ⋅ Dongwei Ren ⋅ Qilong Wang ⋅ Xiaohe Wu ⋅ Wangmeng Zuo
ExHall D Poster #171
Exploring Temporally-Aware Features for Point Tracking Poster Session 1
Inès Hyeonsu Kim ⋅ Seokju Cho ⋅ Gabriel Huang ⋅ Jung Yi ⋅ Joon-Young Lee ⋅ Seungryong Kim
ExHall D Poster #166
GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection Poster Session 6
Dušan Malić ⋅ Christian Fruhwirth-Reisinger ⋅ Samuel Schulter ⋅ Horst Possegger
ExHall D Poster #115
Detail-Preserving Latent Diffusion for Stable Shadow Removal Poster Session 2
Jiamin Xu ⋅ Yuxin Zheng ⋅ Zelong Li ⋅ Chi Wang ⋅ Renshu Gu ⋅ Weiwei Xu ⋅ Gang Xu
ExHall D Poster #212
Floating No More: Object-Ground Reconstruction from a Single Image Poster Session 6
Yunze Man ⋅ Yichen Sheng ⋅ Jianming Zhang ⋅ Liangyan Gui ⋅ Yu-Xiong Wang
ExHall D Poster #94
CrossOver: 3D Scene Cross-Modal Alignment Poster Session 2
Sayan Deb Sarkar ⋅ Ondrej Miksik ⋅ Marc Pollefeys ⋅ Daniel Barath ⋅ Iro Armeni
ExHall D Poster #346
SKE-Layout: Spatial Knowledge Enhanced Layout Generation with LLMs Poster Session 4
Junsheng Wang ⋅ Nieqing Cao ⋅ Yan Ding ⋅ Mengying Xie ⋅ Fuqiang Gu ⋅ Chao Chen
ExHall D Poster #344
Pattern Analogies: Learning to Perform Programmatic Image Edits by Analogy Poster Session 6
Aditya Ganeshan ⋅ Thibault Groueix ⋅ Paul Guerrero ⋅ Radomir Mech ⋅ Matthew Fisher ⋅ Daniel Ritchie
ExHall D Poster #243
4D-Fly: Fast 4D Reconstruction from a Single Monocular Video Poster Session 4
Diankun Wu ⋅ Fangfu Liu ⋅ Yi-Hsin Hung ⋅ Yue Qian ⋅ Xiaohang Zhan ⋅ Yueqi Duan
ExHall D Poster #79
STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds Poster Session 6
Zikuan Li ⋅ Honghua Chen ⋅ Yuecheng Wang ⋅ Sibo Wu ⋅ Mingqiang Wei ⋅ Jun Wang
ExHall D Poster #105
Complementary Advantages: Exploiting Cross-Field Frequency Correlation for NIR-Assisted Image Denoising Poster Session 3
Yuchen Wang ⋅ Hongyuan Wang ⋅ Lizhi Wang ⋅ Xin Wang ⋅ Lin Zhu ⋅ Wanxuan Lu ⋅ Hua Huang
ExHall D Poster #194
Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation Poster Session 5
Rong Qin ⋅ Xingyu Liu ⋅ Jinglei Shi ⋅ Liang Lin ⋅ Jufeng Yang
ExHall D Poster #473
DiffLO: Semantic-Aware LiDAR Odometry with Diffusion-Based Refinement Poster Session 4
Huang Yongshu ⋅ Chen Liu ⋅ Minghang Zhu ⋅ Sheng Ao ⋅ Chenglu Wen ⋅ Cheng Wang
ExHall D Poster #118
pFedMxF: Personalized Federated Class-Incremental Learning with Mixture of Frequency Aggregation Poster Session 6
Yifei Zhang ⋅ Hao Zhu ⋅ Alysa Ziying Tan ⋅ Dianzhi Yu ⋅ Longtao Huang ⋅ Han Yu
ExHall D Poster #430
Style-Editor: Text-driven Object-centric Style Editing Poster Session 4
Jihun Park ⋅ Jongmin Gim ⋅ Kyoungmin Lee ⋅ Seunghun Lee ⋅ Sunghoon Im
ExHall D Poster #237
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene Poster Session 3
Tai-Yu Daniel Pan ⋅ Sooyoung Jeon ⋅ Mengdi Fan ⋅ Jinsu Yoo ⋅ Zhenyang Feng ⋅ Mark Campbell ⋅ Kilian Q Weinberger ⋅ Bharath Hariharan ⋅ Wei-Lun Chao
ExHall D Poster #133
Efficient Transfer Learning for Video-language Foundation Models Poster Session 6
Haoxing Chen ⋅ Zizheng Huang ⋅ Yan Hong ⋅ YANSHUO WANG ⋅ Zhongcai Lyu ⋅ Zhuoer Xu ⋅ Jun Lan ⋅ Zhangxuan Gu
ExHall D Poster #285
Radio Frequency Ray Tracing with Neural Object Representation for Enhanced RF Modeling Poster Session 5
Xingyu Chen ⋅ Zihao Feng ⋅ Kun Qian ⋅ Xinyu Zhang
ExHall D Poster #28
DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval Poster Session 4
Leqi Shen ⋅ Guoqiang Gong ⋅ Tianxiang Hao ⋅ Tao He ⋅ Yifeng Zhang ⋅ Pengzhang Liu ⋅ Sicheng Zhao ⋅ Jungong Han ⋅ Guiguang Ding
ExHall D Poster #372
ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction Poster Session 2
YUEJIAO SU ⋅ Yi Wang ⋅ Qiongyang Hu ⋅ Chuang Yang ⋅ Lap-Pui Chau
ExHall D Poster #350
MET3R: Measuring Multi-View Consistency in Generated Images Poster Session 2
Mohammad Asim ⋅ Christopher Wewer ⋅ Thomas Wimmer ⋅ Bernt Schiele ⋅ Jan Lenssen
ExHall D Poster #56
Segmenting Maxillofacial Structures in CBCT Volumes Poster Session 1
Federico Bolelli ⋅ Kevin Marchesini ⋅ Niels van Nistelrooij ⋅ Luca Lumetti ⋅ Vittorio Pipoli ⋅ Elisa Ficarra ⋅ Shankeeth Vinayahalingam ⋅ Costantino Grana
ExHall D Poster #485
3D Dental Model Segmentation with Geometrical Boundary Preserving Poster Session 2
Shufan Xi ⋅ Zexian Liu ⋅ Junlin Chang ⋅ Hongyu Wu ⋅ Xiaogang Wang ⋅ Aimin Hao
ExHall D Poster #486
Neuro-3D: Towards 3D Visual Decoding from EEG Signals Poster Session 5
Zhanqiang Guo ⋅ Jiamin Wu ⋅ Yonghao Song ⋅ Jiahui Bu ⋅ Weijian Mai ⋅ Qihao Zheng ⋅ Wanli Ouyang ⋅ Chunfeng Song
ExHall D Poster #273
VISTA3D: A Unified Segmentation Foundation Model For 3D Medical Imaging Poster Session 4
Yufan He ⋅ Pengfei Guo ⋅ Yucheng Tang ⋅ Andriy Myronenko ⋅ Vishwesh Nath ⋅ Ziyue Xu ⋅ Dong Yang ⋅ Can Zhao ⋅ Benjamin D. Simon ⋅ Mason Belue ⋅ Stephanie Anne Harmon ⋅ Baris Turkbey ⋅ Daguang Xu ⋅ Wenqi Li
ExHall D Poster #481
The Art of Deception: Color Visual Illusions and Diffusion Models Poster Session 4
Alexandra Gomez-Villa ⋅ Kai Wang ⋅ C. Alejandro Parraga ⋅ Bartłomiej Twardowski ⋅ Jesus Malo ⋅ Javier Vazquez-Corral ⋅ Joost van de Weijer
ExHall D Poster #273
GaussianUDF: Inferring Unsigned Distance Functions through 3D Gaussian Splatting Poster Session 6
Shujuan Li ⋅ Yu-Shen Liu ⋅ Zhizhong Han
ExHall D Poster #92
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Poster Session 3
Zhiyuan Ma ⋅ Xinyue Liang ⋅ Rongyuan Wu ⋅ Xiangyu Zhu ⋅ Zhen Lei ⋅ Lei Zhang
ExHall D Poster #37
Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI Poster Session 5
Won Jun Kim ⋅ Hyungjin Chung ⋅ Jaemin Kim ⋅ Sangmin Lee ⋅ Byeongsu Sim ⋅ Jong Chul Ye
ExHall D Poster #266
ZoomLDM: Latent Diffusion Model for Multi-scale Image Generation Poster Session 5
Srikar Yellapragada ⋅ Alexandros Graikos ⋅ Kostas Triaridis ⋅ Prateek Prasanna ⋅ Rajarsi Gupta ⋅ Joel Saltz ⋅ Dimitris Samaras
ExHall D Poster #229
Do Computer Vision Foundation Models Learn the Low-level Characteristics of the Human Visual System? Poster Session 4
Yancheng Cai ⋅ Fei Yin ⋅ Dounia Hammou ⋅ Rafal Mantiuk
ExHall D Poster #404
Towards RAW Object Detection in Diverse Conditions Poster Session 2
Zhong-Yu Li ⋅ Xin Jin ⋅ Bo-Yuan Sun ⋅ Chun-Le Guo ⋅ Ming-Ming Cheng
ExHall D Poster #333
DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders Poster Session 4
Sizai Hou ⋅ Songze Li ⋅ Duanyi Yao
ExHall D Poster #463
FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training Poster Session 1
Anjia Cao ⋅ Xing Wei ⋅ Zhiheng Ma
ExHall D Poster #373
BiLoRA: Almost-Orthogonal Parameter Spaces for Continual Learning Poster Session 5
Hao Zhu ⋅ Yifei Zhang ⋅ Junhao Dong ⋅ Piotr Koniusz
ExHall D Poster #437
DV-Matcher: Deformation-based Non-rigid Point Cloud Matching Guided by Pre-trained Visual Features Poster Session 6
Zhangquan Chen ⋅ Puhua Jiang ⋅ Ruqi Huang
ExHall D Poster #106
Reasoning Mamba: Hypergraph-Guided Region Relation Calculating for Weakly Supervised Affordance Grounding Poster Session 6
Yuxuan Wang ⋅ Aming Wu ⋅ Muli Yang ⋅ Yukuan Min ⋅ Yihang Zhu ⋅ Cheng Deng
ExHall D Poster #139
Adapter Merging with Centroid Prototype Mapping for Scalable Class-Incremental Learning Poster Session 1
Takuma Fukuda ⋅ Hiroshi Kera ⋅ Kazuhiko Kawamoto
ExHall D Poster #451
OpenSDI: Spotting Diffusion-Generated Images in the Open World Poster Session 1
Yabin Wang ⋅ Zhiwu Huang ⋅ Xiaopeng Hong
ExHall D Poster #393
Post-pre-training for Modality Alignment in Vision-Language Foundation Models Poster Session 1
Shin'ya Yamaguchi ⋅ Dewei Feng ⋅ Sekitoshi Kanai ⋅ Kazuki Adachi ⋅ Daiki Chijiwa
ExHall D Poster #390
Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention Poster Session 3
Soikat Hasan Ahmed ⋅ Jan Finkbeiner ⋅ Emre Neftci
ExHall D Poster #317
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces Poster Session 1
Sumit Chaturvedi ⋅ Mengwei Ren ⋅ Yannick Hold-Geoffroy ⋅ Jingyuan Liu ⋅ Julie Dorsey ⋅ ZHIXIN SHU
ExHall D Poster #19
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language Poster Session 4
Yicheng Chen ⋅ Xiangtai Li ⋅ Yining Li ⋅ Yanhong Zeng ⋅ Jianzong Wu ⋅ Xiangyu Zhao ⋅ Kai Chen
ExHall D Poster #395
BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models Poster Session 6
Zenghui Yuan ⋅ Jiawen Shi ⋅ Pan Zhou ⋅ Neil Zhenqiang Gong ⋅ Lichao Sun
ExHall D Poster #361
NeISF++: Neural Incident Stokes Field for Polarized Inverse Rendering of Conductors and Dielectrics Poster Session 6
Chenhao Li ⋅ Taishi Ono ⋅ Takeshi Uemori ⋅ Sho Nitta ⋅ Hajime Mihara ⋅ Alexander Gatto ⋅ Hajime Nagahara ⋅ Yusuke Moriuchi
ExHall D Poster #31
MeshArt: Generating Articulated Meshes with Structure-Guided Transformers Poster Session 1
Daoyi Gao ⋅ Mohd Yawar Nihal Siddiqui ⋅ Lei Li ⋅ Angela Dai
ExHall D Poster #42
Non-Natural Image Understanding with Advancing Frequency-based Vision Encoders Poster Session 6
Wang Lin ⋅ Qingsong Wang ⋅ Yueying Feng ⋅ Shulei Wang ⋅ Tao Jin ⋅ Zhou Zhao ⋅ Fei Wu ⋅ Chang Yao ⋅ Jingyuan Chen
ExHall D Poster #346
SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving Poster Session 6
Su Sun ⋅ Cheng Zhao ⋅ Zhuoyang Sun ⋅ Yingjie Chen ⋅ Mei Chen
ExHall D Poster #127
AesthetiQ: Enhancing Graphic Layout Design via Aesthetic-Aware Preference Alignment of Multi-modal Large Language Models Poster Session 5
Sohan Patnaik ⋅ Rishabh Jain ⋅ Balaji Krishnamurthy ⋅ Mausoom Sarkar
ExHall D Poster #255
Enhanced then Progressive Fusion with View Graph for Multi-View Clustering Poster Session 3
Zhibin Dong ⋅ Meng Liu ⋅ Siwei Wang ⋅ KE LIANG ⋅ Yi Zhang ⋅ Suyuan Liu ⋅ Jiaqi Jin ⋅ Xinwang Liu ⋅ En Zhu
ExHall D Poster #466
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity Poster Session 5
Hang Hua ⋅ Qing Liu ⋅ Lingzhi Zhang ⋅ Jing Shi ⋅ Soo Ye Kim ⋅ Zhifei Zhang ⋅ Yilin Wang ⋅ Jianming Zhang ⋅ Zhe Lin ⋅ Jiebo Luo
ExHall D Poster #357
Adaptive Non-Uniform Timestep Sampling for Accelerating Diffusion Model Training Poster Session 1
Myunsoo Kim ⋅ Donghyeon Ki ⋅ Seong-Woong Shim ⋅ Byung-Jun Lee
ExHall D Poster #225
Navigating the Unseen: Zero-shot Scene Graph Generation via Capsule-Based Equivariant Features Poster Session 6
Wenhuan Huang ⋅ Yi Ji ⋅ Guiqian Zhu ⋅ Ying Li ⋅ Chunping Liu
ExHall D Poster #315
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea Poster Session 6
Qifan Yu ⋅ Wei Chow ⋅ Zhongqi Yue ⋅ Kaihang Pan ⋅ Yang Wu ⋅ Xiaoyang Wan ⋅ Juncheng Li ⋅ Siliang Tang ⋅ Hanwang Zhang ⋅ Yueting Zhuang
ExHall D Poster #214
Compass Control: Multi Object Orientation Control for Text-to-Image Generation Poster Session 1
Rishubh Parihar ⋅ Vaibhav Agrawal ⋅ Sachidanand VS ⋅ R. Venkatesh Babu
ExHall D Poster #252
Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly Detection Poster Session 4
Farzad Beizaee ⋅ Gregory A. Lodygensky ⋅ Christian Desrosiers ⋅ Jose Dolz
ExHall D Poster #314
Continuous 3D Perception Model with Persistent State Poster Session 3
Qianqian Wang ⋅ Yifei Zhang ⋅ Aleksander Holynski ⋅ Alexei A. Efros ⋅ Angjoo Kanazawa
ExHall D Poster #77
Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation Poster Session 2
Yiheng Li ⋅ Yang Yang ⋅ Zichang Tan ⋅ Huan Liu ⋅ Weihua Chen ⋅ Xu Zhou ⋅ Zhen Lei
ExHall D Poster #369
DNF: Unconditional 4D Generation with Dictionary-based Neural Fields Poster Session 6
Xinyi Zhang ⋅ Naiqi Li ⋅ Angela Dai
ExHall D Poster #12
Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging Poster Session 3
Bo Wang ⋅ Dingwei Tan ⋅ Yen-Ling Kuo ⋅ Zhaowei Sun ⋅ Jeremy M Wolfe ⋅ Tat-Jen Cham ⋅ Mengmi Zhang
ExHall D Poster #398
ARM: Appearance Reconstruction Model for Relightable 3D Generation Poster Session 5
Xiang Feng ⋅ Chang Yu ⋅ Zoubin Bi ⋅ Yintong Shang ⋅ Feng Gao ⋅ Hongzhi Wu ⋅ Kun Zhou ⋅ Chenfanfu Jiang ⋅ Yin Yang
ExHall D Poster #36
Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels Poster Session 5
Yongshuo Zong ⋅ Qin ZHANG ⋅ DONGSHENG An ⋅ Zhihua Li ⋅ Xiang Xu ⋅ Linghan Xu ⋅ Zhuowen Tu ⋅ Yifan Xing ⋅ Onkar Dabeer
ExHall D Poster #345
Structure-from-Motion with a Non-Parametric Camera Model Poster Session 1
Yihan Wang ⋅ Linfei Pan ⋅ Marc Pollefeys ⋅ Viktor Larsson
ExHall D Poster #81
EventPSR: Surface Normal and Reflectance Estimation from Photometric Stereo Using an Event Camera Poster Session 3
Bohan Yu ⋅ Jin Han ⋅ Boxin Shi ⋅ Imari Sato
ExHall D Poster #73
LAL: Enhancing 3D Human Motion Prediction with Latency-aware Auxiliary Learning Poster Session 2
Xiaoning Sun ⋅ Dong Wei ⋅ Huaijiang Sun ⋅ Shengxiang Hu
ExHall D Poster #167
RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects Poster Session 2
Jaeguk Kim ⋅ Jaewoo Park ⋅ Keuntek Lee ⋅ Nam Ik Cho
ExHall D Poster #102
Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions Poster Session 1
Ting-Hsuan Liao ⋅ Yi Zhou ⋅ Yu Shen ⋅ Chun-Hao P. Huang ⋅ Saayan Mitra ⋅ Jia-Bin Huang ⋅ Uttaran Bhattacharya
ExHall D Poster #162
Generating 3D-Consistent Videos from Unposed Internet Photos Poster Session 6
Gene Chou ⋅ Kai Zhang ⋅ Sai Bi ⋅ Hao Tan ⋅ Zexiang Xu ⋅ Fujun Luan ⋅ Bharath Hariharan ⋅ Noah Snavely
ExHall D Poster #168
GRAE-3DMOT: Geometry Relation-Aware Encoder for Online 3D Multi-Object Tracking Poster Session 3
Hyunseop Kim ⋅ Hyo-Jun Lee ⋅ Yonguk Lee ⋅ Jinu Lee ⋅ Hanul Kim ⋅ Yeong Jun Koh
ExHall D Poster #101
Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression Poster Session 3
Xiaoyi Qu ⋅ David Aponte ⋅ Colby Banbury ⋅ Daniel Robinson ⋅ Tianyu Ding ⋅ Kazuhito Koishida ⋅ Ilya Zharkov ⋅ Tianyi Chen
ExHall D Poster #439
LIRM: Large Inverse Rendering Model for Progressive Reconstruction of Shape, Materials and View-dependent Radiance Fields Poster Session 1
Zhengqin Li ⋅ Dilin Wang ⋅ Ka Chen ⋅ Zhaoyang Lv ⋅ Thu Nguyen-Phuoc ⋅ Milim Lee ⋅ Jia-Bin Huang ⋅ Lei Xiao ⋅ Yufeng Zhu ⋅ Carl Marshall ⋅ Carl Ren ⋅ Richard Newcombe ⋅ Zhao Dong
ExHall D Poster #32
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models Poster Session 1
Jinho Jeong ⋅ Sangmin Han ⋅ Jinwoo Kim ⋅ Seon Joo Kim
ExHall D Poster #206
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations Poster Session 5
Ziqiao Peng ⋅ Yanbo Fan ⋅ Haoyu Wu ⋅ Xuan Wang ⋅ Hongyan Liu ⋅ Jun He ⋅ Zhaoxin Fan
ExHall D Poster #1
MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation Poster Session 3
Jinnan Chen ⋅ Lingting Zhu ⋅ Zeyu HU ⋅ Shengju Qian ⋅ Yugang Chen ⋅ Xin Wang ⋅ Gim Hee Lee
ExHall D Poster #41
Dynamic Group Normalization: Spatio-Temporal Adaptation to Evolving Data Statistics Poster Session 6
Yair Smadar ⋅ Assaf Hoogi
ExHall D Poster #385
Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects Poster Session 4
Shalini Maiti ⋅ Lourdes Agapito ⋅ Filippos Kokkinos
ExHall D Poster #265
DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation Poster Session 4
Bo-Wen Yin ⋅ Jiao-Long Cao ⋅ Ming-Ming Cheng ⋅ Qibin Hou
ExHall D Poster #338
GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation Poster Session 2
Ruihai Wu ⋅ Ziyu Zhu ⋅ Yuran Wang ⋅ Yue Chen ⋅ Jiarui Wang ⋅ Hao Dong
ExHall D Poster #151
GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling Poster Session 4
Yang Zheng ⋅ Menglei Chai ⋅ Delio Vicini ⋅ Yuxiao Zhou ⋅ Yinghao Xu ⋅ Leonidas Guibas ⋅ Gordon Wetzstein ⋅ Thabo Beeler
ExHall D Poster #17
Improving Editability in Image Generation with Layer-wise Memory Poster Session 2
Daneul Kim ⋅ Jaeah Lee ⋅ Jaesik Park
ExHall D Poster #241
DIO: Decomposable Implicit 4D Occupancy-Flow World Model Poster Session 6
Christopher Diehl ⋅ Quinlan Sykora ⋅ Ben Agro ⋅ Thomas Gilles ⋅ Sergio Casas ⋅ Raquel Urtasun
ExHall D Poster #124
A Flag Decomposition for Hierarchical Datasets Poster Session 4
Nathan Mankovich ⋅ Ignacio Santamaria ⋅ Gustau Camps-Valls ⋅ Tolga Birdal
ExHall D Poster #282
RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions Poster Session 3
Shihang Du ⋅ Sanqing Qu ⋅ Tianhang Wang ⋅ Xudong Zhang ⋅ Yunwei Zhu ⋅ Jian Mao ⋅ Fan Lu ⋅ Qiao Lin ⋅ Guang Chen
ExHall D Poster #122
Olympus: A Universal Task Router for Computer Vision Tasks Poster Session 3
Yuanze Lin ⋅ Yunsheng Li ⋅ Dongdong Chen ⋅ Weijian Xu ⋅ Ronald Clark ⋅ Philip H.S. Torr
ExHall D Poster #343
ACL: Activating Capability of Linear Attention for Image Restoration Poster Session 4
Yubin Gu ⋅ Yuan Meng ⋅ Jiayi Ji ⋅ Xiaoshuai Sun
ExHall D Poster #201
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration Poster Session 6
Sudarshan Rajagopalan ⋅ Nithin Gopalakrishnan Nair ⋅ Jay Paranjape ⋅ Vishal M. Patel
ExHall D Poster #189
Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction Poster Session 2
Wenke Xia ⋅ Ruoxuan Feng ⋅ Dong Wang ⋅ Di Hu
ExHall D Poster #154
The Power of Context: How Multimodality Improves Image Super-Resolution Poster Session 5
Kangfu Mei ⋅ Vishal M. Patel ⋅ Mojtaba Sahraee-Ardakan ⋅ Hossein Talebi ⋅ Peyman Milanfar ⋅ Mauricio Delbracio
ExHall D Poster #198
MARBLE: Material Recomposition and Blending in CLIP-Space Poster Session 3
Ta-Ying Cheng ⋅ Prafull Sharma ⋅ Mark Boss ⋅ Varun Jampani
ExHall D Poster #230
EventFly: Event Camera Perception from Ground to the Sky Poster Session 1
Lingdong Kong ⋅ Dongyue Lu ⋅ Xiang Xu ⋅ Lai Xing Ng ⋅ Wei Tsang Ooi ⋅ Benoit Cottereau
ExHall D Poster #122
Detect Any Mirrors: Boosting Learning Reliability on Large-Scale Unlabeled Data with an Iterative Data Engine Poster Session 5
Zhaohu Xing ⋅ Lihao Liu ⋅ Yijun Yang ⋅ Hongqiu Wang ⋅ Tian Ye ⋅ Sixiang Chen ⋅ Wenxue Li ⋅ Guang Liu ⋅ Lei Zhu
ExHall D Poster #423
CH3Depth: Efficient and Flexible Depth Foundation Model with Flow Matching Poster Session 2
Jiaqi Li ⋅ Yiran Wang ⋅ Jinghong Zheng ⋅ Junrui Zhang ⋅ Liao Shen ⋅ Tianqi Liu ⋅ Zhiguo Cao
ExHall D Poster #178
ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics Poster Session 6
Junchao Zhu ⋅ Ruining Deng ⋅ Tianyuan Yao ⋅ Juming Xiong ⋅ Chongyu Qu ⋅ Junlin Guo ⋅ Siqi Lu ⋅ Mengmeng Yin ⋅ Yu Wang ⋅ Shilin Zhao ⋅ Haichun Yang ⋅ Yuankai Huo
ExHall D Poster #448
Efficient Visual State Space Model for Image Deblurring Poster Session 3
Lingshun Kong ⋅ Jiangxin Dong ⋅ Jinhui Tang ⋅ Ming-Hsuan Yang ⋅ Jinshan Pan
ExHall D Poster #197
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Poster Session 5
Wanhua Li ⋅ Renping Zhou ⋅ Jiawei Zhou ⋅ Yingwei Song ⋅ Johannes Herter ⋅ Minghan Qin ⋅ Gao Huang ⋅ Hanspeter Pfister
ExHall D Poster #91
MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking Poster Session 2
Xinqi Liu ⋅ Li Zhou ⋅ Zikun Zhou ⋅ Jianqiu Chen ⋅ Zhenyu He
ExHall D Poster #321
Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels Poster Session 3
Pierre Vuillecard ⋅ Jean-Marc Odobez
ExHall D Poster #273
MotionMap: Representing Multimodality in Human Pose Forecasting Poster Session 5
Reyhaneh Hosseininejad ⋅ Megh Shukla ⋅ Saeed Saadatnejad ⋅ Mathieu Salzmann ⋅ Alex Alahi
ExHall D Poster #154
Learning-enabled Polynomial Lyapunov Function Synthesis via High-Accuracy Counterexample-Guided Framework Poster Session 2
Hanrui Zhao ⋅ Niuniu Qi ⋅ Mengxin Ren ⋅ Banglong Liu ⋅ Shuming Shi ⋅ Zhengfeng Yang
ExHall D Poster #467
GaussianSpa: An “Optimizing-Sparsifying” Simplification Framework for Compact and High-Quality 3D Gaussian Splatting Poster Session 6
Yangming Zhang ⋅ Wenqi Jia ⋅ Wei Niu ⋅ Miao Yin
ExHall D Poster #49
Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views Poster Session 3
Jiang Wu ⋅ Rui Li ⋅ Yu Zhu ⋅ Rong Guo ⋅ Jinqiu Sun ⋅ Yanning Zhang
ExHall D Poster #62
ASHiTA: Automatic Scene-grounded HIerarchical Task Analysis Poster Session 6
Yun Chang ⋅ Leonor Fermoselle ⋅ Duy Ta ⋅ Bernadette Bucher ⋅ Luca Carlone ⋅ Jiuguang Wang
ExHall D Poster #316
Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language Model Poster Session 1
Ziyuan Yang ⋅ Yingyu Chen ⋅ Zhiwen Wang ⋅ Hongming Shan ⋅ Yang Chen ⋅ Yi Zhang
ExHall D Poster #477
Exploiting Deblurring Networks for Radiance Fields Poster Session 2
Haeyun Choi ⋅ Heemin Yang ⋅ Janghyeok Han ⋅ Sunghyun Cho
ExHall D Poster #54
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection Poster Session 2
Yifan Chang ⋅ Junjie Huang ⋅ Xiaofeng Wang ⋅ Yun Ye ⋅ Zhujin LIANG ⋅ Yi Shan ⋅ Dalong Du ⋅ Xingang Wang
ExHall D Poster #137
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Poster Session 4
Zixuan Huang ⋅ Mark Boss ⋅ Aaryaman Vasishta ⋅ James Rehg ⋅ Varun Jampani
ExHall D Poster #99
Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models Poster Session 6
Yan Xie ⋅ Zequn Zeng ⋅ Hao Zhang ⋅ Yucheng Ding ⋅ Yi Wang ⋅ Zhengjue Wang ⋅ Bo Chen ⋅ Hongwei Liu
ExHall D Poster #388
Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation Poster Session 4
Xiaoying Xing ⋅ Avinab Saha ⋅ Junfeng He ⋅ Susan Hao ⋅ Paul Vicol ⋅ Moonkyung Ryu ⋅ Gang Li ⋅ Sahil Singla ⋅ Sarah Young ⋅ Yinxiao Li ⋅ Feng Yang ⋅ Deepak Ramachandran
ExHall D Poster #259
PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds Poster Session 2
Barza Nisar ⋅ Steven L. Waslander
ExHall D Poster #124
Bringing CLIP to the Clinic: Dynamic Soft Labels and Negation-Aware Learning for Medical Analysis Poster Session 5
Hanbin Ko ⋅ Chang Min Park
ExHall D Poster #467
Label Shift Meets Online Learning: Ensuring Consistent Adaptation with Universal Dynamic Regret Poster Session 3
Yucong Dai ⋅ Shilin Gu ⋅ Ruidong Fan ⋅ Chao Xu ⋅ Chenping Hou
ExHall D Poster #454
A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation Poster Session 5
Zheng Zhang ⋅ Guanchun Yin ⋅ Bo Zhang ⋅ Wu Liu ⋅ Xiuzhuang Zhou ⋅ Wendong Wang
ExHall D Poster #471
ModeSeq: Taming Sparse Multimodal Motion Prediction with Sequential Mode Modeling Poster Session 1
Zikang Zhou ⋅ Hengjian Zhou ⋅ Haibo Hu ⋅ Zihao WEN ⋅ Jianping Wang ⋅ Yung-Hui Li ⋅ Yu-Kai Huang
ExHall D Poster #135
Quaffure: Real-Time Quasi-Static Neural Hair Simulation Poster Session 1
Tuur Stuyck ⋅ Gene Wei-Chin Lin ⋅ Egor Larionov ⋅ Hsiaoyu Chen ⋅ Aljaž Božič ⋅ Nikolaos Sarafianos ⋅ Doug Roble
ExHall D Poster #7
From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport Poster Session 5
Quentin Bouniot ⋅ Ievgen Redko ⋅ Anton Mallasto ⋅ Charlotte Laclau ⋅ Oliver Struckmeier ⋅ Karol Arndt ⋅ Markus Heinonen ⋅ Ville Kyrki ⋅ Samuel Kaski
ExHall D Poster #402
DepthSplat: Connecting Gaussian Splatting and Depth Poster Session 4
Haofei Xu ⋅ Songyou Peng ⋅ Fangjinhua Wang ⋅ Hermann Blum ⋅ Daniel Barath ⋅ Andreas Geiger ⋅ Marc Pollefeys
ExHall D Poster #58
FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models Poster Session 6
Haokun Chen ⋅ Hang Li ⋅ Yao Zhang ⋅ Jinhe Bi ⋅ Gengyuan Zhang ⋅ Yueqi Zhang ⋅ Philip H.S. Torr ⋅ Jindong Gu ⋅ Denis Krompaß ⋅ Volker Tresp
ExHall D Poster #411
Dynamic Camera Poses and Where to Find Them Poster Session 3
Chris Rockwell ⋅ Joseph Tung ⋅ Tsung-Yi Lin ⋅ Ming-Yu Liu ⋅ David Fouhey ⋅ Chen-Hsuan Lin
ExHall D Poster #171
GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation Poster Session 6
Weihang Li ⋅ Hongli XU ⋅ Junwen Huang ⋅ HyunJun Jung ⋅ Kuan-Ting Yu ⋅ Nassir Navab ⋅ Benjamin Busam
ExHall D Poster #96
OmniGen: Unified Image Generation Poster Session 3
Shitao Xiao ⋅ Yueze Wang ⋅ Junjie Zhou ⋅ Huaying Yuan ⋅ Xingrun Xing ⋅ Ruiran Yan ⋅ Chaofan Li ⋅ Shuting Wang ⋅ Tiejun Huang ⋅ Zheng Liu
ExHall D Poster #252
QuCOOP: A Versatile Framework for Solving Composite and Binary-Parametrised Problems on Quantum Annealers Poster Session 3
Natacha Kuete Meli ⋅ Vladislav Golyanik ⋅ Marcel Seelbach Benkner ⋅ Michael Moeller
ExHall D Poster #70
FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images Poster Session 1
Rong Wang ⋅ Fabian Prada ⋅ Ziyan Wang ⋅ Zhongshi Jiang ⋅ Chengxiang Yin ⋅ Junxuan Li ⋅ Shunsuke Saito ⋅ Igor Santesteban ⋅ Javier Romero ⋅ Rohan Joshi ⋅ Hongdong Li ⋅ Jason Saragih ⋅ Yaser Sheikh
ExHall D Poster #11
ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning Poster Session 1
Zhenyang Liu ⋅ Yikai Wang ⋅ Sixiao Zheng ⋅ Tongying Pan ⋅ Longfei Liang ⋅ Yanwei Fu ⋅ Xiangyang Xue
ExHall D Poster #338
Cropper: Vision-Language Model for Image Cropping through In-Context Learning Poster Session 6
Seung Hyun Lee ⋅ Jijun Jiang ⋅ Yiran Xu ⋅ Zhuofang Li ⋅ Junjie Ke ⋅ Yinxiao Li ⋅ Junfeng He ⋅ Steven Hickson ⋅ Katie Datsenko ⋅ Sangpil Kim ⋅ Ming-Hsuan Yang ⋅ Irfan Essa ⋅ Feng Yang
ExHall D Poster #369
Advancing Adversarial Robustness in GNeRFs: The IL2-NeRF Attack Poster Session 4
Nicole Meng ⋅ Caleb Manicke ⋅ Ronak Sahu ⋅ Caiwen Ding ⋅ Yingjie Lao
ExHall D Poster #52
Evolving High-Quality Rendering and Reconstruction in a Unified Framework with Contribution-Adaptive Regularization Poster Session 4
You Shen ⋅ Zhipeng Zhang ⋅ Xinyang Li ⋅ Yansong Qu ⋅ Yu Lin ⋅ Shengchuan Zhang ⋅ Liujuan Cao
ExHall D Poster #48
Language-Guided Salient Object Ranking Poster Session 6
Fang Liu ⋅ Yuhao Liu ⋅ Ke Xu ⋅ Shuquan Ye ⋅ Gerhard Hancke ⋅ Rynson W.H. Lau
ExHall D Poster #350
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation Poster Session 3
Weinan Jia ⋅ Mengqi Huang ⋅ Nan Chen ⋅ Lei Zhang ⋅ Zhendong Mao
ExHall D Poster #211
Inference-Scale Complexity in ANN-SNN Conversion for High-Performance and Low-Power Applications Poster Session 5
Tong Bu ⋅ Maohua Li ⋅ Zhaofei Yu
ExHall D Poster #321
NoiseCtrl: A Sampling-Algorithm-Agnostic Conditional Generation Method for Diffusion Models Poster Session 4
Longquan Dai ⋅ He Wang ⋅ Jinhui Tang
ExHall D Poster #218
Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation Poster Session 6
Tanner Schmidt ⋅ Richard Newcombe
ExHall D Poster #313
TCFG: Tangential Damping Classifier-free Guidance Poster Session 1
Mingi Kwon ⋅ Shin seong Kim ⋅ Jaeseok Jeong ⋅ Yi-Ting Hsiao ⋅ Youngjung Uh
ExHall D Poster #235
AutoPresent: Designing Structured Visuals from Scratch Poster Session 1
Jiaxin Ge ⋅ Zora Zhiruo Wang ⋅ Xuhui Zhou ⋅ Yi-Hao Peng ⋅ Sanjay Subramanian ⋅ Qinyue Tan ⋅ Maarten Sap ⋅ Alane Suhr ⋅ Daniel Fried ⋅ Graham Neubig ⋅ Trevor Darrell
ExHall D Poster #262
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models Poster Session 5
Chenyu Yang ⋅ Xuan Dong ⋅ Xizhou Zhu ⋅ Weijie Su ⋅ Jiahao Wang ⋅ Hao Tian ⋅ Zhe Chen ⋅ Wenhai Wang ⋅ Lewei Lu ⋅ Jifeng Dai
ExHall D Poster #373
KMD: Koopman Multi-modality Decomposition for Generalized Brain Tumor Segmentation under Incomplete Modalities Poster Session 3
Tianyi Liu ⋅ Haochuan Jiang ⋅ Kaizhu Huang
ExHall D Poster #480
LongDiff: Training-Free Long Video Generation in One Go Poster Session 4
Zhuoling Li ⋅ Hossein Rahmani ⋅ Qiuhong Ke ⋅ Jun Liu
ExHall D Poster #189
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation Poster Session 5
Yuyang Peng ⋅ Shishi Xiao ⋅ Keming Wu ⋅ Qisheng Liao ⋅ Bohan CHEN ⋅ Kevin Lin ⋅ Danqing Huang ⋅ Ji Li ⋅ Yuhui Yuan
ExHall D Poster #247
LSNet: See Large, Focus Small Poster Session 2
Ao Wang ⋅ Hui Chen ⋅ Zijia Lin ⋅ Jungong Han ⋅ Guiguang Ding
ExHall D Poster #414
DORNet: A Degradation Oriented and Regularized Network for Blind Depth Super-Resolution Poster Session 4
Zhengxue Wang ⋅ Zhiqiang Yan ⋅ Jinshan Pan ⋅ Guangwei Gao ⋅ Kai Zhang ⋅ Jian Yang
ExHall D Poster #46
Unsupervised Continual Domain Shift Learning with Multi-Prototype Modeling Poster Session 2
Haopeng Sun ⋅ Yingwei Zhang ⋅ Lumin Xu ⋅ Sheng Jin ⋅ Ping Luo ⋅ Chen Qian ⋅ Wentao Liu ⋅ Yiqiang Chen
ExHall D Poster #453
Using Diffusion Priors for Video Amodal Segmentation Poster Session 5
Kaihua Chen ⋅ Deva Ramanan ⋅ Tarasha Khurana
ExHall D Poster #174
Augmented Deep Contexts for Spatially Embedded Video Coding Poster Session 1
Yifan Bian ⋅ Chuanbo Tang ⋅ Li Li ⋅ Dong Liu
ExHall D Poster #181
Homogeneous Dynamics Space for Heterogeneous Humans Poster Session 6
Xinpeng Liu ⋅ Junxuan Liang ⋅ Chenshuo Zhang ⋅ Zixuan Cai ⋅ Cewu Lu ⋅ Yonglu Li
ExHall D Poster #154
GazeGene: Large-scale Synthetic Gaze Dataset with 3D Eyeball Annotations Poster Session 4
Yiwei Bao ⋅ Zhiming Wang ⋅ Feng Lu
ExHall D Poster #283
ImViD: Immersive Volumetric Videos for Enhanced VR Engagement Poster Session 4
Zhengxian Yang ⋅ Shi Pan ⋅ Shengqi Wang ⋅ Haoxiang Wang ⋅ Li Lin ⋅ Guanjun Li ⋅ Zhengqi Wen ⋅ Borong Lin ⋅ Jianhua Tao ⋅ Tao Yu
ExHall D Poster #69
Real-IAD D³: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection Poster Session 3
Wenbing Zhu ⋅ Lidong Wang ⋅ Ziqing Zhou ⋅ Chengjie Wang ⋅ Yurui Pan ⋅ Ruoyi Zhang ⋅ Zhuhao Chen ⋅ Linjie Cheng ⋅ Bin-Bin Gao ⋅ Jiangning Zhang ⋅ Zhenye Gan ⋅ Yuxie Wang ⋅ Yulong Chen ⋅ Bruce Qian ⋅ Mingmin Chi ⋅ Bo Peng ⋅ Lizhuang Ma
ExHall D Poster #437
Attribute-formed Class-specific Concept Space: Endowing Language Bottleneck Model with Better Interpretability and Scalability Poster Session 6
Jianyang Zhang ⋅ Qianli Luo ⋅ Guowu Yang ⋅ Wenjing Yang ⋅ Weide Liu ⋅ Guosheng Lin ⋅ Fengmao Lv
ExHall D Poster #397
FSboard: Over 3 Million Characters of ASL Fingerspelling Collected via Smartphones Poster Session 3
Manfred Georg ⋅ Garrett Tanzer ⋅ Esha Uboweja ⋅ Saad Hassan ⋅ Maximus Shengelia ⋅ Sam Sepah ⋅ Sean Forbes ⋅ Thad Starner
ExHall D Poster #310
Adversarial Domain Prompt Tuning and Generation for Single Domain Generalization Poster Session 4
Zhipeng Xu ⋅ De Cheng ⋅ Xinyang Jiang ⋅ Nannan Wang ⋅ Dongsheng Li ⋅ Xinbo Gao
ExHall D Poster #268
Full-DoF Egomotion Estimation for Event Cameras Using Geometric Solvers Poster Session 3
Ji Zhao ⋅ Banglei Guan ⋅ Zibin Liu ⋅ Laurent Kneip
ExHall D Poster #83
PERSE: Personalized 3D Generative Avatars from A Single Portrait Poster Session 4
Hyunsoo Cha ⋅ Inhee Lee ⋅ Hanbyul Joo
ExHall D Poster #9
Animate and Sound an Image Poster Session 5
Xihua Wang ⋅ Ruihua Song ⋅ Chongxuan Li ⋅ Xin Cheng ⋅ Boyuan Li ⋅ Yihan Wu ⋅ Yuyue Wang ⋅ Hongteng Xu ⋅ Yunfeng Wang
ExHall D Poster #221
CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation Poster Session 5
Jungsoo Lee ⋅ Debasmit Das ⋅ Munawar Hayat ⋅ Sungha Choi ⋅ Kyuwoong Hwang ⋅ Fatih Porikli
ExHall D Poster #395
Volume Tells: Dual Cycle-Consistent Diffusion for 3D Fluorescence Microscopy De-noising and Super-Resolution Poster Session 4
ZELIN LI ⋅ Chenwei Wang ⋅ Zhaoke Huang ⋅ Centre for Intelligent Multidimensional Data Analysis ⋅ Hong Kong Baptist University ⋅ Hong Kong Baptist University ⋅ Hong Kong Baptist University
ExHall D Poster #23
SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models Poster Session 2
Zilan Wang ⋅ Junfeng Guo ⋅ Jiacheng Zhu ⋅ Yiming Li ⋅ Heng Huang ⋅ Muhao Chen ⋅ Zhengzhong Tu
ExHall D Poster #271
Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning Poster Session 1
Xueyi Ke ⋅ Satoshi Tsutsui ⋅ Yayun Zhang ⋅ Bihan Wen
ExHall D Poster #401
GauSTAR: Gaussian Surface Tracking and Reconstruction Poster Session 4
Chengwei Zheng ⋅ Lixin Xue ⋅ Juan Jose Zarate ⋅ Jie Song
ExHall D Poster #67
Latent Space Imaging Poster Session 6
Matheus Souza ⋅ Yidan Zheng ⋅ Kaizhang Kang ⋅ Yogeshwar Nath Mishra ⋅ Qiang Fu ⋅ Wolfgang Heidrich
ExHall D Poster #203
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion Poster Session 6
Zhaoxi Chen ⋅ Jiaxiang Tang ⋅ Yuhao Dong ⋅ Ziang Cao ⋅ Fangzhou Hong ⋅ Yushi Lan ⋅ Tengfei Wang ⋅ Haozhe Xie ⋅ Tong Wu ⋅ Shunsuke Saito ⋅ Liang Pan ⋅ Dahua Lin ⋅ Ziwei Liu
ExHall D Poster #40
Unraveling Normal Anatomy via Fluid-Driven Anomaly Randomization Poster Session 2
Peirong Liu ⋅ Ana Lawry Aguila ⋅ Juan Iglesias
ExHall D Poster #484
Differentiable Inverse Rendering with Interpretable Basis BRDFs Poster Session 1
Hoon-Gyu Chung ⋅ Seokjun Choi ⋅ Seung-Hwan Baek
ExHall D Poster #29
Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation Poster Session 2
Hongmei Yin ⋅ Tingliang Feng ⋅ Fan Lyu ⋅ Fanhua Shang ⋅ Hongying Liu ⋅ Wei Feng ⋅ Liang Wan
ExHall D Poster #425
ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion Poster Session 1
Nissim Maruani ⋅ Wang Yifan ⋅ Matthew Fisher ⋅ Pierre Alliez ⋅ Mathieu Desbrun
ExHall D Poster #41
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Poster Session 6
Ziang Yan ⋅ Zhilin Li ⋅ Yinan He ⋅ Chenting Wang ⋅ Kunchang Li ⋅ Xinhao Li ⋅ Xiangyu Zeng ⋅ Zilei Wang ⋅ Yali Wang ⋅ Yu Qiao ⋅ Limin Wang ⋅ Yi Wang
ExHall D Poster #357
Integral Fast Fourier Color Constancy Poster Session 6
Wenjun Wei ⋅ Yanlin Qian ⋅ Huaian Chen ⋅ Junkang Dai ⋅ Yi Jin
ExHall D Poster #22
LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians Poster Session 6
Jiamin WU ⋅ Kenkun Liu ⋅ Han Gao ⋅ Xiaoke Jiang ⋅ Yuan Yao ⋅ Lei Zhang
ExHall D Poster #46
DKC: Differentiated Knowledge Consolidation for Cloth-Hybrid Lifelong Person Re-identification Poster Session 1
Zhenyu Cui ⋅ Jiahuan Zhou ⋅ Yuxin Peng
ExHall D Poster #324
See Further When Clear: Curriculum Consistency Model Poster Session 4
Yunpeng Liu ⋅ Boxiao Liu ⋅ Yi Zhang ⋅ Xingzhong Hou ⋅ Guanglu Song ⋅ Yu Liu ⋅ Haihang You
ExHall D Poster #219
CaMuViD: Calibration-Free Multi-View Detection Poster Session 1
Amir Etefaghi Daryani ⋅ M. Usman Maqbool Bhutta ⋅ Byron Hernandez ⋅ Henry Medeiros
ExHall D Poster #98
Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks Poster Session 6
Wei-Jin Huang ⋅ Yuan-Ming Li ⋅ Zhi-Wei Xia ⋅ Yu-Ming Tang ⋅ Kun-Yu Lin ⋅ Jian-Fang Hu ⋅ Wei-Shi Zheng
ExHall D Poster #155
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction Poster Session 1
Dubing Chen ⋅ Huan Zheng ⋅ Jin Fang ⋅ Xingping Dong ⋅ Xianfei Li ⋅ Wenlong Liao ⋅ Tao He ⋅ Pai Peng ⋅ Jianbing Shen
ExHall D Poster #125
SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations Poster Session 1
Krispin Wandel ⋅ Hesheng Wang
ExHall D Poster #90
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation Poster Session 1
Ziyang Xie ⋅ Zhizheng Liu ⋅ Zhenghao Peng ⋅ Wayne Wu ⋅ Bolei Zhou
ExHall D Poster #132
A Unified Latent Schrödinger Bridge Diffusion Model for Unsupervised Anomaly Detection and Localization Poster Session 5
Shilhora Akshay ⋅ Niveditha Lakshmi Narasimhan ⋅ Jacob George ⋅ Vineeth Balasubramanian
ExHall D Poster #429
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly Poster Session 2
Yexin Liu ⋅ Zhengyang Liang ⋅ Yueze Wang ⋅ Xianfeng Wu ⋅ Feilong Tang ⋅ Muyang He ⋅ Jian Li ⋅ Zheng Liu ⋅ Harry Yang ⋅ Ser-Nam Lim ⋅ Bo Zhao
ExHall D Poster #355
Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning Poster Session 2
Dongyao Jiang ⋅ Haodong Jing ⋅ Yongqiang Ma ⋅ Nanning Zheng
ExHall D Poster #427
Towards Efficient Foundation Model for Zero-shot Amodal Segmentation Poster Session 4
Zhaochen Liu ⋅ Limeng Qiao ⋅ Xiangxiang Chu ⋅ Lin Ma ⋅ Tingting Jiang
ExHall D Poster #424
OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction Poster Session 2
Gehui Li ⋅ Bin Chen ⋅ Chen Zhao ⋅ Lei Zhang ⋅ Jian Zhang
ExHall D Poster #202
Synthetic Data is an Elegant GIFT for Continual Vision-Language Models Poster Session 1
Bin Wu ⋅ Wuxuan Shi ⋅ Jinqiao Wang ⋅ Mang Ye
ExHall D Poster #254
KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception Poster Session 1
Yunpeng Qu ⋅ Kun Yuan ⋅ Qizhi Xie ⋅ Ming Sun ⋅ Chao Zhou ⋅ Jian Wang
ExHall D Poster #186
ProbeSDF: Light Field Probes For Neural Surface Reconstruction Poster Session 3
Briac Toussaint ⋅ Diego Thomas ⋅ Jean-Sébastien Franco
ExHall D Poster #36
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation Poster Session 2
Silin Cheng ⋅ Yang Liu ⋅ Xinwei He ⋅ Sebastien Ourselin ⋅ Lei Tan ⋅ Luo
ExHall D Poster #363
Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection Poster Session 4
Jiahao Xu ⋅ Zikai Zhang ⋅ Rui Hu
ExHall D Poster #461
Learning Flow Fields in Attention for Controllable Person Image Generation Poster Session 1
Zijian Zhou ⋅ Shikun Liu ⋅ Xiao Han ⋅ Haozhe Liu ⋅ Kam Woh Ng ⋅ Tian Xie ⋅ Yuren Cong ⋅ Hang Li ⋅ Mengmeng Xu ⋅ Juan-Manuel Pérez-Rúa ⋅ Aditya Patel ⋅ Tao Xiang ⋅ Miaojing Shi ⋅ Sen He
ExHall D Poster #223
CacheQuant: Comprehensively Accelerated Diffusion Models Poster Session 5
Xuewen Liu ⋅ Zhikai Li ⋅ Qingyi Gu
ExHall D Poster #211
Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning Poster Session 4
Buzhen Huang ⋅ Chen Li ⋅ Chongyang Xu ⋅ Dongyue Lu ⋅ Jinnan Chen ⋅ Yangang Wang ⋅ Gim Hee Lee
ExHall D Poster #160
Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment Poster Session 1
Xudong Li ⋅ Wenjie Nie ⋅ Yan Zhang ⋅ Runze Hu ⋅ Ke Li ⋅ Xiawu Zheng ⋅ Liujuan Cao
ExHall D Poster #205
Generalizing Deepfake Video Detection with Plug-and-Play: Video-Level Blending and Spatiotemporal Adapter Tuning Poster Session 3
Zhiyuan Yan ⋅ Yandan Zhao ⋅ Shen Chen ⋅ Mingyi Guo ⋅ Xinghe Fu ⋅ Taiping Yao ⋅ Shouhong Ding ⋅ Yunsheng Wu ⋅ Li Yuan
ExHall D Poster #188
D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-based Affective Recognition Poster Session 4
Haoran Wang ⋅ Xinji Mai ⋅ Zeng Tao ⋅ Xuan Tong ⋅ Junxiong Lin ⋅ Yan Wang ⋅ Jiawen Yu ⋅ Shaoqi Yan ⋅ Ziheng Zhou ⋅ Wenqiang Zhang
ExHall D Poster #326
From Faces to Voices: Learning Hierarchical Representations for High-quality Video-to-Speech Poster Session 4
Jihoon Kim ⋅ Jeongsoo Choi ⋅ Jaehun Kim ⋅ Chaeyoung Jung ⋅ Joon Chung
ExHall D Poster #2
FreeCloth: Free-form Generation Enhances Challenging Clothed Human Modeling Poster Session 4
Hang Ye ⋅ Xiaoxuan Ma ⋅ Hai Ci ⋅ Wentao Zhu ⋅ Yizhou Wang
ExHall D Poster #12
SnowMaster: Comprehensive Real-world Image Desnowing via MLLM with Multi-Model Feedback Optimization Poster Session 1
Jianyu LAI ⋅ Sixiang Chen ⋅ Yunlong Lin ⋅ Tian Ye ⋅ Yun Liu ⋅ Song Fei ⋅ Zhaohu Xing ⋅ Hongtao Wu ⋅ Weiming Wang ⋅ Lei Zhu
ExHall D Poster #394
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension Poster Session 3
Xiaofu Chen ⋅ Yaxin Luo ⋅ Luo ⋅ Jiayi Ji ⋅ Henghui Ding ⋅ Yiyi Zhou
ExHall D Poster #353
The Impact Label Noise and Choice of Threshold has on Cross-Entropy and Soft-Dice in Image Segmentation Poster Session 4
Marcus Nordström ⋅ Atsuto Maki ⋅ Henrik Hult
ExHall D Poster #477
Open-World Objectness Modeling Unifies Novel Object Detection Poster Session 6
Shan Zhang ⋅ Yao Ni ⋅ Jinhao Du ⋅ Yuan Xue ⋅ Philip H.S. Torr ⋅ Piotr Koniusz ⋅ Anton van den Hengel
ExHall D Poster #401
LLaVA-Critic: Learning to Evaluate Multimodal Models Poster Session 3
Tianyi Xiong ⋅ Xiyao Wang ⋅ Dong Guo ⋅ Qinghao Ye ⋅ Haoqi Fan ⋅ Quanquan Gu ⋅ Heng Huang ⋅ Chunyuan Li
ExHall D Poster #283
MotionPRO: Exploring the Role of Pressure in Human MoCap and Beyond Poster Session 6
Shenghao Ren ⋅ Yi Lu ⋅ Jiayi Huang ⋅ Jiayi Zhao ⋅ He Zhang ⋅ Tao Yu ⋅ Qiu Shen ⋅ Xun Cao
ExHall D Poster #152
Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion Poster Session 5
Songsong Yu ⋅ Yuxin Chen ⋅ Zhongang Qi ⋅ Zeke Xie ⋅ Yifan Wang ⋅ Lijun Wang ⋅ Ying Shan ⋅ Huchuan Lu
ExHall D Poster #76
Towards Open-Vocabulary Audio-Visual Event Localization Poster Session 2
Jinxing Zhou ⋅ Dan Guo ⋅ Ruohao Guo ⋅ Yuxin Mao ⋅ Jingjing Hu ⋅ Yiran Zhong ⋅ Xiaojun Chang ⋅ Meng Wang
ExHall D Poster #286
One-shot 3D Object Canonicalization based on Geometric and Semantic Consistency Poster Session 4
Li Jin ⋅ Yujie Wang ⋅ Wenzheng Chen ⋅ Qiyu Dai ⋅ Qingzhe Gao ⋅ Xueying Qin ⋅ Baoquan Chen
ExHall D Poster #98
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning Poster Session 3
Hanxun Yu ⋅ Wentong Li ⋅ Song Wang ⋅ Junbo Chen ⋅ Jianke Zhu
ExHall D Poster #335
HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution Poster Session 1
Yuxuan Jiang ⋅ Ho Man Kwan ⋅ Jasmine Peng ⋅ Ge Gao ⋅ Fan Zhang ⋅ Xiaoqing Zhu ⋅ Joel Sole ⋅ David Bull
ExHall D Poster #200
Motion Prompting: Controlling Video Generation with Motion Trajectories Poster Session 1
Daniel Geng ⋅ Charles Herrmann ⋅ Junhwa Hur ⋅ Forrester Cole ⋅ Serena Zhang ⋅ Tobias Pfaff ⋅ Tatiana Lopez-Guevara ⋅ Yusuf Aytar ⋅ Michael Rubinstein ⋅ Chen Sun ⋅ Oliver Wang ⋅ Andrew Owens ⋅ Deqing Sun
ExHall D Poster #173
VERA: Explainable Video Anomaly Detection via Verbalized Learning of Vision-Language Models Poster Session 2
Muchao Ye ⋅ Weiyang Liu ⋅ Pan He
ExHall D Poster #316
ProHOC: Probabilistic Hierarchical Out-of-Distribution Classification via Multi-Depth Networks Poster Session 4
Erik Wallin ⋅ Fredrik Kahl ⋅ Lars Hammarstrand
ExHall D Poster #457
CCIN: Compositional Conflict Identification and Neutralization for Composed Image Retrieval Poster Session 1
Likai Tian ⋅ Jian Zhao ⋅ Zechao Hu ⋅ Zhengwei Yang ⋅ Hao Li ⋅ Lei Jin ⋅ Zheng Wang ⋅ Xuelong Li
ExHall D Poster #362
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP Poster Session 3
Songlong Xing ⋅ Zhengyu Zhao ⋅ Nicu Sebe
ExHall D Poster #433
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels Poster Session 1
Meng Lou ⋅ Yizhou Yu
ExHall D Poster #395
Enhancing Adversarial Transferability with Checkpoints of a Single Model’s Training Poster Session 4
Shixin Li ⋅ Chaoxiang He ⋅ Xiaojing Ma ⋅ Bin Benjamin Zhu ⋅ Shuo Wang ⋅ Hongsheng Hu ⋅ Dongmei Zhang ⋅ Linchen Yu
ExHall D Poster #464
POSTA: A Go-to Framework for Customized Artistic Poster Generation Poster Session 6
Haoyu Chen ⋅ Xiaojie Xu ⋅ Wenbo Li ⋅ Jingjing Ren ⋅ Tian Ye ⋅ Songhua Liu ⋅ Ying-Cong Chen ⋅ Lei Zhu ⋅ Xinchao Wang
ExHall D Poster #241
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models Poster Session 6
Byung-Kwan Lee ⋅ Ryo Hachiuma ⋅ Yu-Chiang Frank Wang ⋅ Yong Man Ro ⋅ Yueh-Hua Wu
ExHall D Poster #324
Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality Poster Session 2
Liyan Chen ⋅ Gregory P. Meyer ⋅ Zaiwei Zhang ⋅ Eric M. Wolff ⋅ Paul Vernaza
ExHall D Poster #117
Task-Agnostic Guided Feature Expansion for Class-Incremental Learning Poster Session 2
Bowen Zheng ⋅ Da-Wei Zhou ⋅ Han-Jia Ye ⋅ De-Chuan Zhan
ExHall D Poster #450
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Poster Session 4
Kevin Qinghong Lin ⋅ Linjie Li ⋅ Difei Gao ⋅ Zhengyuan Yang ⋅ Shiwei Wu ⋅ Zechen Bai ⋅ Stan Weixian Lei ⋅ Lijuan Wang ⋅ Mike Zheng Shou
ExHall D Poster #352
Twinner: Shining Light on Digital Twins in a Few Snaps Poster Session 2
Jesus Zarzar ⋅ Tom Monnier ⋅ Roman Shapovalov ⋅ Andrea Vedaldi ⋅ David Novotny
ExHall D Poster #39
Infinity∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis Poster Session 4
Jian Han ⋅ Jinlai Liu ⋅ Yi Jiang ⋅ Bin Yan ⋅ Yuqi Zhang ⋅ Zehuan Yuan ⋅ Bingyue Peng ⋅ Xiaobing Liu
ExHall D Poster #248
DreamText: High Fidelity Scene Text Synthesis Poster Session 6
Yibin Wang ⋅ Weizhong Zhang ⋅ Honghui Xu ⋅ Cheng Jin
ExHall D Poster #228
MVBoost: Boost 3D Reconstruction with Multi-View Refinement Poster Session 5
Xiangyu Liu ⋅ Xiaomei Zhang ⋅ Zhiyuan Ma ⋅ Xiangyu Zhu ⋅ Zhen Lei
ExHall D Poster #58
Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment Poster Session 1
Yang Bai ⋅ Yucheng Ji ⋅ Min Cao ⋅ Jinqiao Wang ⋅ Mang Ye
ExHall D Poster #360
Category-Agnostic Neural Object Rigging Poster Session 5
Guangzhao He ⋅ Chen Geng ⋅ Shangzhe Wu ⋅ Jiajun Wu
ExHall D Poster #98
AVF-MAE++: Scaling Affective Video Facial Masked Autoencoders via Efficient Audio-Visual Self-Supervised Learning Poster Session 2
Xuecheng Wu ⋅ Heli Sun ⋅ Yifan Wang ⋅ Jiayu Nie ⋅ Jie Zhang ⋅ Yabing Wang ⋅ Junxiao Xue ⋅ Liang He
ExHall D Poster #360
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding Poster Session 2
Yudong Han ⋅ Qingpei Guo ⋅ Liyuan Pan ⋅ Liu Liu ⋅ Yu Guan ⋅ Ming Yang
ExHall D Poster #299
POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation Poster Session 6
Lanyun Zhu ⋅ Tianrun Chen ⋅ Qianxiong Xu ⋅ Xuanyi Liu ⋅ Deyi Ji ⋅ Haiyang Wu ⋅ De Soh Soh ⋅ Jun Liu
ExHall D Poster #391
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model Poster Session 1
Yue Han ⋅ Jiangning Zhang ⋅ Junwei Zhu ⋅ Runze Hou ⋅ Xiaozhong Ji ⋅ Chuming Lin ⋅ Xiaobin Hu ⋅ Xuezhucun Xue ⋅ Yong Liu
ExHall D Poster #359
Self-Supervised Cross-View Correspondence with Predictive Cycle Consistency Poster Session 4
Alan Baade ⋅ Changan Chen
ExHall D Poster #89
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Poster Session 6
Ho Kei Cheng ⋅ Masato Ishii ⋅ Akio Hayakawa ⋅ Takashi Shibuya ⋅ Alexander G. Schwing ⋅ Yuki Mitsufuji
ExHall D Poster #260
Mimic In-Context Learning for Multimodal Tasks Poster Session 6
Yuchu Jiang ⋅ Jiale Fu ⋅ Chenduo Hao ⋅ Xinting Hu ⋅ Yingzhe Peng ⋅ Xin Geng ⋅ Xu Yang
ExHall D Poster #352
PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval Poster Session 4
Qiang Zou ⋅ Shuli Cheng ⋅ Jiayi Chen
ExHall D Poster #366
Vision-Language Models Do Not Understand Negation Poster Session 6
Kumail Alhamoud ⋅ Shaden Alshammari ⋅ Yonglong Tian ⋅ Guohao Li ⋅ Philip H.S. Torr ⋅ Yoon Kim ⋅ Marzyeh Ghassemi
ExHall D Poster #331
ID-Patch: Robust ID Association for Group Photo Personalization Poster Session 1
Yimeng Zhang ⋅ Tiancheng Zhi ⋅ Jing Liu ⋅ Shen Sang ⋅ Liming Jiang ⋅ Qing Yan ⋅ Sijia Liu ⋅ Linjie Luo
ExHall D Poster #270
SLVR: Super-Light Visual Reconstruction via Blueprint Controllable Convolutions and Exploring Feature Diversity Representation Poster Session 1
Ning Ni ⋅ Libao Zhang
ExHall D Poster #22
Vision-Language Embodiment for Monocular Depth Estimation Poster Session 6
Jinchang Zhang ⋅ Guoyu Lu
ExHall D Poster #318
Layered Image Vectorization via Semantic Simplification Poster Session 2
Zhenyu Wang ⋅ Jianxi Huang ⋅ Zhida Sun ⋅ Yuanhao Gong ⋅ Daniel Cohen-Or ⋅ Min Lu
ExHall D Poster #226
Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking Poster Session 4
You Wu ⋅ Xucheng Wang ⋅ Xiangyang Yang ⋅ Mengyuan Liu ⋅ Dan Zeng ⋅ Hengzhou Ye ⋅ Shuiwang Li
ExHall D Poster #123
Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data Poster Session 5
Haoxin Li ⋅ Boyang Li
ExHall D Poster #365
CoA: Towards Real Image Dehazing via Compression-and-Adaptation Poster Session 3
Long Ma ⋅ Yuxin Feng ⋅ Yan Zhang ⋅ Jinyuan Liu ⋅ Weimin Wang ⋅ Guang-Yong Chen ⋅ Chengpei Xu ⋅ Zhuo Su
ExHall D Poster #52
NightAdapter: Learning a Frequency Adapter for Generalizable Night-time Scene Segmentation Poster Session 5
Qi Bi ⋅ Jingjun Yi ⋅ Huimin Huang ⋅ Hao Zheng ⋅ Haolan Zhan ⋅ Yawen Huang ⋅ Yuexiang Li ⋅ Xian Wu ⋅ Yefeng Zheng
ExHall D Poster #270
TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model Poster Session 4
Cheng Yang ⋅ Yang Sui ⋅ Jinqi Xiao ⋅ Lingyi Huang ⋅ Yu Gong ⋅ Chendi Li ⋅ Jinghua Yan ⋅ Yu Bai ⋅ Ponnuswamy Sadayappan ⋅ Xia Hu ⋅ Bo Yuan
ExHall D Poster #381
Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction Poster Session 2
Teng Hu ⋅ Jiangning Zhang ⋅ Ran Yi ⋅ Jieyu Weng ⋅ Yabiao Wang ⋅ Xianfang Zeng ⋅ Xuezhucun Xue ⋅ Lizhuang Ma
ExHall D Poster #379
Learned Binocular-Encoding Optics for RGBD Imaging Using Joint Stereo and Focus Cues Poster Session 4
Yuhui Liu ⋅ Liangxun Ou ⋅ Qiang Fu ⋅ Hadi Amata ⋅ Wolfgang Heidrich ⋅ Yifan Peng
ExHall D Poster #22
Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans? Poster Session 2
Renshuai Tao ⋅ Haoyu Wang ⋅ Yuzhe Guo ⋅ Hairong Chen ⋅ Li Zhang ⋅ Xianglong Liu ⋅ Yunchao Wei ⋅ Yao Zhao
ExHall D Poster #473
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices Poster Session 4
Jianwen Jiang ⋅ Gaojie Lin ⋅ Zhengkun Rong ⋅ Chao Liang ⋅ Yongming Zhu ⋅ Jiaqi Yang ⋅ Tianyun Zhong
ExHall D Poster #6
D³: Scaling Up Deepfake Detection by Learning from Discrepancy Poster Session 5
Yongqi Yang ⋅ Zhihao Qian ⋅ Ye Zhu ⋅ Olga Russakovsky ⋅ Yu Wu
ExHall D Poster #271
Jailbreaking the Non-Transferable Barrier via Test-Time Data Disguising Poster Session 6
Yongli Xiang ⋅ Ziming Hong ⋅ Lina Yao ⋅ Dadong Wang ⋅ Tongliang Liu
ExHall D Poster #433
Light3R-SfM: Towards Feed-forward Structure-from-Motion Poster Session 4
Sven Elflein ⋅ Qunjie Zhou ⋅ Laura Leal-Taixe
ExHall D Poster #91
Robotic Visual Instruction Poster Session 3
Yanbang Li ⋅ ZiYang Gong ⋅ Haoyang Li ⋅ Xiaoqi Huang ⋅ Haolan Kang ⋅ Guangpingbai ⋅ Xianzheng Ma
ExHall D Poster #145
Solving Instance Detection from an Open-World Perspective Poster Session 2
Qianqian Shen ⋅ Yunhan Zhao ⋅ Nahyun Kwon ⋅ Jeeeun Kim ⋅ Yanan Li ⋅ Shu Kong
ExHall D Poster #431
Percept, Memory, and Imagine: World Feature Simulating for Open-Domain Unknown Object Detection Poster Session 1
Aming Wu ⋅ Cheng Deng
ExHall D Poster #432
Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses Poster Session 2
Yongfan Liu ⋅ Hyoukjun Kwon
ExHall D Poster #78
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination Poster Session 6
Jianing "Jed" Yang ⋅ Xuweiyi Chen ⋅ Nikhil Madaan ⋅ Madhavan Iyengar ⋅ Shengyi Qian ⋅ David Fouhey ⋅ Joyce Chai
ExHall D Poster #320
LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation Poster Session 1
Chenxu Zhou ⋅ Lvchang Fu ⋅ Sida Peng ⋅ Yunzhi Yan ⋅ Zhanhua Zhang ⋅ Chen Yong ⋅ Jiazhi Xia ⋅ Xiaowei Zhou
ExHall D Poster #128
Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining Poster Session 2
Guanglu Dong ⋅ Tianheng Zheng ⋅ Yuanzhouhan Cao ⋅ Linbo Qing ⋅ Chao Ren
ExHall D Poster #201
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network Poster Session 1
Haoyang He ⋅ Jiangning Zhang ⋅ Yuxuan Cai ⋅ Hongxu Chen ⋅ Xiaobin Hu ⋅ Zhenye Gan ⋅ Yabiao Wang ⋅ Chengjie Wang ⋅ Yunsheng Wu ⋅ Lei Xie
ExHall D Poster #415
EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues Poster Session 3
Sagar Soni ⋅ Akshay Dudhane ⋅ Hiyam Debary ⋅ Mustansar Fiaz ⋅ Muhammad Akhtar Munir ⋅ Muhammad Sohail Danish ⋅ Paolo Fraccaro ⋅ Campbell D Watson ⋅ Levente Klein ⋅ Fahad Shahbaz Khan ⋅ Salman Khan
ExHall D Poster #349
Learning Endogenous Attention for Incremental Object Detection Poster Session 6
Xiang Song ⋅ Yuhang He ⋅ Jingyuan Li ⋅ Qiang Wang ⋅ Yihong Gong
ExHall D Poster #403
Beyond Clean Training Data: A Versatile and Model-Agnostic Framework for Out-of-Distribution Detection with Contaminated Training Data Poster Session 2
Yuchuan Li ⋅ Jae-Mo Kang ⋅ Il-Min Kim
ExHall D Poster #458
Cubify Anything: Scaling Indoor 3D Object Detection Poster Session 5
Justin Lazarow ⋅ David Griffiths ⋅ Gefen Kohavi ⋅ Francisco Crespo ⋅ Afshin Dehghan
ExHall D Poster #112
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion Poster Session 6
Kai He ⋅ Chin-Hsuan Wu ⋅ Igor Gilitschenski
ExHall D Poster #45
Scale Efficient Training for Large Datasets Poster Session 4
Qing Zhou ⋅ Junyu Gao ⋅ Qi Wang
ExHall D Poster #443
Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation Poster Session 5
Junjie Chen ⋅ Weilong Chen ⋅ Yifan Zuo ⋅ Yuming Fang
ExHall D Poster #94
AnyMap: Learning a General Camera Model for Structure-from-Motion with Unknown Distortion in Dynamic Scenes Poster Session 4
Andrea Porfiri Dal Cin ⋅ Georgi Dikov ⋅ Jihong Ju ⋅ Mohsen Ghafoorian
ExHall D Poster #81
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection Poster Session 5
Ahyun Seo ⋅ Minsu Cho
ExHall D Poster #101
3D-HGS: 3D Half-Gaussian Splatting Poster Session 3
Haolin Li ⋅ Jinyang Liu ⋅ Mario Sznaier ⋅ Octavia Camps
ExHall D Poster #33
Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization Poster Session 3
Feifei Li ⋅ Mi Zhang ⋅ Yiming Sun ⋅ Min Yang
ExHall D Poster #248
Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control Poster Session 1
Basim Azam ⋅ Naveed Akhtar
ExHall D Poster #269
A General Adaptive Dual-level Weighting Mechanism for Remote Sensing Pansharpening Poster Session 2
Jie Huang ⋅ Haorui Chen ⋅ Jiaxuan Ren ⋅ Siran Peng ⋅ Liang-Jian Deng
ExHall D Poster #199
Controllable Human Image Generation with Personalized Multi-Garments Poster Session 6
Yisol Choi ⋅ Sangkyung Kwak ⋅ Sihyun Yu ⋅ Hyungwon Choi ⋅ Jinwoo Shin
ExHall D Poster #245
What’s in the Image? A Deep-Dive into the Vision of Vision Language Models Poster Session 3
Omri Kaduri ⋅ Shai Bagon ⋅ Tali Dekel
ExHall D Poster #372
Just Dance with pi! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection Poster Session 5
Snehashis Majhi ⋅ Giacomo D'Amicantonio ⋅ Antitza Dantcheva ⋅ Quan Kong ⋅ Lorenzo Garattoni ⋅ Gianpiero Francesca ⋅ Egor Bondarev ⋅ Francois Bremond
ExHall D Poster #310
Do ImageNet-trained Models Learn Shortcuts? The Impact of Frequency Shortcuts on Generalization Poster Session 5
Shunxin Wang ⋅ Raymond Veldhuis ⋅ Nicola Strisciuglio
ExHall D Poster #397
Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection Poster Session 2
Ting Li ⋅ Mao Ye ⋅ Tianwen Wu ⋅ Nianxin Li ⋅ Shuaifeng Li ⋅ Song Tang ⋅ Luping Ji
ExHall D Poster #128
TinyFusion: Diffusion Transformers Learned Shallow Poster Session 4
Gongfan Fang ⋅ Kunjun Li ⋅ Xinyin Ma ⋅ Xinchao Wang
ExHall D Poster #223
NSD-Imagery: A Benchmark Dataset for Extending fMRI Vision Decoding Methods to Mental Imagery Poster Session 6
Reese Kneeland ⋅ Paul Scotti ⋅ Ghislain St-Yves ⋅ Jesse L Breedlove ⋅ Kendrick N Kay ⋅ Thomas Naselaris
ExHall D Poster #256
Poly-Autoregressive Prediction for Modeling Interactions Poster Session 3
Neerja Thakkar ⋅ Tara Sadjadpour ⋅ Jathushan Rajasegaran ⋅ Shiry Ginosar ⋅ Jitendra Malik
ExHall D Poster #167
ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images Poster Session 2
Yanqing Shen ⋅ Turcan Tuna ⋅ Marco Hutter ⋅ Cesar Cadena ⋅ Nanning Zheng
ExHall D Poster #123
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism Poster Session 1
Zhixiong Nan ⋅ Xianghong Li ⋅ Tao Xiang ⋅ Jifeng Dai
ExHall D Poster #434
LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living Poster Session 5
Dominick Reilly ⋅ Rajatsubhra Chakraborty ⋅ Arkaprava Sinha ⋅ Manish Kumar Govind ⋅ Pu Wang ⋅ Francois Bremond ⋅ Le Xue ⋅ Srijan Das
ExHall D Poster #313
3D-MVP: 3D Multiview Pretraining for Manipulation Poster Session 5
Shengyi Qian ⋅ Kaichun Mo ⋅ Valts Blukis ⋅ David Fouhey ⋅ Dieter Fox ⋅ Ankit Goyal
ExHall D Poster #140
TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction Poster Session 6
Aishwarya Agarwal ⋅ Srikrishna Karanam ⋅ Vineet Gandhi
ExHall D Poster #389
InterDyn: Controllable Interactive Dynamics with Video Diffusion Models Poster Session 3
Rick Akkerman ⋅ Haiwen Feng ⋅ Michael J. Black ⋅ Dimitrios Tzionas ⋅ Victoria Abrevaya
ExHall D Poster #173
Concept Lancet: Image Editing with Compositional Representation Transplant Poster Session 6
Jinqi Luo ⋅ Tianjiao Ding ⋅ Kwan Ho Ryan Chan ⋅ Hancheng Min ⋅ Chris Callison-Burch ⋅ Rene Vidal
ExHall D Poster #223
Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion Poster Session 1
Vitor Guizilini ⋅ Muhammad Zubair Irshad ⋅ Dian Chen ⋅ Greg Shakhnarovich ⋅ Rares Andrei Ambrus
ExHall D Poster #56
Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects Poster Session 4
Amir Barda ⋅ Matheus Gadelha ⋅ Vladimir G. Kim ⋅ Noam Aigerman ⋅ Amit H. Bermano ⋅ Thibault Groueix
ExHall D Poster #40
DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery Poster Session 6
Utkarsh Mall ⋅ Cheng Perng Phoo ⋅ Mia Chiquier ⋅ Bharath Hariharan ⋅ Kavita Bala ⋅ Carl Vondrick
ExHall D Poster #297
Preserving Clusters in Prompt Learning for Unsupervised Domain Adaptation Poster Session 4
Long Tung Vuong ⋅ Hoang Phan ⋅ Vy Vo ⋅ Anh Tuan Bui ⋅ Thanh-Toan Do ⋅ Trung Le ⋅ Dinh Phung
ExHall D Poster #398
Noise-Resistant Video Anomaly Detection via RGB Error-Guided Multiscale Predictive Coding and Dynamic Memory Poster Session 4
Han Hu ⋅ Wenli Du ⋅ Peng Liao ⋅ Bing Wang ⋅ Siyuan Fan
ExHall D Poster #316
Investigating the Role of Weight Decay in Enhancing Nonconvex SGD Poster Session 3
Tao Sun ⋅ Yuhao Huang ⋅ Li Shen ⋅ Kele Xu ⋅ Bao Wang
ExHall D Poster #444
FreeScene: Mixed Graph Diffusion for 3D Scene Synthesis from Free Prompts Poster Session 2
Tongyuan Bai ⋅ Wangyuanfan Bai ⋅ Dong Chen ⋅ Tieru Wu ⋅ Manyi Li ⋅ Rui Ma
ExHall D Poster #43
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction Poster Session 5
Jingcheng Ni ⋅ Yuxin Guo ⋅ Yichen Liu ⋅ Rui Chen ⋅ Lewei Lu ⋅ Zehuan Wu
ExHall D Poster #127
Dual-Agent Optimization framework for Cross-Domain Few-Shot Segmentation Poster Session 2
Zhaoyang Li ⋅ Yuan Wang ⋅ Wangkai Li ⋅ Tianzhu Zhang ⋅ Xiang Liu
ExHall D Poster #426
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction Poster Session 2
Yuan Wang ⋅ Yali Li ⋅ Lixiang Li ⋅ Shengjin Wang
ExHall D Poster #171
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding Poster Session 3
Chenxin Tao ⋅ Shiqian Su ⋅ Xizhou Zhu ⋅ Chenyu Zhang ⋅ Zhe Chen ⋅ Jiawen Liu ⋅ Wenhai Wang ⋅ Lewei Lu ⋅ Gao Huang ⋅ Yu Qiao ⋅ Jifeng Dai
ExHall D Poster #374
TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion Poster Session 4
Haoyue Liu ⋅ Jinghan Xu ⋅ Yi Chang ⋅ Hanyu Zhou ⋅ Haozhi Zhao ⋅ Lin Wang ⋅ Luxin Yan
ExHall D Poster #176
OccMamba: Semantic Occupancy Prediction with State Space Models Poster Session 3
Heng Li ⋅ Yuenan Hou ⋅ Xiaohan Xing ⋅ Yuexin Ma ⋅ Xiao Sun ⋅ Yanyong Zhang
ExHall D Poster #126
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion Poster Session 5
Hao Wen ⋅ Zehuan Huang ⋅ Yaohui Wang ⋅ Xinyuan Chen ⋅ Lu Sheng
ExHall D Poster #55
Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models Poster Session 1
Jin Wang ⋅ Chenghui Lv ⋅ Xian Li ⋅ Shichao Dong ⋅ Huadong Li ⋅ Kelu Yao ⋅ Chao Li ⋅ Wenqi Shao ⋅ Ping Luo
ExHall D Poster #388
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models Poster Session 5
Yassir Bendou ⋅ Amine Ouasfi ⋅ Vincent Gripon ⋅ Adnane Boukhayma
ExHall D Poster #387
One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models Poster Session 5
Senmao Li ⋅ Lei Wang ⋅ Kai Wang ⋅ Tao Liu ⋅ Jiehang Xie ⋅ Joost van de Weijer ⋅ Fahad Shahbaz Khan ⋅ Shiqi Yang ⋅ Yaxing Wang ⋅ Jian Yang
ExHall D Poster #240
Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning Poster Session 6
Jiuyang Dong ⋅ Junjun Jiang ⋅ Kui Jiang ⋅ Jiahan Li ⋅ Yongbing Zhang
ExHall D Poster #447
Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM Poster Session 3
Yizhou Huang ⋅ Yihua Cheng ⋅ Kezhi Wang
ExHall D Poster #136
DiC: Rethinking Conv3x3 Designs in Diffusion Models Poster Session 1
Yuchuan Tian ⋅ Jing Han ⋅ Chengcheng Wang ⋅ Yuchen Liang ⋅ Chao Xu ⋅ Hanting Chen
ExHall D Poster #220
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction Poster Session 4
Kuang Wu ⋅ Chuan Yang ⋅ Zhanbin Li
ExHall D Poster #130
S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting Poster Session 1
Yecong Wan ⋅ Mingwen Shao ⋅ Yuanshuo Cheng ⋅ Wangmeng Zuo
ExHall D Poster #51
Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method Poster Session 1
Pan Yin ⋅ Kaiyu Li ⋅ Xiangyong Cao ⋅ Jing Yao ⋅ Lei Liu ⋅ Xueru Bai ⋅ Feng Zhou ⋅ Deyu Meng
ExHall D Poster #127
DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes Poster Session 2
Jinxiu Liu ⋅ Shaoheng Lin ⋅ Yinxiao Li ⋅ Ming-Hsuan Yang
ExHall D Poster #67
VideoGigaGAN: Towards Detail-rich Video Super-Resolution Poster Session 1
Yiran Xu ⋅ Taesung Park ⋅ Richard Zhang ⋅ Yang Zhou ⋅ Eli Shechtman ⋅ Feng Liu ⋅ Jia-Bin Huang ⋅ Difan Liu
ExHall D Poster #185
ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices Poster Session 1
Hao Yu ⋅ Tangyu Jiang ⋅ Shuning Jia ⋅ Shannan Yan ⋅ Shunning Liu ⋅ Haolong Qian ⋅ Guanghao Li ⋅ Shuting Dong ⋅ Chun Yuan
ExHall D Poster #416
MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning Poster Session 2
Xu Han ⋅ Yuan Tang ⋅ Jinfeng Xu ⋅ Xianzhi Li
ExHall D Poster #116
Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted Poster Session 2
Shuaiwei Yuan ⋅ Junyu Dong ⋅ Yuezun Li
ExHall D Poster #324
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing Poster Session 3
Jingxuan Wei ⋅ Cheng Tan ⋅ Qi Chen ⋅ Gaowei Wu ⋅ Siyuan Li ⋅ Zhangyang Gao ⋅ Linzhuang Sun ⋅ Bihui Yu ⋅ Ruifeng Guo
ExHall D Poster #254
Active Data Curation Effectively Distills Large-Scale Multimodal Models Poster Session 3
Vishaal Udandarao ⋅ Nikhil Parthasarathy ⋅ Muhammad Ferjad Naeem ⋅ Talfan Evans ⋅ Samuel Albanie ⋅ Federico Tombari ⋅ Yongqin Xian ⋅ Alessio Tonioni ⋅ Olivier J Henaff
ExHall D Poster #361
Attraction Diminishing and Distributing for Few-Shot Class-Incremental Learning Poster Session 5
Li-Jun Zhao ⋅ Zhen-Duo Chen ⋅ Yongxin Wang ⋅ Xin Luo ⋅ Xin-Shun Xu
ExHall D Poster #441
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions Poster Session 3
Sirui Xu ⋅ Hung Yu Ling ⋅ Yu-Xiong Wang ⋅ Liangyan Gui
ExHall D Poster #155
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Poster Session 3
Yuying Ge ⋅ Yizhuo Li ⋅ Yixiao Ge ⋅ Ying Shan
ExHall D Poster #282
Gaussian Splashing: Unified Particles for Versatile Motion Synthesis and Rendering Poster Session 1
Yutao Feng ⋅ Xiang Feng ⋅ Yintong Shang ⋅ Ying Jiang ⋅ Chang Yu ⋅ Zeshun Zong ⋅ Tianjia Shao ⋅ Hongzhi Wu ⋅ Kun Zhou ⋅ Chenfanfu Jiang ⋅ Yin Yang
ExHall D Poster #33
NoPain: No-box Point Cloud Attack via Optimal Transport Singular Boundary Poster Session 1
Zezeng Li ⋅ Xiaoyu Du ⋅ Na Lei ⋅ Liming Chen ⋅ Weimin Wang
ExHall D Poster #317
BHViT: Binarized Hybrid Vision Transformer Poster Session 1
Tian Gao ⋅ Yu Zhang ⋅ Zhiyuan Zhang ⋅ Huajun Liu ⋅ Kaijie Yin ⋅ Cheng-Zhong Xu ⋅ Hui Kong
ExHall D Poster #323
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction Poster Session 6
Yuanhui Huang ⋅ Amonnut Thammatadatrakoon ⋅ Wenzhao Zheng ⋅ Yunpeng Zhang ⋅ Dalong Du ⋅ Jiwen Lu
ExHall D Poster #126
Improving Accuracy and Calibration via Differentiated Deep Mutual Learning Poster Session 5
Han Liu ⋅ Peng Cui ⋅ Bingning Wang ⋅ Weipeng Chen ⋅ Yupeng Zhang ⋅ Jun Zhu ⋅ Xiaolin Hu
ExHall D Poster #459
BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training Poster Session 6
Xuanpu Zhang ⋅ Dan Song ⋅ Pengxin Zhan ⋅ Tianyu Chang ⋅ Jianhao Zeng ⋅ Qing-Guo Chen ⋅ Weihua Luo ⋅ An-An Liu
ExHall D Poster #20
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion Poster Session 5
Jiuhai Chen ⋅ Jianwei Yang ⋅ Haiping Wu ⋅ Dianqi Li ⋅ Jianfeng Gao ⋅ Tianyi Zhou ⋅ Bin Xiao
ExHall D Poster #372
Implicit Correspondence Learning for Image-to-Point Cloud Registration Poster Session 4
Xinjun Li ⋅ Wenfei Yang ⋅ Jiacheng Deng ⋅ Zhixin Cheng ⋅ Xu Zhou ⋅ Tianzhu Zhang
ExHall D Poster #106
nnWNet: Rethinking the Use of Transformers in Biomedical Image Segmentation and Calling for a Unified Evaluation Benchmark Poster Session 4
Yanfeng Zhou ⋅ Lingrui Li ⋅ Le Lu ⋅ Minfeng Xu
ExHall D Poster #480
Neural Video Compression with Context Modulation Poster Session 3
Chuanbo Tang ⋅ Zhuoyuan Li ⋅ Yifan Bian ⋅ Li Li ⋅ Dong Liu
ExHall D Poster #181
AniMer: Animal Pose and Shape Estimation Using Family Aware Transformer Poster Session 4
Jin Lyu ⋅ Tianyi Zhu ⋅ Yi Gu ⋅ Li Lin ⋅ Pujin Cheng ⋅ Yebin Liu ⋅ Xiaoying Tang ⋅ Liang An
ExHall D Poster #161
Learning Class Prototypes for Unified Sparse-Supervised 3D Object Detection Poster Session 2
Yun Zhu ⋅ Le Hui ⋅ Hang Yang ⋅ Jianjun Qian ⋅ Jin Xie ⋅ Jian Yang
ExHall D Poster #432
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting Poster Session 3
Shaofei Cai ⋅ Zihao Wang ⋅ Kewei Lian ⋅ Zhancun Mu ⋅ Xiaojian Ma ⋅ Anji Liu ⋅ Yitao Liang
ExHall D Poster #142
OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation Poster Session 3
Xiao Cui ⋅ Yulei Qin ⋅ Wengang Zhou ⋅ Hongsheng Li ⋅ Houqiang Li
ExHall D Poster #440
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key Poster Session 3
Zhihe Yang ⋅ Xufang Luo ⋅ Dongqi Han ⋅ Yunjian Xu ⋅ Dongsheng Li
ExHall D Poster #373
Cross-Modal 3D Representation with Multi-View Images and Point Clouds Poster Session 1
Ziyang Zhou ⋅ Pinghui Wang ⋅ Zi Liang ⋅ Haitao Bai ⋅ Ruofei Zhang
ExHall D Poster #339
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer Poster Session 6
Hao Chen ⋅ Ze Wang ⋅ Xiang Li ⋅ Ximeng Sun ⋅ Fangyi Chen ⋅ Jiang Liu ⋅ Jindong Wang ⋅ Bhiksha Raj ⋅ Zicheng Liu ⋅ Emad Barsoum
ExHall D Poster #209
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis Poster Session 5
Chaoyou Fu ⋅ Yuhan Dai ⋅ Yongdong Luo ⋅ Lei Li ⋅ Shuhuai Ren ⋅ Renrui Zhang ⋅ Zihan Wang ⋅ Chenyu Zhou ⋅ Yunhang Shen ⋅ Mengdan Zhang ⋅ Peixian Chen ⋅ Yanwei Li ⋅ Shaohui Lin ⋅ Sirui Zhao ⋅ Ke Li ⋅ Tong Xu ⋅ Xiawu Zheng ⋅ Enhong Chen ⋅ Caifeng Shan ⋅ Ran He ⋅ Xing Sun
ExHall D Poster #295
Bridging Modalities: Improving Universal Multimodal Retrieval by Multimodal Large Language Models Poster Session 2
Xin Zhang ⋅ Yanzhao Zhang ⋅ Wen Xie ⋅ Mingxin Li ⋅ Ziqi Dai ⋅ Dingkun Long ⋅ Pengjun Xie ⋅ Meishan Zhang ⋅ Wenjie Li ⋅ Min Zhang
ExHall D Poster #372
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation Poster Session 2
Lang Lin ⋅ Xueyang Yu ⋅ Ziqi Pang ⋅ Yu-Xiong Wang
ExHall D Poster #314
DeformCL: Learning Deformable Centerline Representation for Vessel Extraction in 3D Medical Image Poster Session 6
Ziwei Zhao ⋅ Zhixing Zhang ⋅ Yuhang Liu ⋅ Zhao Zhang ⋅ Haojun Yu ⋅ Dong Wang ⋅ Liwei Wang
ExHall D Poster #454
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model Poster Session 1
Benlin Liu ⋅ Yuhao Dong ⋅ Yiqin Wang ⋅ Zixian Ma ⋅ Yansong Tang ⋅ Luming Tang ⋅ Yongming Rao ⋅ Wei-Chiu Ma ⋅ Ranjay Krishna
ExHall D Poster #344
PrEditor3D: Fast and Precise 3D Shape Editing Poster Session 1
Ziya Erkoc ⋅ Can Gümeli ⋅ Chaoyang Wang ⋅ Matthias Nießner ⋅ Angela Dai ⋅ Peter Wonka ⋅ Hsin-Ying Lee ⋅ Peiye Zhuang
ExHall D Poster #44
CaricatureBooth: Data-Free Interactive Caricature Generation in a Photo Booth Poster Session 3
Zhiyu Qu ⋅ Yunqi Miao ⋅ Zhensong Zhang ⋅ Jifei Song ⋅ Jiankang Deng ⋅ Yi-Zhe Song
ExHall D Poster #15
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction Poster Session 2
Sicheng Zuo ⋅ Wenzhao Zheng ⋅ Yuanhui Huang ⋅ Jie Zhou ⋅ Jiwen Lu
ExHall D Poster #134