CVPR 2025 Accepted Papers
WISH: Weakly Supervised Instance Segmentation using Heterogeneous Labels
Hyeokjun Kweon · Kuk-Jin Yoon
|
ExHall D Poster #414 | |
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
Poster Session 3
Jiazi Bu · Pengyang Ling · Pan Zhang · Tong Wu · Xiaoyi Dong · Yuhang Zang · Yuhang Cao · Dahua Lin · Jiaqi Wang
|
ExHall D Poster #224 | |
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think
Jie Tian · Xiaoye Qu · Zhenyi Lu · Wei Wei · Sichen Liu · Yu Cheng
|
ExHall D Poster #177 | |
Universal Scene Graph Generation
Shengqiong Wu · Hao Fei · Tat-seng Chua
|
ExHall D Poster #336 | |
Removing Reflections from RAW Photos
Poster Session 1
Eric Kee · Adam Pikielny · Kevin Blackburn-Matzen · Marc Levoy
|
ExHall D Poster #169 | |
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Poster Session 6
Ziyang Zhang · Yang Yu · Yucheng Chen · Xulei Yang · Si Yong Yeo
|
ExHall D Poster #345 | |
Infighting in the Dark: Multi-Label Backdoor Attack in Federated Learning
Poster Session 5
Ye Li · Yanchao Zhao · chengcheng zhu · Jiale Zhang
|
ExHall D Poster #453 | |
Augmented Deep Contexts for Spatially Embedded Video Coding
Yifan Bian · Chuanbo Tang · Li Li · Dong Liu
|
ExHall D Poster #181 | |
A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation
Poster Session 5
Zheng Zhang · Guanchun Yin · Bo Zhang · Wu Liu · Xiuzhuang Zhou · Wendong Wang
|
ExHall D Poster #471 | |
SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization
Junchen Yu · Si-Yuan Cao · Runmin Zhang · Chenghao Zhang · Zhu Yu · Shujie Chen · Bailin Yang · Hui-Liang Shen
|
ExHall D Poster #82 | |
VIRES: Video Instance Repainting via Sketch and Text Guided Generation
Poster Session 6
Shuchen Weng · Haojie Zheng · Peixuan Zhang · Yuchen Hong · Han Jiang · Si Li · Boxin Shi
|
ExHall D Poster #215 | |
AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion
Poster Session 2
Mingzhen Sun · Weining Wang · Li · Jiawei Liu · Jiahui Sun · Wanquan Feng · Shanshan Lao · SiYu Zhou · Qian HE · Jing Liu
|
ExHall D Poster #191 | |
FilmComposer: LLM-Driven Music Production for Silent Film Clips
Poster Session 3
Zhifeng Xie · Qile He · Youjia Zhu · Qiwei He · Mengtian Li
|
ExHall D Poster #274 | |
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
Poster Session 4
Zixuan Huang · Mark Boss · Aaryaman Vasishta · James Rehg · Varun Jampani
|
ExHall D Poster #99 | |
Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport
Poster Session 1
Hao Tan · Zichang Tan · Jun Li · Ajian Liu · Jun Wan · Zhen Lei
|
ExHall D Poster #429 | |
GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation
Poster Session 2
Ruihai Wu · Ziyu Zhu · Yuran Wang · Yue Chen · Jiarui Wang · Hao Dong
|
ExHall D Poster #151 | |
ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction
Poster Session 2
YUEJIAO SU · Yi Wang · Qiongyang Hu · Chuang Yang · Lap-Pui Chau
|
ExHall D Poster #350 | |
Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation
Poster Session 5
Jiaming Zhou · Teli Ma · Kun-Yu Lin · Zifan Wang · Ronghe Qiu · Junwei Liang
|
ExHall D Poster #142 | |
Structured 3D Latents for Scalable and Versatile 3D Generation
Poster Session 5
Jianfeng XIANG · Zelong Lv · Sicheng Xu · Yu Deng · Ruicheng Wang · Bowen Zhang · Dong Chen · Xin Tong · Jiaolong Yang
|
ExHall D Poster #40 | |
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation
Poster Session 2
Qihui Zhang · Munan Ning · Zheyuan Liu · Yanbo Wang · Jiayi Ye · Yue Huang · Shuo Yang · Xiao Chen · Yibing Song · Li Yuan
|
ExHall D Poster #362 | |
DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
Poster Session 5
Haoyang Li · Liang Wang · Chao Wang · Jing Jiang · Yan Peng · Guodong Long
|
ExHall D Poster #438 | |
FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering
Poster Session 6
Guofeng Feng · Siyan Chen · Rong Fu · Zimu Liao · Yi Wang · Tao Liu · Boni Hu · Linning Xu · PeiZhilin · Hengjie Li · Xiuhong Li · Ninghui Sun · Xingcheng Zhang · Bo Dai
|
ExHall D Poster #47 | |
PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation
Poster Session 3
Uyoung Jeong · Jonathan Freer · Seungryul Baek · Hyung Jin Chang · Kwang In Kim
|
ExHall D Poster #156 | |
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Poster Session 6
Wenbin An · Feng Tian · Sicong Leng · Jiahao Nie · Haonan Lin · QianYing Wang · Ping Chen · Xiaoqin Zhang · Shijian Lu
|
ExHall D Poster #360 | |
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
Poster Session 6
Han Xiao · yina xie · Guanxin tan · Yinghao Chen · Rui Hu · Ke Wang · Aojun Zhou · Hao Li · Hao Shao · Xudong LU · Peng Gao · Yafei Wen · Xiaoxin Chen · Shuai Ren · Hongsheng Li
|
ExHall D Poster #325 | |
MagicArticulate: Make Your 3D Models Articulation-Ready
Poster Session 4
Chaoyue Song · Jianfeng Zhang · Xiu Li · Fan Yang · Yiwen Chen · Zhongcong Xu · Jun Hao Liew · Xiaoyang Guo · Fayao Liu · Jiashi Feng · Guosheng Lin
|
ExHall D Poster #13 | |
Vision-Guided Action: Enhancing 3D Human Motion Prediction with Gaze-informed Affordance in 3D Scenes
Poster Session 3
Ting Yu · Yi Lin · Jun Yu · Zhenyu Lou · Qiongjie Cui
|
ExHall D Poster #161 | |
ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion
Poster Session 1
Nissim Maruani · Wang Yifan · Matthew Fisher · Pierre Alliez · Mathieu Desbrun
|
ExHall D Poster #41 | |
AdaCM²: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction
Poster Session 2
Yuanbin Man · Ying Huang · Chengming Zhang · Bingzhe Li · Wei Niu · Miao Yin
|
ExHall D Poster #301 | |
VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Poster Session 4
Hanyang Wang · Fangfu Liu · Jiawei Chi · Yueqi Duan
|
ExHall D Poster #61 | |
Breaking the Low-Rank Dilemma of Linear Attention
Poster Session 5
Qihang Fan · Huaibo Huang · Ran He
|
ExHall D Poster #404 | |
On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach
Poster Session 4
Baoshun Tong · Hanjiang Lai · Yan Pan · Jian Yin
|
ExHall D Poster #392 | |
Don't Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving
Poster Session 5
Ziying Song · Caiyan Jia · Lin Liu · Hongyu Pan · Yongchang Zhang · Junming Wang · Xingyu Zhang · Shaoqing Xu · Lei Yang · Yadan Luo
|
ExHall D Poster #131 | |
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension
Poster Session 3
Xiaofu Chen · Yaxin Luo · Luo · Jiayi Ji · Henghui Ding · Yiyi Zhou
|
ExHall D Poster #353 | |
Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging
Poster Session 3
Bo Wang · Dingwei Tan · Yen-Ling Kuo · Zhaowei Sun · Jeremy M Wolfe · Tat-Jen Cham · Mengmi Zhang
|
ExHall D Poster #398 | |
Can Machines Understand Composition? Dataset and Benchmark for Photographic Image Composition Embedding and Understanding
Zhaoran Zhao · Peng Lu · Anran Zhang · Pei Pei Li · Xia Li · Xuannan Liu · Yang Hu · Shiyi Chen · liweiwang · Wenhao Guo
|
ExHall D Poster #360 | |
Let's Chorus: Partner-aware Hybrid Song-Driven 3D Head Animation
Poster Session 2
Xiumei Xie · Zikai Huang · Wenhao Xu · Peng Xiao · Xuemiao Xu · Huaidong Zhang
|
ExHall D Poster #2 | |
STiL: Semi-supervised Tabular-Image Learning for Comprehensive Task-Relevant Information Exploration in Multimodal Classification
Poster Session 3
Siyi Du · Xinzhe Luo · Declan ORegan · Chen Qin
|
ExHall D Poster #469 | |
AFL: A Single-Round Analytic Approach for Federated Learning with Pre-trained Models
Poster Session 1
Run He · Kai Tong · Di Fang · Han Sun · Ziqian Zeng · Haoran Li · Tianyi Chen · Huiping Zhuang
|
ExHall D Poster #461 | |
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Poster Session 4
Wei Huang · Qinying Gu · Nanyang Ye
|
ExHall D Poster #328 | |
Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation
Poster Session 1
Qinghe Ma · Jian Zhang · Zekun Li · Lei Qi · Qian Yu · Yinghuan Shi
|
ExHall D Poster #479 | |
Disentangled Pose and Appearance Guidance for Multi-Pose Generation
Poster Session 2
Tengfei Xiao · Yue Wu · Yuelong Li · Can Qin · Maoguo Gong · Qiguang Miao · Wenping Ma
|
ExHall D Poster #19 | |
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
Poster Session 5
Xinghui Li · Qichao Sun · Pengze Zhang · Fulong Ye · Zhichao Liao · Wanquan Feng · Songtao Zhao · Qian HE
|
ExHall D Poster #258 | |
Revisiting MAE Pre-training for 3D Medical Image Segmentation
Tassilo Wald · Constantin Ulrich · Stanislav Lukyanenko · Andrei Goncharov · Alberto Paderno · Maximilian Miller · Leander Maerkisch · Paul F Jaeger · Klaus Maier-Hein
|
ExHall D Poster #480 | |
Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption
Poster Session 3
Du CHEN · Tianhe Wu · Kede Ma · Lei Zhang
|
ExHall D Poster #200 | |
Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples
Poster Session 3
WEIWEI LI · Junzhuo Liu · Yuanyuan Ren · Yuchen Zheng · Yahao Liu · Wen Li
|
ExHall D Poster #463 | |
Rethinking Token Reduction with Parameter-Efficient Fine-Tuning in ViT for Pixel-Level Tasks
Poster Session 3
Cheng Lei · Ao Li · Hu Yao · Ce Zhu · Le Zhang
|
ExHall D Poster #412 | |
SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection
Poster Session 1
Xin Lin · Chong Shi · Zuopeng Yang · Haojin Tang · Zhili Zhou
|
ExHall D Poster #419 | |
LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging
Poster Session 6
Maximilian Rokuss · Yannick Kirchhoff · Seval Akbal · Balint Kovacs · Saikat Roy · Constantin Ulrich · Tassilo Wald · Lukas T. Rotkopf · Heinz-Peter Schlemmer · Klaus Maier-Hein
|
ExHall D Poster #452 | |
Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation
Poster Session 3
Xingguo Lv · Xingbo Dong · Liwen Wang · Jiewen Yang · Lei Zhao · Bin Pu · Zhe Jin · Xuejun Li
|
ExHall D Poster #476 | |
LightLoc: Learning Outdoor LiDAR Localization at Light Speed
Poster Session 2
Wen Li · Chen Liu · Shangshu Yu · dq Liu · Yin Zhou · Siqi Shen · Chenglu Wen · Cheng Wang
|
ExHall D Poster #125 | |
RAEncoder: A Label-Free Reversible Adversarial Examples Encoder for Dataset Intellectual Property Protection
Poster Session 4
Fan Xing · Zhuo Tian · Xuefeng Fan · Xiaoyi Zhou
|
ExHall D Poster #462 | |
SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow
Poster Session 5
Qingyuan Wang · Rui Song · Jiaojiao Li · Kerui Cheng · David Ferstl · Yinlin Hu
|
ExHall D Poster #95 | |
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
Poster Session 6
Yiqi Zhu · Ziyue Wang · Can Zhang · Peng Li · Yang Liu
|
ExHall D Poster #326 | |
Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation
Poster Session 6
Xinhao Zhong · Hao Fang · Bin Chen · Xulin Gu · Meikang Qiu · Shuhan Qi · Shu-Tao Xia
|
ExHall D Poster #413 | |
Synthetic Prior for Few-Shot Drivable Head Avatar Inversion
Poster Session 3
Wojciech Zielonka · Stephan J. Garbin · Alexandros Lattas · George Kopanas · Paulo Gotardo · Thabo Beeler · Justus Thies · Timo Bolkart
|
ExHall D Poster #8 | |
Gaussian Eigen Models for Human Heads
Poster Session 4
Wojciech Zielonka · Timo Bolkart · Thabo Beeler · Justus Thies
|
ExHall D Poster #7 | |
DV-Matcher: Deformation-based Non-rigid Point Cloud Matching Guided by Pre-trained Visual Features
Poster Session 6
Zhangquan Chen · Puhua Jiang · Ruqi Huang
|
ExHall D Poster #106 | |
Recognition-Synergistic Scene Text Editing
Poster Session 3
Zhengyao Fang · Pengyuan Lyu · Jingjing Wu · Chengquan Zhang · Jun Yu · Guangming Lu · Wenjie Pei
|
ExHall D Poster #234 | |
CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement
Poster Session 1
Yun Liu · Chengwen Zhang · Ruofan Xing · Bingda Tang · Bowen Yang · Li Yi
|
ExHall D Poster #149 | |
STAR-Edge: Structure-aware Local Spherical Curve Representation for Thin-walled Edge Extraction from Unstructured Point Clouds
Poster Session 6
Zikuan Li · Honghua Chen · Yuecheng Wang · Sibo Wu · Mingqiang Wei · Jun Wang
|
ExHall D Poster #105 | |
Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting
Poster Session 4
Shu-Wei Lu · Yi-Hsuan Tsai · Yi-Ting Chen
|
ExHall D Poster #125 | |
PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset
Poster Session 4
Jiazhen Liu · Yuhan Fu · Ruobing Xie · Runquan Xie · Xingwu Sun · Fengzong Lian · Zhanhui Kang · Xirong Li
|
ExHall D Poster #386 | |
CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering
Tianyu Huai · Jie Zhou · Xingjiao Wu · Qin Chen · Qingchun Bai · Zezhou · Liang He
|
ExHall D Poster #362 | |
Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding
Wei Suo · Lijun Zhang · Mengyang Sun · Lin Yuanbo Wu · Peng Wang · Yanning Zhang
|
ExHall D Poster #359 | |
Active Event-based Stereo Vision
Poster Session 1
Jianing Li · Yunjian Zhang · Haiqian Han · Xiangyang Ji
|
ExHall D Poster #75 | |
Parametric Point Cloud Completion for Polygonal Surface Reconstruction
Poster Session 3
Zhaiyu Chen · Yuqing Wang · Liangliang Nan · Xiao Xiang Zhu
|
ExHall D Poster #106 | |
FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error
Poster Session 3
Beilin Chu · Xuan Xu · Xin Wang · Yufei Zhang · Weike You · Linna Zhou
|
ExHall D Poster #208 | |
Decoupling Training-Free Guided Diffusion by ADMM
Poster Session 5
Youyuan Zhang · Zehua Liu · Zenan Li · Zhaoyu Li · James Clark · Xujie Si
|
ExHall D Poster #213 | |
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation
Poster Session 3
Yuan Gan · Jiaxu Miao · Yunze Wang · Yi Yang
|
ExHall D Poster #266 | |
Event Fields: Capturing Light Fields at High Speed, Resolution, and Dynamic Range
Ziyuan Qu · Zihao Zou · Vivek Boominathan · Praneeth Chakravarthula · Adithya Pediredla
|
ExHall D Poster #73 | |
Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval
Poster Session 6
Boseung Jeong · Jicheol Park · Sungyeon Kim · Suha Kwak
|
ExHall D Poster #263 | |
GENIUS: A Generative Framework for Universal Multimodal Search
Poster Session 4
Sungyeon Kim · Xinliang Zhu · Xiaofan Lin · Muhammet Bastan · Douglas Gray · Suha Kwak
|
ExHall D Poster #367 | |
Take the Bull by the Horns: Learning to Segment Hard Samples
Poster Session 3
Yuan Guo · Jingyu Kong · Yu Wang · Yuping Duan
|
ExHall D Poster #478 | |
DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation
Poster Session 1
Amin Karimi · Charalambos Poullis
|
ExHall D Poster #423 | |
Understanding Multi-layered Transmission Matrices
Marina Alterman · Anat Levin
|
ExHall D Poster #201 | |
Self-Supervised Spatial Correspondence Across Modalities
Poster Session 2
Ayush Shrivastava · Andrew Owens
|
ExHall D Poster #96 | |
Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer
Poster Session 2
Ziyi Liu · Yangcen Liu
|
ExHall D Poster #319 | |
Gradient-Guided Annealing for Domain Generalization
Poster Session 4
Aristotelis Ballas · Christos Diou
|
ExHall D Poster #452 | |
Percept, Memory, and Imagine: World Feature Simulating for Open-Domain Unknown Object Detection
Poster Session 1
Aming Wu · Cheng Deng
|
ExHall D Poster #432 | |
Token Cropr: Faster ViTs for Quite a Few Tasks
Poster Session 2
Benjamin Bergner · Christoph Lippert · Aravindh Mahendran
|
ExHall D Poster #416 | |
IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments
Poster Session 3
Can Zhang · Gim Hee Lee
|
ExHall D Poster #143 | |
DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation
Poster Session 4
Bo-Wen Yin · Jiao-Long Cao · Ming-Ming Cheng · Qibin Hou
|
ExHall D Poster #338 | |
DiffFNO: Diffusion Fourier Neural Operator
Poster Session 1
Xiaoyi Liu · Hao Tang
|
ExHall D Poster #195 | |
Effortless Active Labeling for Long-Term Test-Time Adaptation
Poster Session 5
Guowei Wang · Changxing Ding
|
ExHall D Poster #439 | |
Matrix-Free Shared Intrinsics Bundle Adjustment
Poster Session 6
Daniel Safari
|
ExHall D Poster #83 | |
End-to-End Implicit Neural Representations for Classification
Poster Session 4
Alexander Gielisse · Jan van Gemert
|
ExHall D Poster #281 | |
Online Task-Free Continual Learning via Dynamic Expansionable Memory Distribution
Poster Session 4
Fei Ye · Adrian Bors
|
ExHall D Poster #448 | |
DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting
Poster Session 5
Seungjun Lee · Gim Hee Lee
|
ExHall D Poster #65 | |
ViKIENet: Towards Efficient 3D Object Detection with Virtual Key Instance Enhanced Network
Poster Session 3
Zhuochen Yu · Bijie Qiu · Andy W. H. Khong
|
ExHall D Poster #116 | |
Sea-ing in Low-light
Poster Session 4
Nisha Varghese · A. N. Rajagopalan
|
ExHall D Poster #76 | |
DynaMoDe-NeRF: Motion-aware Deblurring Neural Radiance Field for Dynamic Scenes
Poster Session 5
Ashish Kumar · A. N. Rajagopalan
|
ExHall D Poster #64 | |
GPS as a Control Signal for Image Generation
Poster Session 1
Chao Feng · Ziyang Chen · Aleksander Holynski · Alexei A. Efros · Andrew Owens
|
ExHall D Poster #250 | |
Dynamic Content Prediction with Motion-aware Priors for Blind Face Video Restoration
Poster Session 4
Lianxin Xie · csbingbing zheng · Si Wu · Hau San Wong
|
ExHall D Poster #192 | |
Seeing is Not Believing: Adversarial Natural Object Optimization for Hard-Label 3D Scene Attacks
Poster Session 3
Daizong Liu · Wei Hu
|
ExHall D Poster #120 | |
Once-Tuning-Multiple-Variants: Tuning Once and Expanded as Multiple Vision-Language Model Variants
Poster Session 3
Chong Yu · Tao Chen · Zhongxue Gan
|
ExHall D Poster #389 | |
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition
Poster Session 4
Khanh Nguyen · Ghulam Mubashar Hassan · Ajmal Mian
|
ExHall D Poster #110 | |
On the Generalization of Handwritten Text Recognition Models
Poster Session 3
Carlos Garrido-Munoz · Jorge Calvo-Zaragoza
|
ExHall D Poster #443 | |
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
Poster Session 1
Claudia Cuttano · Gabriele Trivigno · Gabriele Rosi · Carlo Masone · Giuseppe Averta
|
ExHall D Poster #308 | |
Doppelgängers and Adversarial Vulnerability
George Kamberov
|
ExHall D Poster #464 | |
Adapting to Observation Length of Trajectory Prediction via Contrastive Learning
Poster Session 1
Ruiqi Qiu · JUN GONG · Xinyu Zhang · Siqi Luo · Bowen Zhang · Yi Cen
|
ExHall D Poster #138 | |
Dynamic Stereotype Theory Induced Micro-expression Recognition with Oriented Deformation
Poster Session 3
Bohao Zhang · Xuejiao Wang · Changbo Wang · Gaoqi He
|
ExHall D Poster #5 | |
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation
Poster Session 4
Henghui Du · Guangyao Li · Chang Zhou · Chunjie Zhang · Alan Zhao · Di Hu
|
ExHall D Poster #288 | |
LC-Mamba: Local and Continuous Mamba with Shifted Windows for Frame Interpolation
Poster Session 4
Min Wu Jeong · Chae Eun Rhee
|
ExHall D Poster #178 | |
Shading Meets Motion: Self-supervised Indoor 3D Reconstruction Via Simultaneous Shape-from-Shading and Structure-from-Motion
Poster Session 4
Guoyu Lu
|
ExHall D Poster #64 | |
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
Poster Session 4
Hyunsoo Kim · Donghyun Kim · Suhyun Kim
|
ExHall D Poster #234 | |
Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data
Poster Session 5
Haoxin Li · Boyang Li
|
ExHall D Poster #365 | |
Odd-One-Out: Anomaly Detection by Comparing with Neighbors
Poster Session 4
Ankan Kumar Bhunia · Changjian Li · Hakan Bilen
|
ExHall D Poster #437 | |
DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation
Poster Session 4
Xiaoliang Ju · Hongsheng Li
|
ExHall D Poster #36 | |
4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians
Poster Session 6
Hidenobu Matsuki · Gwangbin Bae · Andrew J. Davison
|
ExHall D Poster #74 | |
PreciseCam: Precise Camera Control for Text-to-Image Generation
Poster Session 1
Edurne Bernal-Berdun · Ana Serrano · Belen Masia · Matheus Gadelha · Yannick Hold-Geoffroy · Xin Sun · Diego Gutierrez
|
ExHall D Poster #246 | |
Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning
Poster Session 1
Huabin Liu · Filip Ilievski · Cees G. M. Snoek
|
ExHall D Poster #296 | |
Rethinking Epistemic and Aleatoric Uncertainty for Active Open-Set Annotation: An Energy-Based Approach
Poster Session 2
Chen-Chen Zong · Sheng-Jun Huang
|
ExHall D Poster #455 | |
Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks
Poster Session 6
Han Wang · Gang Wang · Huan Zhang
|
ExHall D Poster #363 | |
OffsetOPT: Explicit Surface Reconstruction without Normals
Poster Session 3
Huan Lei
|
ExHall D Poster #104 | |
Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection
Poster Session 5
Qi Chen · Hu Ding
|
ExHall D Poster #449 | |
FIFA: Fine-grained Inter-frame Attention for Driver's Video Gaze Estimation
Poster Session 4
Daosong Hu · Mingyue Cui · Kai Huang
|
ExHall D Poster #284 | |
Sketchy Bounding-box Supervision for 3D Instance Segmentation
Poster Session 2
qian deng · Le Hui · Jin Xie · Jian Yang
|
ExHall D Poster #336 | |
MuTri: Multi-view Tri-alignment for OCT to OCTA 3D Image Translation
Poster Session 4
zhuangzhuang chen · hualiang wang · Chubin Ou · Xiaomeng Li
|
ExHall D Poster #483 | |
DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI
Poster Session 3
Sangmin Lee · Sungyong Park · Heewon Kim
|
ExHall D Poster #146 | |
SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering
Poster Session 4
Hanxiao Sun · Yupeng Gao · Jin Xie · Jian Yang · Beibei Wang
|
ExHall D Poster #28 | |
PERSE: Personalized 3D Generative Avatars from A Single Portrait
Poster Session 4
Hyunsoo Cha · Inhee Lee · Hanbyul Joo
|
ExHall D Poster #9 | |
Improving Editability in Image Generation with Layer-wise Memory
Poster Session 2
Daneul Kim · Jaeah Lee · Jaesik Park
|
ExHall D Poster #241 | |
CDI: Copyrighted Data Identification in Diffusion Models
Poster Session 4
Jan Dubiński · Antoni Kowalczuk · Franziska Boenisch · Adam Dziedzic
|
ExHall D Poster #276 | |
Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention
Poster Session 5
Kyungmin Jo · Jooyeol Yun · Jaegul Choo
|
ExHall D Poster #244 | |
Camera Resection from Known Line Pencils and a Radially Distorted Scanline
Poster Session 4
Juan Carlos Dibene Simental · Enrique Dunn
|
ExHall D Poster #80 | |
Community Forensics: Using Thousands of Generators to Train Fake Image Detectors
Poster Session 2
Jeongsoo Park · Andrew Owens
|
ExHall D Poster #274 | |
Hypergraph Vision Transformers: Images are More than Nodes, More than Edges
Poster Session 2
Joshua Fixelle
|
ExHall D Poster #417 | |
Enhancing Creative Generation on Stable Diffusion-based Models
Poster Session 6
Jiyeon Han · Dahee Kwon · Gayoung Lee · Junho Kim · Jaesik Choi
|
ExHall D Poster #233 | |
Noise Modeling in One Hour: Minimizing Preparation Efforts for Self-supervised Low-Light RAW Image Denoising
Poster Session 2
Feiran Li · Haiyang Jiang · Daisuke Iso
|
ExHall D Poster #24 | |
Wavelet and Prototype Augmented Query-based Transformer for Pixel-level Surface Defect Detection
Poster Session 5
Feng Yan · Xiaoheng Jiang · Yang Lu · Jiale Cao · Dong Chen · Mingliang Xu
|
ExHall D Poster #272 | |
SfM-Free 3D Gaussian Splatting via Hierarchical Training
Poster Session 5
Bo Ji · Angela Yao
|
ExHall D Poster #57 | |
Heterogeneous Skeleton-Based Action Representation Learning
Poster Session 4
Xiaoyan Ma · jidong kuang · Hongsong Wang · Jie Gui
|
ExHall D Poster #320 | |
Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness
Beier Zhu · Jiequan Cui · Hanwang Zhang · Chi Zhang
|
ExHall D Poster #424 | |
Towards Realistic Example-based Modeling via 3D Gaussian Stitching
Poster Session 6
Xinyu Gao · Ziyi Yang · Bingchen Gong · Xiaoguang Han · Sipeng Yang · Xiaogang Jin
|
ExHall D Poster #42 | |
DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry
Poster Session 5
Jing Li · Yihang Fu · Falai Chen
|
ExHall D Poster #37 | |
Navigating the Unseen: Zero-shot Scene Graph Generation via Capsule-Based Equivariant Features
Poster Session 6
Wenhuan Huang · Yi JI · guiqian zhu · Ying Li · chunping Liu
|
ExHall D Poster #315 | |
Point Cloud Upsampling Using Conditional Diffusion Module with Adaptive Noise Suppression
Poster Session 4
Boqian Zhang · shen yang · Hao Chen · Chao Yang · Jing Jia · Guang Jiang
|
ExHall D Poster #112 | |
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Poster Session 5
Ali Hatamizadeh · Jan Kautz
|
ExHall D Poster #403 | |
Beyond Clean Training Data: A Versatile and Model-Agnostic Framework for Out-of-Distribution Detection with Contaminated Training Data
Poster Session 2
Yuchuan Li · Jae-Mo Kang · Il-Min Kim
|
ExHall D Poster #458 | |
Do Your Best and Get Enough Rest for Continual Learning
Poster Session 2
Hankyul Kang · Gregor Seifer · Donghyun Lee · Jongbin Ryu
|
ExHall D Poster #448 | |
ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration
Poster Session 2
Johan Edstedt · André Mateus · Alberto Jaenal
|
ExHall D Poster #115 | |
TSAM: Temporal SAM Augmented with Multimodal Prompts for Referring Audio-Visual Segmentation
Poster Session 5
Abduljalil Radman · Jorma Laaksonen
|
ExHall D Poster #280 | |
Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly Detection
Poster Session 4
Farzad Beizaee · Gregory A. Lodygensky · Christian Desrosiers · Jose Dolz
|
ExHall D Poster #314 | |
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Poster Session 3
Jongseong Bae · Junwoo Ha · Ha Young Kim
|
ExHall D Poster #125 | |
VL2Lite: Task-Specific Knowledge Distillation from Large Vision-Language Models to Lightweight Networks
Poster Session 6
Jinseong Jang · Chunfei Ma · Byeongwon Lee
|
ExHall D Poster #375 | |
Improving Personalized Search with Regularized Low-Rank Parameter Updates
Fiona Ryan · Josef Sivic · Fabian Caba Heilbron · Judy Hoffman · James Rehg · Bryan Russell
|
ExHall D Poster #376 | |
GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection
Poster Session 1
Jeffri Erwin Murrugarra Llerena · José Henrique Marques · Claudio Jung
|
ExHall D Poster #326 | |
Vision-Language Embodiment for Monocular Depth Estimation
Poster Session 6
Jinchang Zhang · Guoyu Lu
|
ExHall D Poster #318 | |
Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise
Poster Session 6
Brayan Monroy · Jorge Bacca · Julián Tachella
|
ExHall D Poster #190 | |
Conformal Prediction for Zero-Shot Models
Poster Session 4
Julio Silva-Rodríguez · Ismail Ben Ayed · Jose Dolz
|
ExHall D Poster #393 | |
On the Consistency of Video Large Language Models in Temporal Comprehension
Poster Session 3
Minjoon Jung · Junbin Xiao · Byoung-Tak Zhang · Angela Yao
|
ExHall D Poster #293 | |
Continuous Locomotive Crowd Behavior Generation
Poster Session 5
Inhwan Bae · Junoh Lee · Hae-Gon Jeon
|
ExHall D Poster #130 | |
Auto-Encoded Supervision for Perceptual Image Super-Resolution
Poster Session 4
MinKyu Lee · Sangeek Hyun · Woojin Jun · Jae-Pil Heo
|
ExHall D Poster #205 | |
Cubify Anything: Scaling Indoor 3D Object Detection
Justin Lazarow · David Griffiths · Gefen Kohavi · Francisco Crespo · Afshin Dehghan
|
ExHall D Poster #112 | |
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion
Poster Session 6
Kai He · Chin-Hsuan Wu · Igor Gilitschenski
|
ExHall D Poster #45 | |
What Makes a Good Dataset for Knowledge Distillation?
Poster Session 5
Logan Frank · Jim Davis
|
ExHall D Poster #262 | |
PCM: Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
Poster Session 5
Junhyuk So · Jiwoong Shin · Chaeyeon Jang · Eunhyeok Park
|
ExHall D Poster #216 | |
EmoEdit: Evoking Emotions through Image Manipulation
Poster Session 5
Jingyuan Yang · Jiawei Feng · Weibin Luo · Dani Lischinski · Daniel Cohen-Or · Hui Huang
|
ExHall D Poster #350 | |
Multiple Object Tracking as ID Prediction
Poster Session 6
Ruopeng Gao · Ji Qi · Limin Wang
|
ExHall D Poster #163 | |
ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models
Poster Session 5
Fernando Julio Cendra · Kai Han
|
ExHall D Poster #260 | |
Mr. DETR: Instructive Multi-Route Training for Detection Transformers
Poster Session 2
Chang-Bin Zhang · Yujie Zhong · Kai Han
|
ExHall D Poster #434 | |
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
Chang-Bin Zhang · Jinhong Ni · Yujie Zhong · Kai Han
|
ExHall D Poster #429 | |
Lifting Motion to the 3D World via 2D Diffusion
Jiaman Li · Karen Liu · Jiajun Wu
|
ExHall D Poster #164 | |
Scale Efficient Training for Large Datasets
Poster Session 4
Qing Zhou · Junyu Gao · Qi Wang
|
ExHall D Poster #443 | |
Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation
Poster Session 5
Junjie Chen · Weilong Chen · Yifan Zuo · Yuming Fang
|
ExHall D Poster #94 | |
Seeing Speech and Sound: Distinguishing and Locating Audio Sources in Visual Scenes
Poster Session 3
Hyeonggon Ryu · Seongyu Kim · Joon Chung · Arda Senocak
|
ExHall D Poster #276 | |
From Faces to Voices: Learning Hierarchical Representations for High-quality Video-to-Speech
Jihoon Kim · Jeongsoo Choi · Jaehun Kim · Chaeyoung Jung · Joon Chung
|
ExHall D Poster #2 | |
Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification
Poster Session 1
Dongseob Kim · Hyunjung Shim
|
ExHall D Poster #430 | |
The Art of Deception: Color Visual Illusions and Diffusion Models
Poster Session 4
Alexandra Gomez-Villa · Kai Wang · C.Alejandro Parraga · Bartłomiej Twardowski · Jesus Malo · Javier Vazquez-Corral · Joost van de Weijer
|
ExHall D Poster #273 | |
Minimizing Labeled, Maximizing Unlabeled: An Image-Driven Approach for Video Instance Segmentation
Poster Session 4
Fangyun Wei · Jinjing Zhao · Kun Yan · Chang Xu
|
ExHall D Poster #334 | |
BF-STVSR: B-Splines and Fourier---Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution
Poster Session 6
Eunjin Kim · HYEONJIN KIM · Kyong Hwan Jin · Jaejun Yoo
|
ExHall D Poster #175 | |
SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models
Poster Session 3
Jaerin Lee · Daniel Jung · Kanggeon Lee · Kyoung Mu Lee
|
ExHall D Poster #226 | |
HistoFS: Non-IID Histopathologic Whole Slide Image Classification via Federated Style Transfer with RoI-Preserving
Poster Session 6
Farchan Hakim Raswa · Chun-Shien Lu · Jia-Ching Wang
|
ExHall D Poster #393 | |
AutoURDF: Unsupervised Robot Modeling from Point Cloud Frames Using Cluster Registration
Poster Session 6
Jiong Lin · Lechen Zhang · Kwansoo Lee · Jialong Ning · Judah A Goldfeder · Hod Lipson
|
ExHall D Poster #140 | |
Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
Poster Session 1
Akhil Perincherry · Jacob Krantz · Stefan Lee
|
ExHall D Poster #350 | |
MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds
Jiahui Lei · Yijia Weng · Adam W Harley · Leonidas Guibas · Kostas Daniilidis
|
ExHall D Poster #70 | |
Sample- and Parameter-Efficient Auto-Regressive Image Models
Poster Session 6
Elad Amrani · Leonid Karlinsky · Alex M. Bronstein
|
ExHall D Poster #380 | |
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Poster Session 3
Shufan Li · Konstantinos Kallidromitis · Akash Gokul · Zichun Liao · Yusuke Kato · Kazuki Kozuka · Aditya Grover
|
ExHall D Poster #241 | |
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Poster Session 2
Fida Mohammad Thoker · Letian Jiang · Chen Zhao · Bernard Ghanem
|
ExHall D Poster #293 | |
Video Summarization with Large Language Models
Poster Session 4
Min Jung Lee · Dayoung Gong · Minsu Cho
|
ExHall D Poster #304 | |
Consistent Normal Orientation for 3D Point Clouds via Least Squares on Delaunay Graph
Poster Session 4
Rao Fu · Jianmin Zheng · Liang Yu
|
ExHall D Poster #107 | |
Zero-shot RGB-D Point Cloud Registration with Pre-trained Large Vision Model
Poster Session 4
Haobo Jiang · Jin Xie · Jian Yang · Liang Yu · Jianmin Zheng
|
ExHall D Poster #108 | |
Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild
Poster Session 1
Damien Teney · Liangze Jiang · Florin Gogianu · Ehsan Abbasnejad
|
ExHall D Poster #396 | |
Exploring Semantic Feature Discrimination for Perceptual Image Super-Resolution and Opinion-Unaware No-Reference Image Quality Assessment
Poster Session 6
Guanglu Dong · Xiangyu Liao · Mingyang Li · Guihuan Guo · Chao Ren
|
ExHall D Poster #192 | |
SLVR: Super-Light Visual Reconstruction via Blueprint Controllable Convolutions and Exploring Feature Diversity Representation
Poster Session 1
Ning Ni · Libao Zhang
|
ExHall D Poster #22 | |
Hazy Low-Quality Satellite Video Restoration Via Learning Optimal Joint Degradation Patterns and Continuous-Scale Super-Resolution Reconstruction
Poster Session 3
Ning Ni · Libao Zhang
|
ExHall D Poster #195 | |
MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking
Poster Session 4
Haolin Qin · Tingfa Xu · Tianhao Li · Zhenxiang Chen · Tao Feng · Jianan Li
|
ExHall D Poster #101 | |
ShiftwiseConv: Small Convolutional Kernel with Large Kernel Effect
Poster Session 5
Dachong Li · li li · zhuangzhuang chen · Jianqiang Li
|
ExHall D Poster #405 | |
Language-Guided Image Tokenization for Generation
Poster Session 4
Kaiwen Zha · Lijun Yu · Alireza Fathi · David A. Ross · Cordelia Schmid · Dina Katabi · Xiuye Gu
|
ExHall D Poster #252 | |
MaSS13K: A Matting-level Semantic Segmentation Benchmark
Poster Session 3
Chenxi Xie · Minghan LI · Hui Zeng · Jun Luo · Lei Zhang
|
ExHall D Poster #325 | |
Self-Expansion of Pre-trained Models with Mixture of Adapters for Continual Learning
Poster Session 2
Huiyi Wang · Haodong Lu · Lina Yao · Dong Gong
|
ExHall D Poster #449 | |
PRaDA: Projective Radial Distortion Averaging
Poster Session 5
Daniil Sinitsyn · Linus Härenstam-Nielsen · Daniel Cremers
|
ExHall D Poster #81 | |
Audio-Visual Semantic Graph Network for Audio-Visual Event Localization
Poster Session 5
Liang Liu · Shuaiyong Li · Yongqiang Zhu
|
ExHall D Poster #281 | |
GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency
Poster Session 1
Dongyue Lu · Lingdong Kong · Tianxin Huang · Gim Hee Lee
|
ExHall D Poster #141 | |
Frequency Dynamic Convolution for Dense Image Prediction
Poster Session 6
Linwei Chen · Lin Gu · Liang Li · Chenggang Yan · Ying Fu
|
ExHall D Poster #386 | |
TSP-Mamba: The Travelling Salesman Problem Meets Mamba for Image Super-resolution and Beyond
Poster Session 6
Kun Zhou · Xinyu Lin · Jiangbo Lu
|
ExHall D Poster #187 | |
Prof. Robot: Differentiable Robot Rendering Without Static and Self-Collisions
Poster Session 5
Quanyuan Ruan · Jiabao Lei · Wenhao Yuan · Yanglin Zhang · Dekun Lu · Guiliang Liu · Kui Jia
|
ExHall D Poster #143 | |
IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera
Jian Huang · Chengrui Dong · Xuanhua Chen · Peidong Liu
|
ExHall D Poster #75 | |
Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation
Poster Session 6
Ting Liu · Siyuan Li
|
ExHall D Poster #333 | |
EigenGS Representation: From Eigenspace to Gaussian Image Space
Poster Session 3
LO-WEI TAI · Ching-En Li · Cheng-Lin Chen · Chih-Jung Tsai · Hwann-Tzong Chen · Tyng-Luh Liu
|
ExHall D Poster #271 | |
LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning
Poster Session 3
Xuan Liu · Xiaobin Chang
|
ExHall D Poster #446 | |
Erase Diffusion: Empowering Object Removal Through Calibrating Diffusion Pathways
Yi Liu · Hao Zhou · Benlei Cui · Wenxiang Shang · Ran Lin
|
ExHall D Poster #212 | |
Uncertainty Weighted Gradients for Model Calibration
Poster Session 3
Jinxu Lin · Linwei Tao · Minjing Dong · Chang Xu
|
ExHall D Poster #464 | |
GaussianUDF: Inferring Unsigned Distance Functions through 3D Gaussian Splatting
Shujuan Li · Yu-Shen Liu · Zhizhong Han
|
ExHall D Poster #92 | |
MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining
Poster Session 2
Yunze Liu · Li Yi
|
ExHall D Poster #410 | |
Joint Scheduling of Causal Prompts and Tasks for Multi-Task Learning
Poster Session 5
Chaoyang Li · Jianyang Qin · Jinhao Cui · Zeyu Liu · Ning Hu · Qing Liao
|
ExHall D Poster #390 | |
UniAP: Unifying Inter- and Intra-Layer Automatic Parallelism by Mixed Integer Quadratic Programming
Poster Session 5
Hao Lin · Ke Wu · Jie Li · Jun Li · Wu-Jun Li
|
ExHall D Poster #214 | |
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
Poster Session 1
Sheng Zhou · Junbin Xiao · Qingyun Li · Yicong Li · Xun Yang · Dan Guo · Meng Wang · Tat-seng Chua · Angela Yao
|
ExHall D Poster #305 | |
GliaNet: Adaptive Neural Network Structure Learning with Glia-Driven
Poster Session 5
Mengqiao Han · Liyuan Pan · Xiabi Liu
|
ExHall D Poster #401 | |
IRGS: Inter-Reflective Gaussian Splatting with 2D Gaussian Ray Tracing
Poster Session 3
Chun Gu · Xiaofei Wei · Zixuan Zeng · Yuxuan Yao · Li Zhang
|
ExHall D Poster #27 | |
Learning Textual Prompts for Open-World Semi-Supervised Learning
Poster Session 3
Yuxin Fan · Junbiao Cui · Jiye Liang
|
ExHall D Poster #393 | |
RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection
Poster Session 5
Yunfei Long · Abhinav Kumar · Xiaoming Liu · Daniel Morris
|
ExHall D Poster #117 | |
Joint Vision-Language Social Bias Removal for CLIP
Poster Session 1
Haoyu Zhang · Yangyang Guo · Mohan Kankanhalli
|
ExHall D Poster #389 | |
Identifying and Mitigating Spurious Correlation in Multi-Task Learning
Poster Session 5
Junyi Chai · Shenyu Lu · Xiaoqian Wang
|
ExHall D Poster #446 | |
GazeGene: Large-scale Synthetic Gaze Dataset with 3D Eyeball Annotations
Poster Session 4
Yiwei Bao · Zhiming Wang · Feng Lu
|
ExHall D Poster #283 | |
Gaussian Splatting for Efficient Satellite Image Photogrammetry
Poster Session 2
Luca Savant Aira · Gabriele Facciolo · Thibaud Ehret
|
ExHall D Poster #49 | |
Bringing CLIP to the Clinic: Dynamic Soft Labels and Negation-Aware Learning for Medical Analysis
Poster Session 5
Hanbin Ko · Chang Min Park
|
ExHall D Poster #467 | |
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs
Poster Session 4
Lucas Ventura · Antoine Yang · Cordelia Schmid · Gul Varol
|
ExHall D Poster #301 | |
Unified Uncertainty-Aware Diffusion for Multi-Agent Trajectory Modeling
Poster Session 5
Guillem Font Font · Antonio Rubio · Luis Ferraz · Antonio Agudo
|
ExHall D Poster #135 | |
Relation3D: Enhancing Relation Modeling for Point Cloud Instance Segmentation
Poster Session 2
Edward LOO · Jiacheng Deng
|
ExHall D Poster #337 | |
Black-Box Forgery Attacks on Semantic Watermarks for Diffusion Models
Poster Session 5
Andreas Müller · Denis Lukovnikov · Jonas Thietke · Asja Fischer · Erwin Quiring
|
ExHall D Poster #256 | |
SLADE: Shielding against Dual Exploits in Large Vision-Language Models
Poster Session 5
Md Zarif Hossain · AHMED IMTEAJ
|
ExHall D Poster #308 | |
Style Quantization for Data-Efficient GAN Training
Poster Session 2
Jian Wang · Xin Lan · Ji-Zhe Zhou · Yuxin Tian · Jiancheng Lv
|
ExHall D Poster #223 | |
DA-VPT: Semantic-Guided Visual Prompt Tuning for Vision Transformers
Poster Session 1
Li Ren · Chen Chen · Liqiang Wang · Kien A. Hua
|
ExHall D Poster #402 | |
Closest Neighbors are Harmful for Lightweight Masked Auto-encoders
Poster Session 5
Jian Meng · Ahmed Hasssan · Li Yang · Deliang Fan · Jinwoo Shin · Jae-sun Seo
|
ExHall D Poster #400 | |
Generative Photomontage
Poster Session 2
Sean J. Liu · Nupur Kumari · Ariel Shamir · Jun-Yan Zhu
|
ExHall D Poster #245 | |
RNG: Relightable Neural Gaussians
Poster Session 6
Jiahui Fan · Fujun Luan · Jian Yang · Milos Hasan · Beibei Wang
|
ExHall D Poster #35 | |
DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification
Poster Session 5
Darryl Ho · Samuel Madden
|
ExHall D Poster #287 | |
Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves?
Poster Session 3
Yuan-Hong Liao · Rafid Mahmood · Sanja Fidler · David Acuna
|
ExHall D Poster #385 | |
Compositional Targeted Multi-Label Universal Perturbations
Poster Session 4
Hassan Mahmood · Ehsan Elhamifar
|
ExHall D Poster #454 | |
Customized Condition Controllable Generation for Video Soundtrack
Poster Session 5
Fan Qi · KunSheng Ma · Changsheng Xu
|
ExHall D Poster #277 | |
FASTer: Focal token Acquiring-and-Scaling Transformer for Long-term 3D Object Detection
Poster Session 4
Chenxu Dang · Pei An · Xinmin Zhang · ZaiPeng Duan · Xuzhong Hu · Jie Ma
|
ExHall D Poster #116 | |
SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction
Poster Session 2
ZaiPeng Duan · Xuzhong Hu · Pei An · Junfeng Ding · Jie Zhan · Chenxu Dang · Yunbiao Xu · Jie Ma
|
ExHall D Poster #132 | |
Be More Specific: Evaluating Object-centric Realism in Synthetic Images
Poster Session 6
Anqi Liang · Ciprian Adrian Corneanu · Qianli Feng · Giorgio Giannone · Aleix Martinez
|
ExHall D Poster #255 | |
Cheb-GR: Rethinking K-nearest Neighbor Search in Re-ranking for Person Re-identification
Poster Session 4
Jinxi Yang · He Li · Bo Du · Mang Ye
|
ExHall D Poster #330 | |
The Impact Label Noise and Choice of Threshold has on Cross-Entropy and Soft-Dice in Image Segmentation
Poster Session 4
Marcus Nordström · Atsuto Maki · Henrik Hult
|
ExHall D Poster #477 | |
DarkIR: Robust Low-Light Image Restoration
Poster Session 3
Daniel Feijoo · Juan C. Benito · Alvaro Garcia · Marcos Conde
|
ExHall D Poster #21 | |
FIction: 4D Future Interaction Prediction from Video
Kumar Ashutosh · Georgios Pavlakos · Kristen Grauman
|
ExHall D Poster #173 | |
V²Dial: Unification of Video and Visual Dialog via Multimodal Experts
Poster Session 2
Adnen Abdessaied · Anna Rohrbach · Marcus Rohrbach · Andreas Bulling
|
ExHall D Poster #312 | |
PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?
Poster Session 3
Martin Spitznagel · Jan Vaillant · Janis Keuper
|
ExHall D Poster #45 | |
Improved Monocular Depth Prediction Using Distance Transform Over Pre-semantic Contours with Self-supervised Neural Networks
Poster Session 5
Marwane Hariat · Antoine Manzanera · David Filliat
|
ExHall D Poster #78 | |
SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving
Poster Session 3
Georg Hess · Carl Lindström · Maryam Fatemi · Christoffer Petersson · Lennart Svensson
|
ExHall D Poster #129 | |
A Distractor-Aware Memory for Visual Object Tracking with SAM2
Poster Session 5
Alan Lukezic · Jovana Videnović · Matej Kristan
|
ExHall D Poster #309 | |
Compositional Caching for Training-free Open-vocabulary Attribute Detection
Poster Session 3
Marco Garosi · Alessandro Conti · Gaowen Liu · Elisa Ricci · Massimiliano Mancini
|
ExHall D Poster #426 | |
Zero-Shot Blind-spot Image Denoising via Implicit Neural Sampling
Poster Session 2
Yuhui Quan · Tianxiang Zheng · Zhiyuan Ma · Hui Ji
|
ExHall D Poster #204 | |
Fingerprinting Denoising Diffusion Probabilistic Models
Poster Session 6
Huan Teng · Yuhui Quan · Chengyu Wang · Jun Huang · Hui Ji
|
ExHall D Poster #252 | |
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table
Poster Session 6
Yusuke Matsui
|
ExHall D Poster #410 | |
OpenMIBOOD: Open Medical Imaging Benchmarks for Out-Of-Distribution Detection
Poster Session 5
Max Gutbrod · David Rauber · Danilo Weber Nunes · Christoph Palm
|
ExHall D Poster #465 | |
Realistic Test-Time Adaptation of Vision-Language Models
Maxime Zanella · Clément Fuchs · Christophe De Vleeschouwer · Ismail Ben Ayed
|
ExHall D Poster #388 | |
Temporally Consistent Object-Centric Learning by Contrasting Slots
Poster Session 2
Anna Manasyan · Maximilian Seitzer · Filip Radovic · Georg Martius · Andrii Zadaianchuk
|
ExHall D Poster #161 | |
MODA: Motion-Drift Augmentation for Inertial Human Motion Analysis
Poster Session 6
Yinghao Wu · Shihui Guo · Yipeng Qin
|
ExHall D Poster #153 | |
Saliuitl: Ensemble Salience Guided Recovery of Adversarial Patches against CNNs
Poster Session 4
Mauricio Byrd Victorica · György Dán · Henrik Sandberg
|
ExHall D Poster #434 | |
VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models
Poster Session 6
Dahun Kim · AJ Piergiovanni · Ganesh Satish Mallya · Anelia Angelova
|
ExHall D Poster #279 | |
PURA: Parameter Update-Recovery Test-Time Adaption for RGB-T Tracking
Poster Session 5
Zekai Shao · Yufan Hu · Bin Fan · Hongmin Liu
|
ExHall D Poster #99 | |
Towards Generalizable Trajectory Prediction using Dual-Level Representation Learning and Adaptive Prompting
Poster Session 6
Kaouther Messaoud · Matthieu Cord · Alex Alahi
|
ExHall D Poster #134 | |
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary
Poster Session 1
Kevin Qinghong Lin · Mike Zheng Shou
|
ExHall D Poster #292 | |
Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
Poster Session 6
Matteo Farina · Massimiliano Mancini · Giovanni Iacca · Elisa Ricci
|
ExHall D Poster #367 | |
Personalized Preference Fine-tuning of Diffusion Models
Poster Session 2
Meihua Dang · Anikait Singh · Linqi Zhou · Stefano Ermon · Jiaming Song
|
ExHall D Poster #253 | |
Feature-Preserving Mesh Decimation for Normal Integration
Poster Session 2
Moritz Heep · Sven Behnke · Eduard Zell
|
ExHall D Poster #32 | |
SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation
Poster Session 4
Jihuai Zhao · Junbao Zhuo · Jiansheng Chen · Huimin Ma
|
ExHall D Poster #336 | |
Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
Poster Session 3
Aodi Li · Liansheng Zhuang · Xiao Long · MingHong Yao · Shafei Wang
|
ExHall D Poster #450 | |
Free Lunch Enhancements for Multi-modal Crowd Counting
Poster Session 3
Haoliang Meng · Xiaopeng Hong · Zhengqin Lai · Miao Shang
|
ExHall D Poster #322 | |
EquiPose: Exploiting Permutation Equivariance for Relative Camera Pose Estimation
Poster Session 1
Yuzhen Liu · Qiulei Dong
|
ExHall D Poster #89 | |
Exploring Timeline Control for Facial Motion Generation
Poster Session 1
Yifeng Ma · Jinwei Qi · Chaonan Ji · Peng Zhang · Bang Zhang · Zhidong Deng · Liefeng Bo
|
ExHall D Poster #164 | |
Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras
Hoonhee Cho · Jae-Young Kang · Youngho Kim · Kuk-Jin Yoon
|
ExHall D Poster #100 | |
DiffLO: Semantic-Aware LiDAR Odometry with Diffusion-Based Refinement
Poster Session 4
huang yongshu · Chen Liu · Minghang Zhu · Sheng Ao · Chenglu Wen · Cheng Wang
|
ExHall D Poster #118 | |
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
Poster Session 6
Joya Chen · Yiqi Lin · Ziyun Zeng · Wei Li · Zejun Ma · Mike Zheng Shou
|
ExHall D Poster #281 | |
TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning
Poster Session 3
Seungmin Baek · Soyul Lee · Hayeon Jo · Hyesong Choi · Dongbo Min
|
ExHall D Poster #402 | |
Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment
Poster Session 6
Johannes Schusterbauer · Ming Gui · Frank Fundel · Björn Ommer
|
ExHall D Poster #208 | |
HalLoc: Token-level Localization of Hallucinations for Vision Language Models
Poster Session 6
Eunkyu Park · Minyeong Kim · Gunhee Kim
|
ExHall D Poster #358 | |
Unveiling Differences in Generative Models: A Scalable Differential Clustering Approach
Poster Session 2
Jingwei Zhang · Mohammad Jalali · Cheuk Ting Li · Farzan Farnia
|
ExHall D Poster #276 | |
SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining
Poster Session 2
Mingjin Zhang · Xiaolong Li · Fei Gao · Jie Guo · Xinbo Gao · Jing Zhang
|
ExHall D Poster #398 | |
U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening
Poster Session 5
Sungpyo Kim · Jeonghyeok Do · Jaehyup Lee · Munchurl Kim
|
ExHall D Poster #191 | |
SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video
Poster Session 6
Jongmin Park · Minh-Quan Viet Bui · Juan Luis Gonzalez Bello · Jaeho Moon · Jihyong Oh · Munchurl Kim
|
ExHall D Poster #69 | |
Reanimating Images using Neural Representations of Dynamic Stimuli
Poster Session 2
Jacob Yeung · Andrew Luo · Gabriel Sarch · Margaret Marie Henderson · Deva Ramanan · Michael J. Tarr
|
ExHall D Poster #220 | |
Dynamic Neural Surfaces for Elastic 4D Shape Representation and Analysis
Poster Session 5
Awais Nizamani · Hamid Laga · Guanjin Wang · Farid Boussaid · Mohammed Bennamoun · Anuj Srivastava
|
ExHall D Poster #70 | |
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Poster Session 2
Ziyang Luo · Haoning Wu · Dongxu Li · Jing Ma · Mohan Kankanhalli · Junnan Li
|
ExHall D Poster #295 | |
AnyMap: Learning a General Camera Model for Structure-from-Motion with Unknown Distortion in Dynamic Scenes
Poster Session 4
Andrea Porfiri Dal Cin · Georgi Dikov · Jihong Ju · Mohsen Ghafoorian
|
ExHall D Poster #81 | |
SGSST: Scaling Gaussian Splatting Style Transfer
Poster Session 6
Bruno Galerne · Jianling WANG · Lara Raad · Jean-michel Morel
|
ExHall D Poster #36 | |
FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs
Poster Session 3
Mothilal Asokan · Kebin wu · Fatima Albreiki
|
ExHall D Poster #367 | |
UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning
Poster Session 2
Long Zhou · Fereshteh Shakeri · Aymen Sadraoui · Mounir Kaaniche · Jean-Christophe Pesquet · Ismail Ben Ayed
|
ExHall D Poster #409 | |
HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks
Poster Session 5
Maria Pilligua · Danna Xue · Javier Vazquez-Corral
|
ExHall D Poster #178 | |
Neural Inverse Rendering from Propagating Light
Poster Session 3
Anagh Malik · Benjamin Attal · Andrew Xie · Matthew O’Toole · David B. Lindell
|
ExHall D Poster #30 | |
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
Poster Session 5
Ahyun Seo · Minsu Cho
|
ExHall D Poster #101 | |
3D-HGS: 3D Half-Gaussian Splatting
Poster Session 3
Haolin Li · Jinyang Liu · Mario Sznaier · Octavia Camps
|
ExHall D Poster #33 | |
TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation
Poster Session 6
Hongxiang Zhao · Xingchen Liu · Mutian Xu · Yiming Hao · Weikai Chen · Xiaoguang Han
|
ExHall D Poster #145 | |
Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
Poster Session 3
Feifei Li · Mi Zhang · Yiming Sun · Min Yang
|
ExHall D Poster #248 | |
Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting
Hanxi Liu · Yifang Men · Zhouhui Lian
|
ExHall D Poster #11 | |
BioX-CPath: Biologically-driven Explainable Diagnostics for Multistain IHC Computational Pathology
Poster Session 2
Amaya Gallagher-Syed · Henry Senior · Omnia Alwazzan · Elena Pontarini · Michele Bombardieri · Costantino Pitzalis · Myles J. Lewis · Michael R Barnes · Luca Rossi · Greg Slabaugh
|
ExHall D Poster #476 | |
FreqDebias: Towards Generalizable Deepfake Detection via Consistency-Driven Frequency Debiasing
Poster Session 2
Hossein Kashiani · Niloufar Alipour Talemi · Fatemeh Afghah
|
ExHall D Poster #325 | |
Optical-Flow Guided Prompt Optimization for Coherent Video Generation
Poster Session 2
Hyelin Nam · Jaemin Kim · Dohun Lee · Jong Chul Ye
|
ExHall D Poster #236 | |
QuCOOP: A Versatile Framework for Solving Composite and Binary-Parametrised Problems on Quantum Annealers
Natacha Kuete Meli · Vladislav Golyanik · Marcel Seelbach Benkner · Michael Moeller
|
ExHall D Poster #70 | |
ViiNeuS: Volumetric Initialization for Implicit Neural Surface Reconstruction of Urban Scenes with Limited Image Overlap
Poster Session 3
Hala Djeghim · Nathan Piasco · Moussab Bennehar · Luis Guillermo Roldao Jimenez · Dzmitry Tsishkou · Désiré Sidibé
|
ExHall D Poster #117 | |
Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control
Poster Session 1
Basim Azam · Naveed Akhtar
|
ExHall D Poster #269 | |
The Illusion of Unlearning: The Unstable Nature of Machine Unlearning in Text-to-Image Diffusion Models
Poster Session 3
Naveen George · Karthik Nandan Dasaraju · Rutheesh Reddy Chittepu · Konda Reddy Mopuri
|
ExHall D Poster #261 | |
FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement
Ian Huang · Yanan Bao · Karen Truong · Howard Zhou · Cordelia Schmid · Leonidas Guibas · Alireza Fathi
|
ExHall D Poster #269 | |
Learning to Filter Outlier Edges in Global SfM
Nicole Damblon · Marc Pollefeys · Daniel Barath
|
ExHall D Poster #87 | |
Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling
Poster Session 5
Nannan Li · Kevin Shih · Bryan A. Plummer
|
ExHall D Poster #18 | |
Generative Modeling of Class Probability for Multi-Modal Representation Learning
JungKyoo Shin · Bumsoo Kim · Eunwoo Kim
|
ExHall D Poster #469 | |
Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations
Poster Session 3
Jungin Park · Jiyoung Lee · Kwanghoon Sohn
|
ExHall D Poster #288 | |
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
Poster Session 2
Federico Cocchi · Nicholas Moratelli · Marcella Cornia · Lorenzo Baraldi · Rita Cucchiara
|
ExHall D Poster #365 | |
RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects
Poster Session 2
Jaeguk Kim · Jaewoo Park · Keuntek Lee · Nam Ik Cho
|
ExHall D Poster #102 | |
Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization
Poster Session 6
Dongkwan Lee · Kyomin Hwang · Nojun Kwak
|
ExHall D Poster #426 | |
AnySat: One Earth Observation Model for Many Resolutions, Scales, and Modalities
Guillaume Astruc · Nicolas Gonthier · Clement Mallet · Loic Landrieu
|
ExHall D Poster #355 | |
Boost Your Human Image Generation Model via Direct Preference Optimization
Sanghyeon Na · Yonggyu Kim · Hyunjoon Lee
|
ExHall D Poster #238 | |
Curriculum Coarse-to-Fine Selection for High-IPC Dataset Distillation
Poster Session 4
Yanda Chen · Gongwei Chen · Miao Zhang · Weili Guan · Liqiang Nie
|
ExHall D Poster #441 | |
It’s a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data
Poster Session 5
Dominik Schnaus · Nikita Araslanov · Daniel Cremers
|
ExHall D Poster #377 | |
Multi-View Pose-Agnostic Change Localization with Zero Labels
Poster Session 3
Chamuditha Jayanga Galappaththige · Jason Lai · Lloyd Windrim · Donald G. Dansereau · Niko Suenderhauf · Dimity Miller
|
ExHall D Poster #92 | |
GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping
Poster Session 2
Jinfeng Liu · Lingtong Kong · Bo Li · Dan Xu
|
ExHall D Poster #52 | |
Mind the Gap: Detecting Black-box Adversarial Attacks in the Making through Query Update Analysis
Poster Session 2
Jeonghwan Park · Niall McLaughlin · Ihsen Alouani
|
ExHall D Poster #463 | |
Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation
Poster Session 2
Hyeonho Jeong · Chun-Hao P. Huang · Jong Chul Ye · Niloy J. Mitra · Duygu Ceylan
|
ExHall D Poster #183 | |
Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning
Poster Session 2
Dongyao Jiang · Haodong Jing · Yongqiang Ma · Nanning Zheng
|
ExHall D Poster #427 | |
MultiMorph: On-demand Atlas Construction
Poster Session 6
S. Mazdak Abulnaga · Andrew Hoopes · Neel Dey · Malte Hoffmann · Bruce Fischl · John Guttag · Adrian V. Dalca
|
ExHall D Poster #455 | |
Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Models
Davide Berasi · Matteo Farina · Massimiliano Mancini · Elisa Ricci · Nicola Strisciuglio
|
ExHall D Poster #371 | |
Learning Physics From Video: Unsupervised Physical Parameter Estimation for Continuous Dynamical Systems
Poster Session 6
Alejandro Castañeda Garcia · Jan Warchocki · Jan van Gemert · Daan Brinks · Nergis Tomen
|
ExHall D Poster #167 | |
CholecTrack20: A Multi-Perspective Tracking Dataset for Surgical Tools
Poster Session 2
Chinedu Innocent Nwoye · Kareem elgohary · Anvita A. Srinivas · Fauzan Zaid · Joël L. Lavanchy · Nicolas Padoy
|
ExHall D Poster #342 | |
DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
Poster Session 2
Hyeongjin Nam · Donghwan Kim · Jeongtaek Oh · Kyoung Mu Lee
|
ExHall D Poster #18 | |
DepthCues: Evaluating Monocular Depth Perception in Large Vision Models
Poster Session 4
Duolikun Danier · Mehmet Aygun · Changjian Li · Hakan Bilen · Oisin Mac Aodha
|
ExHall D Poster #405 | |
MVSAnywhere: Zero-Shot Multi-View Stereo
Poster Session 3
Sergio Izquierdo · Mohamed Sayed · Michael Firman · Guillermo Garcia-Hernando · Daniyar Turmukhambetov · Javier Civera · Oisin Mac Aodha · Gabriel Brostow · Jamie Watson
|
ExHall D Poster #81 | |
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment
Katrin Renz · Long Chen · Elahe Arani · Oleg Sinavski
|
ExHall D Poster #130 | |
Keyframe-Guided Creative Video Inpainting
Poster Session 3
Yuwei Guo · Ceyuan Yang · Anyi Rao · Chenlin Meng · Omer Bar-Tal · Shuangrui Ding · Maneesh Agrawala · Dahua Lin · Bo Dai
|
ExHall D Poster #225 | |
Multi-Modal Contrastive Masked Autoencoders: A Two-Stage Progressive Pre-training Approach for RGBD Datasets
Poster Session 4
Muhammad Jamal Jamal · Omid Mohareri
|
ExHall D Poster #204 | |
Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation
Poster Session 5
Markus Karmann · Onay Urfalioglu
|
ExHall D Poster #333 | |
TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection
Poster Session 5
Yoon Gyo Jung · Jaewoo Park · Jaeho Yoon · Kuan-Chuan Peng · Wonchul Kim · Andrew Beng Jin Teoh · Octavia Camps
|
ExHall D Poster #430 | |
Tiled Diffusion
Poster Session 2
Or Madar · Ohad Fried
|
ExHall D Poster #232 | |
Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation
Poster Session 1
Nadav Z. Cohen · Oron Nir · Ariel Shamir
|
ExHall D Poster #237 | |
Stable Flow: Vital Layers for Training-Free Image Editing
Poster Session 2
Omri Avrahami · Or Patashnik · Ohad Fried · Egor Nemchinov · Kfir Aberman · Dani Lischinski · Daniel Cohen-Or
|
ExHall D Poster #240 | |
Graph-Based 3D Lane Detection from Monocular Images
Poster Session 6
Halil İbrahim Öztürk · Muhammet Esat Kalfaoglu · Ozsel Kilinc
|
ExHall D Poster #129 | |
GOAL: Global-local Object Alignment Learning
Poster Session 1
Hyungyu Choi · Young Kyun Jang · Chanho Eom
|
ExHall D Poster #372 | |
Order-One Rolling Shutter Cameras
Marvin Anas Hahn · Kathlén Kohn · Orlando Marigliano · Tomas Pajdla
|
ExHall D Poster #82 | |
Deterministic Certification of Graph Neural Networks against Graph Poisoning Attacks with Arbitrary Perturbations
Poster Session 1
Jiate Li · Meng Pang · Yun Dong · Binghui Wang
|
ExHall D Poster #464 | |
Dual Focus-Attention Transformer for Robust Point Cloud Registration
Poster Session 3
Kexue Fu · Ming'zhi Yuan · Changwei Wang · Weiguang Pang · Jing Chi · Manning Wang · Longxiang Gao
|
ExHall D Poster #108 | |
Any-Resolution AI-Generated Image Detection by Spectral Learning
Poster Session 4
Dimitrios Karageorgiou · Symeon Papadopoulos · Ioannis Kompatsiaris · Efstratios Gavves
|
ExHall D Poster #279 | |
Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models
Poster Session 6
Yuhao Cui · Xinxing Zu · Wenhua Zhang · Zhongzhou Zhao · Jinyang Gao
|
ExHall D Poster #344 | |
VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors
Poster Session 4
Juil Koo · Paul Guerrero · Chun-Hao P. Huang · Duygu Ceylan · Minhyuk Sung
|
ExHall D Poster #180 | |
Motion Modes: What Could Happen Next?
Poster Session 1
Karran Pandey · Yannick Hold-Geoffroy · Matheus Gadelha · Niloy J. Mitra · Karan Singh · Paul Guerrero
|
ExHall D Poster #175 | |
A Tale of Two Classes: Adapting Supervised Contrastive Learning to Binary Imbalanced Datasets
Poster Session 2
David Mildenberger · Paul Hager · Daniel Rueckert · Martin J. Menten
|
ExHall D Poster #470 | |
Robust Multi-Object 4D Generation for In-the-wild Videos
Poster Session 5
Wen-Hsuan Chu · Lei Ke · Jianmeng Liu · Mingxiao Huo · Pavel Tokmakov · Katerina Fragkiadaki
|
ExHall D Poster #97 | |
Generalized Gaussian Entropy Model for Point Cloud Attribute Compression with Dynamic Likelihood Intervals
Poster Session 3
Changhao Peng
|
ExHall D Poster #109 | |
Descriptor-In-Pixel: Point-Feature Tracking For Pixel Processor Arrays
Poster Session 2
Laurie Bose · Piotr Dudek · Jianing Chen
|
ExHall D Poster #88 | |
Link to the Past: Temporal Propagation for Fast 3D Human Reconstruction from Monocular Video
Poster Session 2
Marchellus Matthew · Nadhira Noor · In Kyu Park
|
ExHall D Poster #72 | |
PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation
Poster Session 2
HsiaoYuan Hsu · Yuxin Peng
|
ExHall D Poster #262 | |
Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning
Poster Session 4
Maosen Zhao · Pengtao Chen · Chong Yu · Yan Wen · Xudong Tan · Tao Chen
|
ExHall D Poster #222 | |
Learning with Noisy Triplet Correspondence for Composed Image Retrieval
Poster Session 4
Shuxian Li · Changhao He · Xiting Liu · Joey Tianyi Zhou · Xi Peng · Peng Hu
|
ExHall D Poster #364 | |
DKC: Differentiated Knowledge Consolidation for Cloth-Hybrid Lifelong Person Re-identification
Poster Session 1
Zhenyu Cui · Jiahuan Zhou · Yuxin Peng
|
ExHall D Poster #324 | |
Global-Local Tree Search in VLMs for 3D Indoor Scene Generation
Poster Session 2
Wei Deng · Mengshi Qi · Huadong Ma
|
ExHall D Poster #345 | |
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
Poster Session 5
Taeyoung Yun · Dinghuai Zhang · Jinkyoo Park · Ling Pan
|
ExHall D Poster #248 | |
FastVLM: Efficient Vision Encoding for Vision Language Models
Poster Session 4
Pavan Vasu Vasu · Fartash Faghri · Chun-Liang Li · Cem Koc · Nate True · Gokula Krishnan Santhanam · Albert Antony · James Gabriel · Peter Grasch · Oncel Tuzel · Hadi Pouransari
|
ExHall D Poster #378 | |
Universal Domain Adaptation for Semantic Segmentation
Poster Session 1
Seun-An Choe · Keon Hee Park · Jinwoo Choi · Gyeong-Moon Park
|
ExHall D Poster #425 | |
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding
Poster Session 3
Pedro Hermosilla · Christian Stippel · Leon Sick
|
ExHall D Poster #400 | |
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
Poster Session 4
Hyunjun Lee · Hyunsoo Lee · Sookwan Han
|
ExHall D Poster #163 | |
Towards Human-Understandable Multi-Dimensional Concept Discovery
Poster Session 4
Arne Grobrügge · Niklas Kühl · Gerhard Satzger · Philipp Spitzer
|
ExHall D Poster #402 | |
Beyond Local Sharpness: Communication-Efficient Global Sharpness-aware Minimization for Federated Learning
Poster Session 5
Debora Caldarola · Pietro Cagnasso · Barbara Caputo · Marco Ciccone
|
ExHall D Poster #396 | |
Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation
Poster Session 2
Gianni Franchi · Nacim Belkhir · Dat Nguyen · Guoxuan Xia · Andrea Pilzer
|
ExHall D Poster #257 | |
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
Poster Session 3
Sanghyeok Lee · Joonmyung Choi · Hyunwoo J. Kim
|
ExHall D Poster #408 | |
VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding
Poster Session 3
Chaoyu Li · Eun Woo Im · Pooyan Fazli
|
ExHall D Poster #294 | |
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide
Poster Session 1
Dohun Lee · Bryan Sangwoo Kim · Geon Yeong Park · Jong Chul Ye
|
ExHall D Poster #233 | |
Unraveling Normal Anatomy via Fluid-Driven Anomaly Randomization
Poster Session 2
Peirong Liu · Ana Lawry Aguila · Juan Iglesias
|
ExHall D Poster #484 | |
PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches
Poster Session 2
Dennis Jacob · Chong Xiang · Prateek Mittal
|
ExHall D Poster #435 | |
RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models
Poster Session 5
Greg Heinrich · Mike Ranzinger · Danny Yin · Yao Lu · Jan Kautz · Bryan Catanzaro · Andrew Tao · Pavlo Molchanov
|
ExHall D Poster #136 | |
Functionality Understanding and Segmentation in 3D Scenes
Poster Session 5
Jaime Corsetti · Francesco Giuliari · Alice Fasoli · Davide Boscaini · Fabio Poiesi
|
ExHall D Poster #336 | |
F^3OCUS - Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics
Pramit Saha · Felix Wagner · Divyanshu Mishra · Can Peng · Anshul Thakur · David A. Clifton · Konstantinos Kamnitsas · Alison Noble
|
ExHall D Poster #401 | |
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval
Poster Session 4
Eric Xing · Pranavi Kolouju · Robert Pless · Abby Stylianou · Nathan Jacobs
|
ExHall D Poster #365 | |
Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation
Poster Session 3
Hadi Alzayer · Philipp Henzler · Jonathan T. Barron · Jia-Bin Huang · Pratul P. Srinivasan · Dor Verbin
|
ExHall D Poster #26 | |
AniGrad: Anisotropic Gradient-Adaptive Sampling for 3D Reconstruction From Monocular Video
Poster Session 5
Noah Stier · Alex Rich · Pradeep Sen · Tobias Höllerer
|
ExHall D Poster #73 | |
Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels
Poster Session 3
Pierre Vuillecard · Jean-Marc Odobez
|
ExHall D Poster #273 | |
Towards More General Video-based Deepfake Detection through Facial Component Guided Adaptation for Foundation Model
Poster Session 5
Yue-Hua Han · Tai-Ming Huang · Kailung Hua · Jun-Cheng Chen
|
ExHall D Poster #184 | |
ProbPose: A Probabilistic Approach to 2D Human Pose Estimation
Poster Session 6
Miroslav Purkrábek · Jiri Matas
|
ExHall D Poster #93 | |
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost
Poster Session 1
Haiyang Mei · Pengyu Zhang · Mike Zheng Shou
|
ExHall D Poster #310 | |
Integral Fast Fourier Color Constancy
Poster Session 6
Wenjun Wei · Yanlin Qian · Huaian Chen · Junkang Dai · Yi Jin
|
ExHall D Poster #22 | |
CacheQuant: Comprehensively Accelerated Diffusion Models
Poster Session 5
Xuewen Liu · Zhikai Li · Qingyi Gu
|
ExHall D Poster #211 | |
CSC-PA: Cross-image Semantic Correlation via Prototype Attentions for Single-network Semi-supervised Breast Tumor Segmentation
Poster Session 3
Zhenhui Ding · Guilian Chen · Qin Zhang · Huisi Wu · Jing Qin
|
ExHall D Poster #477 | |
Deterministic Image-to-Image Translation via Denoising Brownian Bridge Models with Dual Approximators
Poster Session 6
Bohan Xiao · Peiyong Wang · Qisheng He · Ming Dong
|
ExHall D Poster #197 | |
Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition
Poster Session 3
Anqi Zhu · Jingmin Zhu · James Bailey · Mingming Gong · Qiuhong Ke
|
ExHall D Poster #308 | |
ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation
Poster Session 4
Tao Tan · Qiulei Dong
|
ExHall D Poster #96 | |
T2SG: Traffic Topology Scene Graph for Topology Reasoning in Autonomous Driving
Poster Session 4
Changsheng Lv · Mengshi Qi · Liang Liu · Huadong Ma
|
ExHall D Poster #132 | |
Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps
Poster Session 2
Jeeyung Kim · Erfan Esmaeili Fakhabi · Qiang Qiu
|
ExHall D Poster #254 | |
Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models
Zichen Miao · Wei Chen · Qiang Qiu
|
ExHall D Poster #414 | |
ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points
Qirui Huang · Runze Zhang · Kangjun Liu · Minglun Gong · Hao Zhang · Hui Huang
|
ExHall D Poster #114 | |
LT3SD: Latent Trees for 3D Scene Diffusion
Poster Session 1
Quan Meng · Lei Li · Matthias Nießner · Angela Dai
|
ExHall D Poster #45 | |
Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers
Poster Session 3
Quentin Guimard · Moreno D'Incà · Massimiliano Mancini · Elisa Ricci
|
ExHall D Poster #431 | |
EVPGS: Enhanced View Prior Guidance for Splatting-based Extrapolated View Synthesis
Poster Session 4
Jiahe Li · Feiyu Wang · Xiaochao Qu · Wu Chengjing · Luoqi Liu · Ting Liu
|
ExHall D Poster #53 | |
EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Gaoxiang Cong · Jiadong Pan · Liang Li · Yuankai Qi · Yuxin Peng · Anton van den Hengel · Jian Yang · Qingming Huang
|
ExHall D Poster #1 | |
Explainable Saliency: Articulating Reasoning with Contextual Prioritization
Poster Session 2
Nuo Chen · Ming Jiang · Qi Zhao
|
ExHall D Poster #403 | |
EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image Segmentation
Md Mostafijur Rahman · Radu Marculescu
|
ExHall D Poster #482 | |
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning
Poster Session 6
Aniket Rajiv Didolkar · Andrii Zadaianchuk · Rabiul Awal · Maximilian Seitzer · Efstratios Gavves · Aishwarya Agrawal
|
ExHall D Poster #322 | |
ReRAW: RGB-to-RAW Image Reconstruction via Stratified Sampling for Efficient Object Detection on the Edge
Poster Session 3
Radu Berdan · Beril Besbinar · Christoph Reinders · Junji Otsuka · Daisuke Iso
|
ExHall D Poster #115 | |
PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction
Poster Session 6
Eduard Poesina · Adriana Valentina Costache · Adrian-Gabriel Chifu · Josiane Mothe · Radu Tudor Ionescu
|
ExHall D Poster #237 | |
RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety
Poster Session 1
Andrei Dumitriu · Florin Tatui · Florin Miron · Aakash Ralhan · Radu Tudor Ionescu · Radu Timofte
|
ExHall D Poster #311 | |
Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent
Poster Session 2
Philip Doldo · Derek Everett · Amol Khanna · Andre T Nguyen · Edward Raff
|
ExHall D Poster #95 | |
Geometry in Style: 3D Stylization via Surface Normal Deformation
Poster Session 6
Nam Anh Dinh · Itai Lang · Hyunwoo Kim · Oded Stein · Rana Hanocka
|
ExHall D Poster #219 | |
Locally Orderless Images for Optimization in Differentiable Rendering
Poster Session 2
Ishit Mehta · Manmohan Chandraker · Ravi Ramamoorthi
|
ExHall D Poster #30 | |
Binarized Neural Network for Multi-spectral Image Fusion
Poster Session 1
Junming Hou · Xiaoyu Chen · Ran Ran · Xiaofeng Cong · Xinyang Liu · Jian Wei You · Liang-Jian Deng
|
ExHall D Poster #194 | |
Sketchtopia: A Dataset and Foundational Agents for Benchmarking Asynchronous Multimodal Communication with Iconic Feedback
Poster Session 4
Mohd Hozaifa Khan · Ravi Kiran Sarvadevabhatla
|
ExHall D Poster #226 | |
ArtFormer: Controllable Generation of Diverse 3D Articulated Objects
Poster Session 1
Jiayi Su · Youhe Feng · Zheng Li · Jinhua Song · Yangfan He · Botao Ren · Botian Xu
|
ExHall D Poster #160 | |
Extreme Rotation Estimation in the Wild
Poster Session 1
Hana Bezalel · Dotan Ankri · Ruojin Cai · Hadar Averbuch-Elor
|
ExHall D Poster #83 | |
Condensing Action Segmentation Datasets via Generative Network Inversion
Poster Session 4
Guodong Ding · Rongyu Chen · Angela Yao
|
ExHall D Poster #184 | |
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving
Poster Session 1
R.D. Lin · Pengcheng Weng · Yinqiao Wang · Han Ding · Jinsong Han · Fei Wang
|
ExHall D Poster #118 | |
Reconstructing People, Places, and Cameras
Lea Müller · Hongsuk Choi · Anthony Zhang · Brent Yi · Jitendra Malik · Angjoo Kanazawa
|
ExHall D Poster #86 | |
OW-OVD: Unified Open World and Open Vocabulary Object Detection
Poster Session 5
Xing Xi · Yangyang Huang · Ronghua Luo · Yu Qiu
|
ExHall D Poster #421 | |
LIM: Large Interpolator Model for Dynamic Reconstruction
Poster Session 2
Remy Sabathier · Niloy J. Mitra · David Novotny
|
ExHall D Poster #68 | |
Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval
Poster Session 3
Mankeerat Sidhu · Hetarth Chopra · Ansel Blume · Jeonghwan Kim · Revanth Gangi Reddy · Heng Ji
|
ExHall D Poster #429 | |
Hand-held Object Reconstruction from RGB Video with Dynamic Interaction
Poster Session 3
Shijian Jiang · Qi Ye · Rengan Xie · Yuchi Huo · Jiming Chen
|
ExHall D Poster #151 | |
DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations
Poster Session 5
Krishna Sri Ipsit Mantri · Carola-Bibiane Schönlieb · Bruno Ribeiro · Chaim Baskin · Moshe Eliasof
|
ExHall D Poster #399 | |
LoKi: Low-dimensional KAN for Efficient Fine-tuning Image Models
Poster Session 3
Xuan Cai · Renjie Pan · Hua Yang
|
ExHall D Poster #403 | |
Feature Selection for Latent Factor Models
Poster Session 6
Rittwika Kansabanik · Adrian Barbu
|
ExHall D Poster #440 | |
Targeted Forgetting of Image Subgroups in CLIP Models
Poster Session 2
Zeliang Zhang · Gaowen Liu · Charles Fleming · Ramana Kompella · Chenliang Xu
|
ExHall D Poster #428 | |
No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather
Poster Session 2
Junsung Park · HwiJeong Lee · Inha Kang · Hyunjung Shim
|
ExHall D Poster #126 | |
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
Riku Murai · Eric Dexheimer · Andrew J. Davison
|
ExHall D Poster #83 | |
DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric Videos
Lorenzo Mur-Labadia · Jose J. Guerrero · Ruben Martinez-Cantin
|
ExHall D Poster #315 | |
Data Distributional Properties As Inductive Bias for Systematic Generalization
Poster Session 5
Felipe del Rio · Alain Raymond · Daniel Florea · Rodrigo Toro Icarte · Julio Hurtado · Cristian Buc Calderon · Alvaro Soto
|
ExHall D Poster #435 | |
ParaHome: Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions
Poster Session 1
Jeonghwan Kim · Jisoo Kim · Jeonghyeon Na · Hanbyul Joo
|
ExHall D Poster #153 | |
OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities
Suyoung Lee · Jaeyoung Chung · Kihoon Kim · Jaeyoo Huh · Gunhee Lee · Minsoo Lee · Kyoung Mu Lee
|
ExHall D Poster #49 | |
Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments
Poster Session 4
Luke Rowe · Roger Girgis · Anthony Gosselin · Liam Paull · Christopher Pal · Felix Heide
|
ExHall D Poster #133 | |
PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies
Poster Session 4
Mojtaba Nafez · Amirhossein Koochakian · Arad Maleki · Jafar Habibi · Mohammad Rohban
|
ExHall D Poster #436 | |
OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP
Poster Session 2
Mohamad Hassan N C · Divyam Gupta · Mainak Singha · Sai Bhargav Rongali · Ankit Jha · Muhammad Haris Khan · Biplab Banerjee
|
ExHall D Poster #451 | |
ScribbleLight: Single Image Indoor Relighting with Scribbles
Poster Session 2
Jun Myeong Choi · Annie N. Wang · Pieter Peers · Anand Bhattad · Roni Sengupta
|
ExHall D Poster #26 | |
Hardware-Rasterized Ray-Based Gaussian Splatting
Samuel Rota Bulò · Lorenzo Porzi · Nemanja Bartolovic · Peter Kontschieder
|
ExHall D Poster #30 | |
Fortifying Federated Learning Towards Trustworthiness via Auditable Data Valuation and Verifiable Client Contribution
Poster Session 1
Naveen Kumar Kummari · Ranjeet Ranjan Jha · Krishna Mohan Chalavadi · Ravindra Babu Tallamraju
|
ExHall D Poster #462 | |
Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models
Poster Session 6
Itay Benou · Tammy Riklin Raviv
|
ExHall D Poster #374 | |
Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
Poster Session 2
Zaijing Li · Yuquan Xie · Rui Shao · Gongwei Chen · Dongmei Jiang · Liqiang Nie
|
ExHall D Poster #351 | |
LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
Poster Session 1
Wei Li · Bing Hu · Rui Shao · Leyang Shen · Liqiang Nie
|
ExHall D Poster #294 | |
Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding
Poster Session 1
Wenbo Chen · Zhen Xu · Ruotao Xu · Si Wu · Hau San Wong
|
ExHall D Poster #358 | |
A Universal Scale-Adaptive Deformable Transformer for Image Restoration across Diverse Artifacts
Poster Session 3
Xuyi He · Yuhui Quan · Ruotao Xu · Hui Ji
|
ExHall D Poster #199 | |
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
Poster Session 6
Rundi Wu · Ruiqi Gao · Ben Poole · Alex Trevithick · Changxi Zheng · Jonathan T. Barron · Aleksander Holynski
|
ExHall D Poster #53 | |
SimVS: Simulating World Inconsistencies for Robust View Synthesis
Poster Session 4
Alex Trevithick · Roni Paiss · Philipp Henzler · Dor Verbin · Rundi Wu · Hadi Alzayer · Ruiqi Gao · Ben Poole · Jonathan T. Barron · Aleksander Holynski · Ravi Ramamoorthi · Pratul P. Srinivasan
|
ExHall D Poster #60 | |
Open Set Label Shift with Test Time Out-of-Distribution Reference
Poster Session 6
Changkun Ye · Russell Tsuchida · Lars Petersson · Nick Barnes
|
ExHall D Poster #428 | |
PSBD: Prediction Shift Uncertainty Unlocks Backdoor Detection
Poster Session 2
Wei Li · Pin-Yu Chen · Sijia Liu · Ren Wang
|
ExHall D Poster #465 | |
Chebyshev Attention Depth Permutation Texture Network with Latent Texture Attribute Loss
Poster Session 5
Ravishankar Evani · Deepu Rajan · Shangbo Mao
|
ExHall D Poster #226 | |
Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear Attention
Poster Session 6
Saad Wazir · Daeyoung Kim
|
ExHall D Poster #451 | |
Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices
Poster Session 1
Junyan Lin · Haoran Chen · Yue Fan · Yingqi Fan · Xin Jin · Hui Su · Jinlan Fu · Xiaoyu Shen
|
ExHall D Poster #380 | |
Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D Motion
Poster Session 5
Saad Lahlali · Sandra Kara · Hejer Ammar · Florian Chabot · Nicolas Granger · Hervé Le Borgne · Quoc Cuong Pham
|
ExHall D Poster #334 | |
Locality-Aware Zero-Shot Human-Object Interaction Detection
Poster Session 4
Sanghyun Kim · Deunsol Jung · Minsu Cho
|
ExHall D Poster #418 | |
ShowMak3r: Compositional TV Show Reconstruction
Poster Session 1
Sangmin Kim · Seunguk Do · Jaesik Park
|
ExHall D Poster #65 | |
Spectral State Space Model for Rotation-Invariant Visual Representation Learning
Poster Session 5
Sahar Dastani · Ali Bahri · Moslem Yazdanpanah · Mehrdad Noori · David Osowiechi · Gustavo Vargas Hakim · Farzad Beizaee · Milad Cheraghalikhani · Arnab Mondal · Herve Lombaert · Christian Desrosiers
|
ExHall D Poster #274 | |
Spectral Informed Mamba for Robust Point Cloud Processing
Poster Session 3
Ali Bahri · Moslem Yazdanpanah · Mehrdad Noori · Sahar Dastani · Milad Cheraghalikhani · David Osowiechi · Gustavo Vargas Hakim · Farzad Beizaee · Ismail Ben Ayed · Christian Desrosiers
|
ExHall D Poster #112 | |
AdMiT: Adaptive Multi-Source Tuning in Dynamic Environments
Poster Session 4
Xiangyu Chang · Fahim Faisal Niloy · Sk Miraj Ahmed · Srikanth Krishnamurthy · Basak Guler · Ananthram Swami · Samet Oymak · Amit K. Roy-Chowdhury
|
ExHall D Poster #453 | |
Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention
Poster Session 3
Soikat Hasan Ahmed · Jan Finkbeiner · Emre Neftci
|
ExHall D Poster #317 | |
Color Alignment in Diffusion
Poster Session 6
Ka Chun Shum · Binh-Son Hua · Thanh Nguyen · Sai-Kit Yeung
|
ExHall D Poster #218 | |
Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
Poster Session 5
Shahad Albastaki · Anabia Sohail · Iyyakutti Iyappan Ganapathi · Basit Alawode · Asim Khan · Sajid Javed · Naoufel Werghi · Mohammed Bennamoun · Arif Mahmood
|
ExHall D Poster #468 | |
Denoising Functional Maps: Diffusion Models for Shape Correspondence
Poster Session 6
Aleksei Zhuravlev · Zorah Lähner · Vladislav Golyanik
|
ExHall D Poster #72 | |
Simpler Diffusion: 1.5 FID on ImageNet512 with Pixel-space Diffusion
Poster Session 4
Emiel Hoogeboom · Thomas Mensink · Jonathan Heek · Kay Lamerigts · Ruiqi Gao · Tim Salimans
|
ExHall D Poster #215 | |
HELVIPAD: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
Mehdi Zayene · Albias Havolli · Jannik Endres · Charles Corbière · Alexandre Ben Ahmed Kontouli · Salim Cherkaoui · Alex Alahi
|
ExHall D Poster #79 | |
EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
Poster Session 3
Sagar Soni · Akshay Dudhane · Hiyam Debary · Mustansar Fiaz · Muhammad Akhtar Munir · Muhammad Sohail Danish · Paolo Fraccaro · Campbell D Watson · Levente Klein · Fahad Shahbaz Khan · Salman Khan
|
ExHall D Poster #349 | |
ZoomLDM: Latent Diffusion Model for Multi-scale Image Generation
Poster Session 5
Srikar Yellapragada · Alexandros Graikos · Kostas Triaridis · Prateek Prasanna · Rajarsi Gupta · Joel Saltz · Dimitris Samaras
|
ExHall D Poster #229 | |
LiSu: A Dataset and Method for LiDAR Surface Normal Estimation
Poster Session 4
Dušan Malić · Christian Fruhwirth-Reisinger · Samuel Schulter · Horst Possegger
|
ExHall D Poster #117 | |
GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection
Poster Session 6
Dušan Malić · Christian Fruhwirth-Reisinger · Samuel Schulter · Horst Possegger
|
ExHall D Poster #115 | |
PEER Pressure: Model-to-Model Regularization for Single Source Domain Generalization
Poster Session 3
Dongkyu Cho · Inwoo Hwang · Sanghack Lee
|
ExHall D Poster #451 | |
RAD: Region-Aware Diffusion Models for Image Inpainting
Poster Session 1
Sora Kim · Sungho Suh · Minsik Lee
|
ExHall D Poster #216 | |
GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks
Poster Session 1
Haoqiang Kang · Enna Sachdeva · Piyush Gupta · Sangjae Bae · Kwonjoon Lee
|
ExHall D Poster #347 | |
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
Fiona Ryan · Ajay Bati · Sangmin Lee · Daniel Bolya · Judy Hoffman · James Rehg
|
ExHall D Poster #258 | |
Learning to Highlight Audio by Watching Movies
Poster Session 5
Chao Huang · Ruohan Gao · J. M. F. Tsang · Jan Kurcius · Cagdas Bilen · Chenliang Xu · Anurag Kumar · Sanjeel Parekh
|
ExHall D Poster #278 | |
O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models
Ashshak Sharifdeen · Muhammad Akhtar Munir · Sanoojan Baliah · Salman Khan · Muhammad Haris Khan
|
ExHall D Poster #394 | |
RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions
Poster Session 3
Shihang Du · Sanqing Qu · Tianhang Wang · Xudong Zhang · Yunwei Zhu · Jian Mao · Fan Lu · Qiao Lin · Guang Chen
|
ExHall D Poster #122 | |
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
Poster Session 2
Davide Caffagni · Sara Sarto · Marcella Cornia · Lorenzo Baraldi · Rita Cucchiara
|
ExHall D Poster #373 | |
GeoMM: On Geodesic Perspective for Multi-modal Learning
Poster Session 1
Shibin Mei · Hang Wang · Bingbing Ni
|
ExHall D Poster #441 | |
LATTE-MV: Learning to Anticipate Table Tennis Hits from Monocular Videos
Poster Session 2
Daniel Etaat · Dvij Rajesh Kalaria · Nima Rahmanian · Shankar Sastry
|
ExHall D Poster #168 | |
How Do I Do That? Synthesizing 3D Hand Motion and Contacts for Everyday Interactions
Aditya Prakash · Benjamin E Lundell · Dmitry Andreychuk · David Forsyth · Saurabh Gupta · Harpreet S. Sawhney
|
ExHall D Poster #158 | |
Factored-NeuS: Reconstructing Surfaces, Illumination, and Materials of Possibly Glossy Objects
Poster Session 5
Yue Fan · Ningjing Fan · Ivan Skorokhodov · Oleg Voynov · Savva Ignatyev · Evgeny Burnaev · Peter Wonka · Yiqun Wang
|
ExHall D Poster #26 | |
Potential Field Based Deep Metric Learning
Poster Session 5
Shubhang Bhatnagar · Narendra Ahuja
|
ExHall D Poster #431 | |
Improving Semi-Supervised Semantic Segmentation with Sliced-Wasserstein Feature Alignment and Uniformity
Poster Session 4
Chen Yi Lu · Kasra Derakhshandeh · Somali Chaterji
|
ExHall D Poster #422 | |
Exploiting Deblurring Networks for Radiance Fields
Poster Session 2
Haeyun Choi · Heemin Yang · Janghyeok Han · Sunghyun Cho
|
ExHall D Poster #54 | |
Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering
Poster Session 4
Wenlong Fang · Qiaofeng Wu · Jing Chen · Yun Xue
|
ExHall D Poster #361 | |
EBS-EKF: Accurate and High Frequency Event-based Star Tracking
Albert Reed · Connor Hashemi · Dennis Melamed · Nitesh Menon · Keigo Hirakawa · Scott McCloskey
|
ExHall D Poster #108 | |
PLeaS - Merging Models with Permutations and Least Squares
Poster Session 6
Anshul Nasery · Jonathan Hayase · Pang Wei Koh · Sewoong Oh
|
ExHall D Poster #416 | |
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
Poster Session 1
Wenhui Liao · Jiapeng Wang · Hongliang Li · Chengyu Wang · Jun Huang · Lianwen Jin
|
ExHall D Poster #368 | |
FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolution
Poster Session 6
Junyang Chen · Jinshan Pan · Jiangxin Dong
|
ExHall D Poster #193 | |
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views
Poster Session 2
Ethan Griffiths · Maryam Haghighat · Simon Denman · Clinton Fookes · Milad Ramezani
|
ExHall D Poster #122 | |
Efficient Video Super-Resolution for Real-time Rendering with Decoupled G-buffer Guidance
Poster Session 3
Mingjun Zheng · Long Sun · Jiangxin Dong · Jinshan Pan
|
ExHall D Poster #64 | |
Efficient Visual State Space Model for Image Deblurring
Poster Session 3
Lingshun Kong · Jiangxin Dong · Jinhui Tang · Ming-Hsuan Yang · Jinshan Pan
|
ExHall D Poster #197 | |
CrossOver: 3D Scene Cross-Modal Alignment
Poster Session 2
Sayan Deb Sarkar · Ondrej Miksik · Marc Pollefeys · Daniel Barath · Iro Armeni
|
ExHall D Poster #346 | |
JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba
Poster Session 3
Xiaoyong Lu · Songlin Du
|
ExHall D Poster #409 | |
Towards Optimizing Large-Scale Multi-Graph Matching in Bioimaging
Poster Session 3
Max Kahl · Sebastian Stricker · Lisa Hutschenreiter · Florian Bernard · Carsten Rother · Bogdan Savchynskyy
|
ExHall D Poster #89 | |
Semantic and Expressive Variations in Image Captions Across Languages
Poster Session 6
Andre Ye · Sebastin Santy · Jena D. Hwang · Amy X Zhang · Ranjay Krishna
|
ExHall D Poster #337 | |
Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding
Poster Session 2
Duo Zheng · Shijia Huang · Liwei Wang
|
ExHall D Poster #347 | |
Video-Guided Foley Sound Generation with Multimodal Controls
Poster Session 4
Ziyang Chen · Prem Seetharaman · Bryan Russell · Oriol Nieto · David Bourgin · Andrew Owens · Justin Salamon
|
ExHall D Poster #285 | |
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
Poster Session 4
Tanveer Hannan · Md Mohaiminul Islam · Jindong Gu · Thomas Seidl · Gedas Bertasius
|
ExHall D Poster #307 | |
Context-Enhanced Memory-Refined Transformer for Online Action Detection
Poster Session 2
Zhanzhong Pang · Fadime Sener · Angela Yao
|
ExHall D Poster #318 | |
EntitySAM: Segment Everything in Video
Poster Session 5
Mingqiao Ye · Seoung Wug Oh · Lei Ke · Joon-Young Lee
|
ExHall D Poster #307 | |
Multi-modal Knowledge Distillation-based Human Trajectory Forecasting
Poster Session 5
Jaewoo Jeong · Seohee Lee · Daehee Park · Giwon Lee · Kuk-Jin Yoon
|
ExHall D Poster #306 | |
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
Poster Session 3
Chanyoung Kim · Dayun Ju · Woojung Han · Ming-Hsuan Yang · Seong Jae Hwang
|
ExHall D Poster #420 | |
Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synthesis
Poster Session 4
Woojung Han · Yeonkyung Lee · Chanyoung Kim · Kwanghyun Park · Seong Jae Hwang
|
ExHall D Poster #249 | |
Video Motion Transfer with Diffusion Transformers
Poster Session 5
Alexander Pondaven · Aliaksandr Siarohin · Sergey Tulyakov · Philip H.S. Torr · Fabio Pizzati
|
ExHall D Poster #176 | |
Omni-ID: Holistic Identity Representation Designed for Generative Tasks
Poster Session 2
Guocheng Qian · Kuan-Chieh Wang · Or Patashnik · Negin Heravi · Daniil Ostashev · Sergey Tulyakov · Daniel Cohen-Or · Kfir Aberman
|
ExHall D Poster #326 | |
L-SWAG: Layer-Sample Wise Activation with Gradients Information for Zero-Shot NAS on Vision Transformers
Poster Session 1
Sofia Casarin · Sergio Escalera · Oswald Lanz
|
ExHall D Poster #410 | |
MixerMDM: Learnable Composition of Human Motion Diffusion Models
Poster Session 3
Pablo Ruiz-Ponce · German Barquero · Cristina Palmero · Sergio Escalera · Jose Garcia-Rodriguez
|
ExHall D Poster #165 | |
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations
Poster Session 1
Savya Khosla · Sethuraman T V · Alexander G. Schwing · Derek Hoiem
|
ExHall D Poster #336 | |
Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation
Poster Session 2
Jiho Choi · Seonho Lee · Minhyun Lee · Seungho Lee · Hyunjung Shim
|
ExHall D Poster #420 | |
Mixture of Submodules for Domain Adaptive Person Search
Poster Session 3
Minsu Kim · Seungryong Kim · Kwanghoon Sohn
|
ExHall D Poster #320 | |
ETAP: Event-based Tracking of Any Point
Poster Session 6
Friedhelm Hamann · Daniel Gehrig · Filbert Febryanto · Kostas Daniilidis · Guillermo Gallego
|
ExHall D Poster #99 | |
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Poster Session 5
Soumya Suvra Ghosal · Souradip Chakraborty · Vaibhav Singh · Tianrui Guan · Mengdi Wang · Ahmad Beirami · Furong Huang · Alvaro Velasquez · Dinesh Manocha · Amrit Singh Bedi
|
ExHall D Poster #382 | |
Joint Out-of-Distribution Filtering and Data Discovery Active Learning
Poster Session 5
Sebastian Schmidt · Leonard Schenk · Leo Schwinn · Stephan Günnemann
|
ExHall D Poster #444 | |
Hybrid Concept Bottleneck Models
Poster Session 4
Yang Liu · Tianwei Zhang · Shi Gu
|
ExHall D Poster #417 | |
DeepCompress-ViT: Rethinking Model Compression to Enhance Efficiency of Vision Transformers at the Edge
Poster Session 6
Sabbir Ahmed · Abdullah Al Arafat · Deniz Najafi · Akhlak Mahmood · Mamshad Nayeem Rizve · Mohaiminul Al Nahian · Ranyang Zhou · Shaahin Angizi · Adnan Rakin Rakin
|
ExHall D Poster #382 | |
Variance-Based Membership Inference Attacks Against Large-Scale Image Captioning Models
Poster Session 2
Daniel Samira · Edan Habler · Yuval Elovici · Asaf Shabtai
|
ExHall D Poster #366 | |
HyperGS: Hyperspectral 3D Gaussian Splatting
Poster Session 2
Christopher Thirgood · Oscar Mendez · Erin Chao Ling · Jonathan Storey · Simon Hadfield
|
ExHall D Poster #50 | |
Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models
Poster Session 2
Chen Chen · Daochang Liu · Mubarak Shah · Chang Xu
|
ExHall D Poster #268 | |
Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects
Poster Session 4
Shalini Maiti · Lourdes Agapito · Filippos Kokkinos
|
ExHall D Poster #265 | |
MatAnyone: Stable Video Matting with Consistent Memory Propagation
Poster Session 2
Peiqing Yang · Shangchen Zhou · Jixin Zhao · Qingyi Tao · Chen Change Loy
|
ExHall D Poster #185 | |
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
Poster Session 4
Jiajun Cao · Yuan Zhang · Tao Huang · Ming Lu · Qizhe Zhang · Ruichuan An · Ningning Ma · Shanghang Zhang
|
ExHall D Poster #385 | |
Fractal Calibration for Long-tailed Object Detection
Poster Session 3
Konstantinos Alexandridis · Ismail Elezi · Jiankang Deng · Anh Nguyen · Shan Luo
|
ExHall D Poster #430 | |
RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects
Poster Session 2
Soumyaratna Debnath · Ashish Tiwari · Kaustubh Sadekar · Shanmuganathan Raman
|
ExHall D Poster #38 | |
Visual Representation Learning through Causal Intervention for Controllable Image Editing
Shanshan Huang · Haoxuan Li · Chunyuan Zheng · Lei Wang · Guorui Liao · Zhili Gong · Huayi Yang · Li Liu
|
ExHall D Poster #232 | |
Text-Driven Fashion Image Editing with Compositional Concept Learning and Counterfactual Abduction
Poster Session 6
Shanshan Huang · Haoxuan Li · Chunyuan Zheng · Mingyuan Ge · Wei Gao · Lei Wang · Li Liu
|
ExHall D Poster #244 | |
HORP: Human-Object Relation Priors Guided HOI Detection
Poster Session 5
Pei Geng · Jian Yang · Shanshan Zhang
|
ExHall D Poster #409 | |
FSHNet: Fully Sparse Hybrid Network for 3D Object Detection
Poster Session 2
Shuai Liu · Mingyue Cui · Boyang Li · Quanmin Liang · Tinghe Hong · Kai Huang · Yunxiao Shan · Kai Huang
|
ExHall D Poster #338 | |
Guiding Human-Object Interactions with Rich Geometry and Relations
Poster Session 5
Mengqing Xue · Yifei Liu · Ling Guo · Shaoli Huang · Changxing Ding
|
ExHall D Poster #157 | |
MATCHA: Towards Matching Anything
Fei Xue · Sven Elflein · Laura Leal-Taixe · Qunjie Zhou
|
ExHall D Poster #89 | |
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models
Poster Session 1
Mahtab Bigverdi · Zelun Luo · Cheng-Yu Hsieh · Ethan Shen · Dongping Chen · Linda Shapiro · Ranjay Krishna
|
ExHall D Poster #349 | |
PICO: Reconstructing 3D People In Contact with Objects
Poster Session 1
Alpár Cseke · Shashank Tripathi · Sai Kumar Dwivedi · Arjun Lakshmipathy · Agniv Chatterjee · Michael J. Black · Dimitrios Tzionas
|
ExHall D Poster #150 | |
HUNet: Homotopy Unfolding Network for Image Compressive Sensing
Poster Session 3
Feiyang Shen · Hongping Gan
|
ExHall D Poster #205 | |
InteractVLM: 3D Interaction Reasoning from 2D Foundational Models
Poster Session 5
Sai Kumar Dwivedi · Dimitrije Antić · Shashank Tripathi · Omid Taheri · Cordelia Schmid · Michael J. Black · Dimitrios Tzionas
|
ExHall D Poster #147 | |
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
Poster Session 1
David Yifan Yao · Albert J. Zhai · Shenlong Wang
|
ExHall D Poster #88 | |
Understanding Multi-Task Activities from Single-Task Videos
Yuhan Shen · Ehsan Elhamifar
|
ExHall D Poster #317 | |
Golden Cudgel Network for Real-Time Semantic Segmentation
Poster Session 5
Guoyu Yang · Yuan Wang · Daming Shi · Yanzhong Wang
|
ExHall D Poster #413 | |
Viewpoint Rosetta Stone: Unlocking Unpaired Ego-Exo Videos for View-invariant Representation Learning
Poster Session 4
Mi Luo · Zihui Xue · Alex Dimakis · Kristen Grauman
|
ExHall D Poster #88 | |
AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering
Poster Session 2
Jing Wang · Songhe Feng · Kristoffer Knutsen Wickstrøm · Michael C. Kampffmeyer
|
ExHall D Poster #468 | |
Black Hole-Driven Identity Absorbing in Diffusion Models
Poster Session 6
Muhammad Shaheryar · Jong Taek Lee · Soon Ki Jung
|
ExHall D Poster #227 | |
Multi-modal Topology-embedded Graph Learning for Spatially Resolved Genes Prediction from Pathology Images with Prior Gene Similarity Information
Poster Session 4
Hang Shi · Chi Changxi · Peng Wan · Daoqiang Zhang · Wei Shao
|
ExHall D Poster #476 | |
Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves
Poster Session 3
Shihan Wu · Ji Zhang · Pengpeng Zeng · Lianli Gao · Jingkuan Song · Heng Tao Shen
|
ExHall D Poster #390 | |
SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting
Poster Session 3
Jiahui Zhang · Fangneng Zhan · Ling Shao · Shijian Lu
|
ExHall D Poster #49 | |
Memories of Forgotten Concepts
Matan Rusanovsky · Shimon Malnick · Amir Jevnisek · Ohad Fried · Shai Avidan
|
ExHall D Poster #268 | |
Seeing More with Less: Human-like Representations in Vision Models
Andrey Gizdov · Shimon Ullman · Daniel Harari
|
ExHall D Poster #407 | |
AeSPa: Attention-guided Self-supervised Parallel Imaging for MRI Reconstruction
Poster Session 1
Jinho Joo · Hyeseong Kim · Hyeyeon Won · Deukhee Lee · Taejoon Eo · Dosik Hwang
|
ExHall D Poster #483 | |
Erasing Undesirable Influence in Diffusion Models
Poster Session 6
Jing Wu · Trung Le · Munawar Hayat · Mehrtash Harandi
|
ExHall D Poster #200 | |
Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression
Poster Session 4
Dohyun Kim · Sehwan Park · GeonHee Han · Seung Wook Kim · Paul Hongsuck Seo
|
ExHall D Poster #270 | |
Efficient ANN-Guided Distillation: Aligning Rate-based Features of Spiking Neural Networks through Hybrid Block-wise Replacement
Poster Session 2
Shu Yang · Chengting Yu · Lei Liu · Hanzhi Ma · Aili Wang · Erping Li
|
ExHall D Poster #443 | |
A Unified, Resilient, and Explainable Adversarial Patch Detector
Poster Session 6
Vishesh Kumar · Akshay Agarwal
|
ExHall D Poster #406 | |
StarVector: Generating Scalable Vector Graphics Code from Images and Text
Poster Session 4
Juan Rodriguez · Abhay Puri · Shubham Agarwal · Issam Laradji · Pau Rodriguez · Sai Rajeswar · David Vazquez · Christopher Pal · Marco Pedersoli
|
ExHall D Poster #31 | |
Toward Robust Neural Reconstruction from Sparse Point Sets
Poster Session 2
Amine Ouasfi · Shubhendu Jena · Eric Marchand · Adnane Boukhayma
|
ExHall D Poster #112 | |
SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation
Poster Session 1
Aleksei Bokhovkin · Quan Meng · Shubham Tulsiani · Angela Dai
|
ExHall D Poster #43 | |
UniPhy: Learning a Unified Constitutive Model for Inverse Physics Simulation
Poster Session 4
Himangi Mittal · Peiye Zhuang · Hsin-Ying Lee · Shubham Tulsiani
|
ExHall D Poster #34 | |
DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion
Poster Session 2
Qitao Zhao · Amy Lin · Jeff Tan · Jason Y. Zhang · Deva Ramanan · Shubham Tulsiani
|
ExHall D Poster #87 | |
GeoDepth: From Point-to-Depth to Plane-to-Depth Modeling for Self-Supervised Monocular Depth Estimation
Poster Session 3
Haifeng Wu · Shuhang Gu · Lixin Duan · Wen Li
|
ExHall D Poster #84 | |
Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model
Poster Session 4
Leheng Zhang · Weiyi You · Kexuan Shi · Shuhang Gu
|
ExHall D Poster #207 | |
Unseen Visual Anomaly Generation
Poster Session 5
Han Sun · Yunkang Cao · Hao Dong · Olga Fink
|
ExHall D Poster #427 | |
Enhancing Adversarial Transferability with Checkpoints of a Single Model’s Training
Poster Session 4
Shixin Li · Chaoxiang He · Xiaojing Ma · Bin Benjamin Zhu · Shuo Wang · Hongsheng Hu · Dongmei Zhang · Linchen Yu
|
ExHall D Poster #464 | |
Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks
Poster Session 6
Nina Shvetsova · Arsha Nagrani · Bernt Schiele · Hilde Kuehne · Christian Rupprecht
|
ExHall D Poster #278 | |
Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations
Poster Session 6
Jeonghyeon Kim · Sangheum Hwang
|
ExHall D Poster #366 | |
APT: Adaptive Personalized Training for Diffusion Models with Limited Data
Poster Session 6
JungWoo Chae · Jiyoon Kim · Jaewoong Choi · Kyungyul Kim · Sangheum Hwang
|
ExHall D Poster #234 | |
Differentiable Inverse Rendering with Interpretable Basis BRDFs
Poster Session 1
Hoon-Gyu Chung · Seokjun Choi · Seung-Hwan Baek
|
ExHall D Poster #29 | |
Gyro-based Neural Single Image Deblurring
Poster Session 5
Heemin Yang · Jaesung Rim · Seungyong Lee · Seung-Hwan Baek · Sunghyun Cho
|
ExHall D Poster #195 | |
Flexible Frame Selection for Efficient Video Reasoning
Poster Session 6
Shyamal Buch · Arsha Nagrani · Anurag Arnab · Cordelia Schmid
|
ExHall D Poster #280 | |
FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis
Poster Session 1
Wonjoon Jin · Qi Dai · Chong Luo · Seung-Hwan Baek · Sunghyun Cho
|
ExHall D Poster #176 | |
HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories
Poster Session 5
Eric Hedlin · Munawar Hayat · Fatih Porikli · Kwang Moo Yi · Shweta Mahajan
|
ExHall D Poster #103 | |
HSI: A Holistic Style Injector for Arbitrary Style Transfer
Poster Session 5
Shuhao Zhang · Hui Kang · Yang Liu · Fang Mei · Hongjuan Li
|
ExHall D Poster #227 | |
Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction
Poster Session 5
Huiwon Jang · Sihyun Yu · Jinwoo Shin · Pieter Abbeel · Younggyo Seo
|
ExHall D Poster #171 | |
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Poster Session 3
Andrea Maracani · Savas Ozkan · Sijun Cho · Hyo-Won Kim · Eunchung Noh · Jeongwon Min · Cho Jung Min · Dookun Park · Mete Ozay
|
ExHall D Poster #369 | |
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
Poster Session 1
Guangda Ji · Silvan Weder · Francis Engelmann · Marc Pollefeys · Hermann Blum
|
ExHall D Poster #406 | |
Reconstructing Animals and the Wild
Poster Session 4
Peter Kulits · Michael J. Black · Silvia Zuffi
|
ExHall D Poster #70 | |
Task Singular Vectors: Reducing Task Interference in Model Merging
Poster Session 4
Antonio Andrea Gargiulo · Donato Crisostomi · Maria Sofia Bucarelli · Simone Scardapane · Fabrizio Silvestri · Emanuele Rodolà
|
ExHall D Poster #278 | |
Masking meets Supervision: A Strong Learning Alliance
Poster Session 4
Byeongho Heo · Taekyung Kim · Sangdoo Yun · Dongyoon Han
|
ExHall D Poster #442 | |
Preconditioners for the Stochastic Training of Neural Fields
Poster Session 6
Shin-Fang Chng · Hemanth Saratchandran · Simon Lucey
|
ExHall D Poster #102 | |
Navigating Image Restoration with VAR’s Distribution Alignment Prior
Poster Session 2
Siyang Wang · Naishan Zheng · Jie Huang · Feng Zhao
|
ExHall D Poster #209 | |
A General Adaptive Dual-level Weighting Mechanism for Remote Sensing Pansharpening
Poster Session 2
Jie Huang · Haorui Chen · Jiaxuan Ren · Siran Peng · Liang-Jian Deng
|
ExHall D Poster #199 | |
CryptoFace: End-to-End Encrypted Face Recognition
Poster Session 4
Wei Ao · Vishnu Naresh Boddeti
|
ExHall D Poster #324 | |
OralXrays-9: Towards Hospital-Scale Panoramic X-ray Anomaly Detection via Personalized Multi-Object Query-Aware Mining
Poster Session 3
Bingzhi Chen · Sisi Fu · Xiaocheng Fang · Jieyi Cai · Boya Zhang · Minhua Lu · Yishu Liu
|
ExHall D Poster #471 | |
CheXwhatsApp: A Dataset for Exploring Challenges in the Diagnosis of Chest X-rays through Mobile Devices
Poster Session 5
Mariamma Antony · Rajiv Porana · Sahil M. Lathiya · Siva Teja Kakileti · Chiranjib Bhattacharyya
|
ExHall D Poster #466 | |
Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?
Poster Session 5
Yuechen Xie · Jie Song · Huiqiong Wang · Mingli Song
|
ExHall D Poster #268 | |
VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness
Poster Session 2
SeungJu Cha · Kwanyoung Lee · Ye-Chan Kim · Hyunwoo Oh · Dong-Jin Kim
|
ExHall D Poster #255 | |
DH-Set: Improving Vision-Language Alignment with Diverse and Hybrid Set-Embeddings Learning
Poster Session 5
Kun Zhang · Jingyu Li · Zhe Li · S Kevin Zhou
|
ExHall D Poster #378 | |
MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation
Poster Session 2
Sankalp Sinha · Mohammad Sadil Khan · Muhammad Usama · Shino Sam · Didier Stricker · Sk Aziz Ali · Muhammad Zeshan Afzal
|
ExHall D Poster #261 | |
Controllable Human Image Generation with Personalized Multi-Garments
Poster Session 6
Yisol Choi · Sangkyung Kwak · Sihyun Yu · Hyungwon Choi · Jinwoo Shin
|
ExHall D Poster #245 | |
Wonderland: Navigating 3D Scenes from a Single Image
Poster Session 1
Hanwen Liang · Junli Cao · Vidit Goel · Guocheng Qian · Sergei Korolev · Demetri Terzopoulos · Konstantinos N. Plataniotis · Sergey Tulyakov · Jian Ren
|
ExHall D Poster #59 | |
Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training
Poster Session 2
Lexington Whalen · Zhenbang Du · Haoran You · Chaojian Li · Sixu Li · Yingyan (Celine) Lin
|
ExHall D Poster #221 | |
What’s in the Image? A Deep-Dive into the Vision of Vision Language Models
Poster Session 3
Omri Kaduri · Shai Bagon · Tali Dekel
|
ExHall D Poster #372 | |
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Poster Session 1
Jinho Jeong · Sangmin Han · Jinwoo Kim · Seon Joo Kim
|
ExHall D Poster #206 | |
ORIDa: Object-centric Real-world Image Composition Dataset
Poster Session 1
Jinwoo Kim · Sangmin Han · Jinho Jeong · Jiwoo Choi · Dongyoung Kim · Seon Joo Kim
|
ExHall D Poster #276 | |
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Poster Session 4
Enric Corona · Andrei Zanfir · Eduard Gabriel Bazavan · Nikos Kolotouros · Thiemo Alldieck · Cristian Sminchisescu
|
ExHall D Poster #4 | |
Just Dance with pi! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection
Snehashis Majhi · Giacomo D'Amicantonio · Antitza Dantcheva · Quan Kong · Lorenzo Garattoni · Gianpiero Francesca · Egor Bondarev · Francois Bremond
|
ExHall D Poster #310 | |
Pathways on the Image Manifold: Image Editing via Video Generation
Poster Session 2
Noam Rotstein · Gal Yona · Daniel Silver · Roy Velich · David Bensaid · Ron Kimmel
|
ExHall D Poster #238 | |
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Poster Session 4
Navve Wasserman · Noam Rotstein · Roy Ganz · Ron Kimmel
|
ExHall D Poster #240 | |
AesthetiQ: Enhancing Graphic Layout Design via Aesthetic-Aware Preference Alignment of Multi-modal Large Language Models
Poster Session 5
Sohan Patnaik · Rishabh Jain · Balaji Krishnamurthy · Mausoom Sarkar
|
ExHall D Poster #255 | |
MVDoppler-Pose: Multi-Modal Multi-View mmWave Sensing for Long-Distance Self-Occluded Human Walking Pose Estimation
Poster Session 6
Jae-Ho Choi · Soheil Hor · Shubo Yang · Amin Arbabian
|
ExHall D Poster #151 | |
Towards Generalizable Scene Change Detection
Poster Session 5
Jae-Woo Kim · Ue-Hwan Kim
|
ExHall D Poster #328 | |
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation
Poster Session 2
Reza Abbasi · Ali Nazari · Aminreza Sefid · Mohammadali Banayeeanzade · Mohammad Rohban · Mahdieh Baghshah
|
ExHall D Poster #375 | |
Few-shot Personalized Scanpath Prediction
Poster Session 3
Ruoyu Xue · Jingyi Xu · Sounak Mondal · Hieu Le · Gregory Zelinsky · Minh Hoai · Dimitris Samaras
|
ExHall D Poster #272 | |
Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition
Poster Session 2
Yang Chen · Jingcai Guo · Song Guo · Dacheng Tao
|
ExHall D Poster #320 | |
Stop Learning it all to Mitigate Visual Hallucination, Focus on the Hallucination Target.
Poster Session 1
Dokyoon Yoon · Youngsook Song · Woomyoung Park
|
ExHall D Poster #385 | |
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning
Hanxun Yu · Wentong Li · Song Wang · Junbo Chen · Jianke Zhu
|
ExHall D Poster #335 | |
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Poster Session 3
Xinshuai Song · Weixing Chen · Yang Liu · Weikai Chen · Guanbin Li · Liang Lin
|
ExHall D Poster #138 | |
DepthSplat: Connecting Gaussian Splatting and Depth
Poster Session 4
Haofei Xu · Songyou Peng · Fangjinhua Wang · Hermann Blum · Daniel Barath · Andreas Geiger · Marc Pollefeys
|
ExHall D Poster #58 | |
DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders
Poster Session 4
Sizai Hou · Songze Li · Duanyi Yao
|
ExHall D Poster #463 | |
PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models
Poster Session 3
Dhouib Mohamed · Davide Buscaldi · Vanier Sonia · Aymen Shabou
|
ExHall D Poster #377 | |
MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations
Kyungho Bae · Jinhyung Kim · Sihaeng Lee · Soonyoung Lee · Gunhee Lee · Jinwoo Choi
|
ExHall D Poster #296 | |
ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams
Poster Session 6
Chris Dongjoo Kim · Jihwan Moon · Sangwoo Moon · Heeseung Yun · Sihaeng Lee · Aniruddha Kembhavi · Soonyoung Lee · Gunhee Kim · Sangho Lee · Christopher Clark
|
ExHall D Poster #277 | |
T-FAKE: Synthesizing Thermal Images for Facial Landmarking
Poster Session 6
Philipp Flotho · Moritz Piening · Anna Kukleva · Gabriele Steidl
|
ExHall D Poster #16 | |
Let Humanoids Hike! Integrative Skill Development on Complex Trails
Poster Session 5
Kwan-Yee Lin · Stella X. Yu
|
ExHall D Poster #137 | |
Opportunistic Single-Photon Time of Flight
Poster Session 4
Sotiris Nousias · Mian Wei · Howard Xiao · Maxx Wu · Shahmeer Athar · Kevin J Wang · Anagh Malik · David A. Barmherzig · David B. Lindell · Kiriakos Kutulakos
|
ExHall D Poster #68 | |
Foundations of the Theory of Performance-Based Ranking
Poster Session 3
Sébastien Piérard · Anaïs Halin · Anthony Cioppa · Adrien Deliege · Marc Van Droogenbroeck
|
ExHall D Poster #348 | |
3D-GSW: 3D Gaussian Splatting for Robust Watermarking
Poster Session 2
Youngdong Jang · Hyunje Park · Feng Yang · Heeju Ko · Euijin Choo · Sangpil Kim
|
ExHall D Poster #47 | |
Insightful Instance Features for 3D Instance Segmentation
Poster Session 3
Wonseok Roh · Hwanhee Jung · Giljoo Nam · Dong In Lee · Hyeongcheol Park · Sang Ho Yoon · Jungseock Joo · Sangpil Kim
|
ExHall D Poster #326 | |
TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing
Poster Session 6
Stefan Lionar · Jiabin Liang · Gim Hee Lee
|
ExHall D Poster #43 | |
Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification
Poster Session 2
S P Sharan · Minkyu Choi · Sahil Shah · Harsh Goel · Mohammad Omama · Sandeep P. Chinchali
|
ExHall D Poster #289 | |
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
Poster Session 1
Efstathios Karypidis · Ioannis Kakogeorgiou · Spyros Gidaris · Nikos Komodakis
|
ExHall D Poster #345 | |
GIF: Generative Inspiration for Face Recognition at Scale
Poster Session 1
Mohammad Saadabadi Saadabadi · Sahar Rahimi Malakshan · Ali Dabouei · Srinjoy Das · Jeremy M. Dawson · Nasser M Nasrabadi
|
ExHall D Poster #320 | |
Augmenting Perceptual Super-Resolution via Image Quality Predictors
Poster Session 1
Fengjia Zhang · Samrudhdhi Rangrej · Tristan T Aumentado-Armstrong · Afsaneh Fazly · Alex Levinshtein
|
ExHall D Poster #202 | |
BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting
Poster Session 4
Jeongwan On · Kyeonghwan Gwak · Gunyoung Kang · Junuk Cha · Soohyun Hwang · Hyein Hwang · Seungryul Baek
|
ExHall D Poster #157 | |
AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis
Poster Session 5
Khiem Vuong · Anurag Ghosh · Deva Ramanan · Srinivasa G. Narasimhan · Shubham Tulsiani
|
ExHall D Poster #59 | |
Learning from Synchronization: Self-Supervised Uncalibrated Multi-View Person Association in Challenging Scenes
Poster Session 5
Keqi Chen · Vinkle Srivastav · Didier Mutter · Nicolas Padoy
|
ExHall D Poster #324 | |
SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models
Poster Session 1
Kevin Miller · Aditya Gangrade · Samarth Mishra · Kate Saenko · Venkatesh Saligrama
|
ExHall D Poster #398 | |
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
Poster Session 2
Chaehun Shin · Jooyoung Choi · Heeseung Kim · Sungroh Yoon
|
ExHall D Poster #250 | |
DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows
Poster Session 5
Mashrur M. Morshed · Vishnu Naresh Boddeti
|
ExHall D Poster #215 | |
Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales
Poster Session 1
Shuokai Pan · Gerti Tuzi · Sudarshan Sreeram · Dibakar Gope
|
ExHall D Poster #374 | |
Certified Human Trajectory Prediction
Poster Session 3
Mohammadhossein Bahari · Saeed Saadatnejad · Amirhossein Askari Farsangi · Seyed-Mohsen Moosavi-Dezfooli · Alex Alahi
|
ExHall D Poster #158 | |
MotionMap: Representing Multimodality in Human Pose Forecasting
Poster Session 5
Reyhaneh Hosseininejad · Megh Shukla · Saeed Saadatnejad · Mathieu Salzmann · Alex Alahi
|
ExHall D Poster #154 | |
ViUniT: Visual Unit Tests for More Robust Visual Programming
Poster Session 5
Artemis Panagopoulou · Honglu Zhou · Silvio Savarese · Caiming Xiong · Chris Callison-Burch · Mark Yatskar · Juan Carlos Niebles
|
ExHall D Poster #346 | |
Gradient Inversion Attacks on Parameter-Efficient Fine-Tuning
Poster Session 2
Hasin Us Sami · Swapneel Sen · Amit K. Roy-Chowdhury · Srikanth Krishnamurthy · Basak Guler
|
ExHall D Poster #462 | |
Circumventing Shortcuts in Audio-visual Deepfake Detection Datasets with Unsupervised Learning
Stefan Smeu · Dragos-Alexandru Boldisor · Dan Oneata · Elisabeta Oneata
|
ExHall D Poster #289 | |
AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification
Poster Session 1
Huy Nguyen · Kien Nguyen Thanh · Akila Pemasiri · Feng Liu · Sridha Sridharan · Clinton Fookes
|
ExHall D Poster #100 | |
Adaptive Non-Uniform Timestep Sampling for Accelerating Diffusion Model Training
Poster Session 1
Myunsoo Kim · Donghyeon Ki · Seong-Woong Shim · Byung-Jun Lee
|
ExHall D Poster #225 | |
Reasoning in Visual Navigation of End-to-end Trained Agents: A Dynamical Systems Approach
Steeven Janny · Hervé Poirier · Leonid Antsfeld · Guillaume Bono · Gianluca Monaci · Boris Chidlovskii · Francesco Giuliari · Alessio Del Bue · Christian Wolf
|
ExHall D Poster #141 | |
PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction
Poster Session 4
Sinisa Stekovic · Arslan Artykov · Stefan Ainetter · Mattia Durso · Friedrich Fraundorfer
|
ExHall D Poster #41 | |
CleanDIFT: Diffusion Features without Noise
Poster Session 1
Nick Stracke · Stefan Andreas Baumann · Kolja Bauer · Frank Fundel · Björn Ommer
|
ExHall D Poster #218 | |
VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation
Poster Session 6
Hanzhi Chen · Boyang Sun · Anran Zhang · Marc Pollefeys · Stefan Leutenegger
|
ExHall D Poster #143 | |
Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail
Poster Session 1
Luca Bartolomei · Fabio Tosi · Matteo Poggi · Stefano Mattoccia
|
ExHall D Poster #79 | |
GLASS: Guided Latent Slot Diffusion for Object-Centric Learning
Poster Session 6
Krishnakant Singh · Simone Schaub-Meyer · Stefan Roth
|
ExHall D Poster #239 | |
Scene-Centric Unsupervised Panoptic Segmentation
Poster Session 5
Oliver Hahn · Christoph Reich · Nikita Araslanov · Daniel Cremers · Christian Rupprecht · Stefan Roth
|
ExHall D Poster #330 | |
Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation
Poster Session 5
David T. Hoffmann · Syed Haseeb Raza · Hanqiu Jiang · Steffen Klingenhoefer · Denis Tananaev · Martin Meinke
|
ExHall D Poster #122 | |
PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds
Poster Session 2
Barza Nisar · Steven L. Waslander
|
ExHall D Poster #124 | |
MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects
Poster Session 3
Kevin Zhang · Jia-Bin Huang · Jose Echevarria · Stephen DiVerdi · Aaron Hertzmann
|
ExHall D Poster #25 | |
VI^3NR: Variance Informed Initialization for Implicit Neural Representations
Poster Session 3
Chamin Hewa Koneputugodage · Yizhak Ben-Shabat · Sameera Ramasinghe · Stephen Gould
|
ExHall D Poster #270 | |
Pos3R: 6D Pose Estimation for Unseen Objects Made Easy
Poster Session 4
Weijian Deng · Dylan Campbell · Chunyi Sun · Jiahao Zhang · Shubham Kanitkar · Matthew Shaffer · Stephen Gould
|
ExHall D Poster #95 | |
Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection
Poster Session 1
Marc-Antoine Lavoie · Anas Mahmoud · Steven L. Waslander
|
ExHall D Poster #433 | |
Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving
Poster Session 3
Alexey Nekrasov · Malcolm Burdorf · Stewart Worrall · Bastian Leibe · Julie Stephany Berrio Perez
|
ExHall D Poster #119 | |
Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation
Poster Session 4
Yu Qi · Yuanchen Ju · Tianming Wei · Chi Chu · Lawson L.S. Wong · Huazhe Xu
|
ExHall D Poster #152 | |
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
Poster Session 5
Gyeongjin Kang · Jisang Yoo · Jihyeon Park · Seungtae Nam · Hyeonsoo Im · Shin sangheon · Sangpil Kim · Eunbyung Park
|
ExHall D Poster #92 | |
Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents
Poster Session 5
Jun Chen · Dannong Xu · Junjie Fei · Chun-Mei Feng · Mohamed Elhoseiny
|
ExHall D Poster #362 | |
RANGE: Retrieval Augmented Neural Fields for Multi-Resolution Geo-Embeddings
Poster Session 5
Aayush Dhakal · Srikumar Sastry · Subash Khanal · Adeel Ahmad · Eric Xing · Nathan Jacobs
|
ExHall D Poster #349 | |
Believing is Seeing: Unobserved Object Detection using Generative Models
Poster Session 4
Subhransu S. Bhattacharjee · Dylan Campbell · Rahul Shome
|
ExHall D Poster #340 | |
Towards In-the-wild 3D Plane Reconstruction from a Single Image
Jiachen Liu · Rui Yu · Sili Chen · Sharon X. Huang · Hengkai Guo
|
ExHall D Poster #84 | |
Minority-Focused Text-to-Image Generation via Prompt Optimization
Poster Session 5
Soobin Um · Jong Chul Ye
|
ExHall D Poster #243 | |
DrVideo: Document Retrieval Based Long Video Understanding
Poster Session 4
Ziyu Ma · Chenhui Gou · Hengcan Shi · Bin Sun · Shutao Li · Hamid Rezatofighi · Jianfei Cai
|
ExHall D Poster #300 | |
CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation
Poster Session 5
Jungsoo Lee · Debasmit Das · Munawar Hayat · Sungha Choi · Kyuwoong Hwang · Fatih Porikli
|
ExHall D Poster #395 | |
Polarized Color Screen Matting
Kenji Enomoto · Scott Cohen · Brian Price · TJ Rhodes
|
ExHall D Poster #21 | |
Style-Editor: Text-driven Object-centric Style Editing
Jihun Park · Jongmin Gim · Kyoungmin Lee · Seunghun Lee · Sunghoon Im
|
ExHall D Poster #237 | |
Cross-View Completion Models are Zero-shot Correspondence Estimators
Honggyu An · Jin Hyeon Kim · Seonghoon Park · Sunghwan Hong · Jaewoo Jung · Jisang Han · Seungryong Kim
|
ExHall D Poster #87 | |
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
Poster Session 4
Gyeongrok Oh · Sungjune Kim · Heeju Ko · Hyunggun Chi · Jinkyu Kim · Dongwook Lee · Daehyun Ji · Sungjoon Choi · Sujin Jang · Sangpil Kim
|
ExHall D Poster #126 | |
Dense-SfM: Structure from Motion with Dense Consistent Matching
Poster Session 2
JongMin Lee · Sungjoo Yoo
|
ExHall D Poster #98 | |
Co-op: Correspondence-based Novel Object Pose Estimation
Poster Session 3
Sungphill Moon · Hyeontae Son · Dongcheol Hur · Sangwook Kim
|
ExHall D Poster #94 | |
LAL: Enhancing 3D Human Motion Prediction with Latency-aware Auxiliary Learning
Poster Session 2
Xiaoning Sun · Dong Wei · Huaijiang Sun · Shengxiang Hu
|
ExHall D Poster #167 | |
ALIEN: Implicit Neural Representations for Human Motion Prediction under Arbitrary Latency
Dong Wei · Xiaoning Sun · Xizhan Gao · Shengxiang Hu · Huaijiang Sun
|
ExHall D Poster #157 | |
Shape Abstraction via Marching Differentiable Support Functions
Sunkyung Park · Jeongmin Lee · Dongjun Lee
|
ExHall D Poster #104 | |
Identity-preserving Distillation Sampling by Fixed-Point Iterator
Poster Session 3
SeonHwa Kim · Jiwon Kim · Soobin Park · Donghoon Ahn · Jiwon Kang · Seungryong Kim · Kyong Hwan Jin · Eunju Cha
|
ExHall D Poster #44 | |
POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality
Poster Session 1
Joey Wilson · Marcelino M. de Almeida · Sachit Mahajan · Martin Labrie · Maani Ghaffari · Omid Ghasemalizadeh · Min Sun · Cheng-Hao Kuo · Arnab Sen
|
ExHall D Poster #331 | |
Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport
Poster Session 3
Mengnan Liu · Le Wang · Sanping Zhou · Kun Xia · Xiaolong Sun · Gang Hua
|
ExHall D Poster #307 | |
ROLL: Robust Noisy Pseudo-label Learning for Multi-View Clustering with Noisy Correspondence
Yuan Sun · Yongxiang Li · Zhenwen Ren · Guiduo Duan · Dezhong Peng · Peng Hu
|
ExHall D Poster #439 | |
Fuzzy Multimodal Learning for Trusted Cross-modal Retrieval
Poster Session 4
Siyuan Duan · Yuan Sun · Dezhong Peng · Zheng Liu · Xiaomin Song · Peng Hu
|
ExHall D Poster #470 | |
Seurat: From Moving Points to Depth
Seokju Cho · Gabriel Huang · Seungryong Kim · Joon-Young Lee
|
ExHall D Poster #177 | |
Exploring Temporally-Aware Features for Point Tracking
Poster Session 1
Inès Hyeonsu Kim · Seokju Cho · Gabriel Huang · Jung Yi · Joon-Young Lee · Seungryong Kim
|
ExHall D Poster #166 | |
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling
Poster Session 3
Junha Hyung · Kinam Kim · Susung Hong · Min-Jung Kim · Jaegul Choo
|
ExHall D Poster #34 | |
Annotation Ambiguity Aware Semi-Supervised Medical Image Segmentation
Suruchi Kumari · Pravendra Singh
|
ExHall D Poster #479 | |
Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories
Poster Session 4
Susung Hong · Johanna Suvi Karras · Ricardo Martin · Ira Kemelmacher-Shlizerman
|
ExHall D Poster #42 | |
SapiensID: Foundation for Human Recognition
Poster Session 3
Minchul Kim · Dingqiang Ye · Yiyang Su · Feng Liu · Xiaoming Liu
|
ExHall D Poster #314 | |
Encapsulated Composition of Text-to-Image and Text-to-Video Models for High-Quality Video Synthesis
Poster Session 4
Tongtong Su · Chengyu Wang · Bingyan Liu · Jun Huang · Dongming Lu
|
ExHall D Poster #229 | |
Light3R-SfM: Towards Feed-forward Structure-from-Motion
Sven Elflein · Qunjie Zhou · Laura Leal-Taixe
|
ExHall D Poster #91 | |
Category-Agnostic Neural Object Rigging
Poster Session 5
Guangzhao He · Chen Geng · Shangzhe Wu · Jiajun Wu
|
ExHall D Poster #98 | |
Test-time Augmentation Improves Efficiency in Conformal Prediction
Poster Session 4
Divya M Shanmugam · Helen Lu · Swami Sankaranarayanan · John Guttag
|
ExHall D Poster #458 | |
Do ImageNet-trained Models Learn Shortcuts? The Impact of Frequency Shortcuts on Generalization
Poster Session 5
Shunxin Wang · Raymond Veldhuis · Nicola Strisciuglio
|
ExHall D Poster #397 | |
Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration
Poster Session 3
Aocheng Li · James R. Zimmer-Dauphinee · Rajesh Kalyanam · Ian Lindsay · Parker VanValkenburgh · Steven Wernke · Daniel Aliaga
|
ExHall D Poster #107 | |
T-CIL: Temperature Scaling using Adversarial Perturbation for Calibration in Class-Incremental Learning
Poster Session 3
Seong-Hyeon Hwang · Minsu Kim · Steven Euijong Whang
|
ExHall D Poster #449 | |
Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes
Poster Session 2
JunYong Choi · Min-Cheol Sagong · SeokYeong Lee · Seung-Won Jung · Ig-Jae Kim · Junghyun Cho
|
ExHall D Poster #31 | |
Comprehensive Information Bottleneck for Unveiling Universal Attribution to Interpret Vision Transformers
Jung-Ho Hong · Ho-Joong Kim · Kyu-Sung Jeon · Seong-Whan Lee
|
ExHall D Poster #394 | |
MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation
Poster Session 6
Huaize Liu · WenZhang Sun · Donglin Di · Shibo Sun · Jiahui Yang · Hujun Bao · Changqing Zou
|
ExHall D Poster #2 | |
Dynamic Pseudo Labeling via Gradient Cutting for High-Low Entropy Exploration
Poster Session 4
Jae Hyeon Park · Joo Hyeon Jeon · Jae Yun Lee · Sangyeon Ahn · MinHee Cha · Min Geol Kim · Hyeok Nam · Sung In Cho
|
ExHall D Poster #456 | |
Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation
Poster Session 4
Byung Hyun Lee · Sungjin Lim · Se Young Chun
|
ExHall D Poster #269 | |
NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting
Yulong Zheng · Zicheng Jiang · Shengfeng He · Yandu Sun · Junyu Dong · Huaidong Zhang · Yong Du
|
ExHall D Poster #63 | |
DyCON: Dynamic Uncertainty-aware Consistency and Contrastive Learning for Semi-supervised Medical Image Segmentation
Poster Session 6
Maregu Assefa · Muzammal Naseer · Iyyakutti Iyappan Ganapathi · Syed Sadaf Ali · Mohamed L Seghier · Naoufel Werghi
|
ExHall D Poster #450 | |
Exploiting Temporal State Space Sharing for Video Semantic Segmentation
Poster Session 5
Hesham Syed · Yun Liu · Guolei Sun · Henghui Ding · Jing Yang · Ender Konukoglu · Xue Geng · Xudong Jiang
|
ExHall D Poster #305 | |
HUSH: Holistic Panoramic 3D Scene Understanding using Spherical Harmonics
Poster Session 4
Jongsung Lee · Harin Park · Byeong-Uk Lee · Kyungdon Joo
|
ExHall D Poster #73 | |
UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior
I-Hsiang (Aaron) Chen · Wei-Ting Chen · Yu-Wei Liu · Yuan-Chun Chiang · Sy-Yen Kuo · Ming-Hsuan Yang
|
ExHall D Poster #206 | |
Effective SAM Combination for Open-Vocabulary Semantic Segmentation
Poster Session 6
Minhyeok Lee · Suhwan Cho · Jungho Lee · Sunghun Yang · Heeseung Choi · Ig-Jae Kim · Sangyoun Lee
|
ExHall D Poster #383 | |
Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free Unsupervised Domain Adaptation
Poster Session 6
Peihua Deng · Jiehua Zhang · Xichun Sheng · Chenggang Yan · Yaoqi Sun · Ying Fu · Liang Li
|
ExHall D Poster #423 | |
VinaBench: Benchmark for Faithful and Consistent Visual Narratives
Poster Session 1
Silin Gao · Sheryl Mathew · Li Mi · Sepideh Mamooler · Mengjie Zhao · Hiromi Wakaki · Yuki Mitsufuji · Syrielle Montariol · Antoine Bosselut
|
ExHall D Poster #259 | |
Textured Gaussians for Enhanced 3D Scene Appearance Modeling
Poster Session 2
Brian Chao · Hung-Yu Tseng · Lorenzo Porzi · Chen Gao · Tuotuo Li · Qinbo Li · Ayush Saraf · Jia-Bin Huang · Johannes Kopf · Gordon Wetzstein · Changil Kim
|
ExHall D Poster #344 | |
Decentralized Diffusion Models
Poster Session 5
David McAllister · Matthew Tancik · Jiaming Song · Angjoo Kanazawa
|
ExHall D Poster #217 | |
Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance
Poster Session 3
Dimitrios Gerogiannis · Foivos Paraperas Papantoniou · Rolandos Alexandros Potamias · Alexandros Lattas · Stefanos Zafeiriou
|
ExHall D Poster #11 | |
High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight
Poster Session 1
Cédric Vincent · Taehyoung Kim · Henri Meeß
|
ExHall D Poster #121 | |
Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM
Poster Session 2
Qiyuan Dai · Sibei Yang
|
ExHall D Poster #397 | |
EDM: Equirectangular Projection-Oriented Dense Kernelized Feature Matching
Poster Session 2
Dongki Jung · Jaehoon Choi · Yonghan Lee · Somi Jeong · Taejae Lee · Dinesh Manocha · Suyong Yeon
|
ExHall D Poster #92 | |
Sufficient Invariant Learning for Distribution Shift
Poster Session 1
Taero Kim · Subeen Park · Sungjun Lim · Yonghan Jung · Krikamol Muandet · Kyungwoo Song
|
ExHall D Poster #458 | |
CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation
Poster Session 5
Yuxing Long · Jiyao Zhang · Mingjie Pan · Tianshu Wu · Taewhan Kim · Hao Dong
|
ExHall D Poster #146 | |
Explaining in Diffusion: Explaining a Classifier with Diffusion Semantics
Poster Session 3
Tahira Kazimi · Ritika Allada · Pinar Yanardag
|
ExHall D Poster #397 | |
LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions
Poster Session 1
Faridoun Mehri · Mahdieh Baghshah · Mohammad Taher Pilehvar
|
ExHall D Poster #397 | |
VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
Poster Session 5
Ryota Tanaka · Taichi Iki · Taku Hasegawa · Kyosuke Nishida · Kuniko Saito · Jun Suzuki
|
ExHall D Poster #363 | |
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
Poster Session 3
Ning Gao · Yilun Chen · Shuai Yang · Xinyi Chen · Yang Tian · Hao Li · Haifeng Huang · Hanqing Wang · Tai Wang · Jiangmiao Pang
|
ExHall D Poster #148 | |
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Poster Session 6
Ho Kei Cheng · Masato Ishii · Akio Hayakawa · Takashi Shibuya · Alexander G. Schwing · Yuki Mitsufuji
|
ExHall D Poster #260 | |
PS-EIP: Robust Photometric Stereo Based on Event Interval Profile
Poster Session 2
Kazuma Kitazawa · Takahito Aoto · Satoshi Ikehata · Tsuyoshi Takatani
|
ExHall D Poster #77 | |
D^3-Human: Dynamic Disentangled Digital Human from Monocular Video
Poster Session 3
Honghu Chen · Bo Peng · Yunfan Tao · Juyong Zhang
|
ExHall D Poster #17 | |
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Poster Session 5
Jianing Yang · Alexander Sax · Kevin Liang · Mikael Henaff · Hao Tang · Ang Cao · Joyce Chai · Franziska Meier · Matt Feiszli
|
ExHall D Poster #83 | |
Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observations
Poster Session 1
Shengeng Tang · Jiayi He · Lechao Cheng · Jingjing Wu · Dan Guo · Richang Hong
|
ExHall D Poster #316 | |
PDFactor: Learning Tri-Perspective View Policy Diffusion Field for Multi-Task Robotic Manipulation
Poster Session 4
Jingyi Tian · Le Wang · Sanping Zhou · Sen Wang · lijiayi · Haowen Sun · Wei Tang
|
ExHall D Poster #144 | |
FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
Poster Session 3
Sen Wang · Le Wang · Sanping Zhou · Jingyi Tian · lijiayi · Haowen Sun · Wei Tang
|
ExHall D Poster #147 | |
FiRe: Fixed-points of Restoration Priors for Solving Inverse Problems
Poster Session 5
Matthieu Terris · Ulugbek Kamilov · Thomas Moreau
|
ExHall D Poster #203 | |
Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation
Poster Session 4
Libiao Chen · Dong Nie · Junjun Pan · Jing Yan · Zhenyu Tang
|
ExHall D Poster #427 | |
SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model
Poster Session 1
Shuhan Tan · John Wheatley Lambert · Hong Jeon · Sakshum Kulshrestha · Yijing Bai · Jing Luo · Dragomir Anguelov · Mingxing Tan · Chiyu “Max” Jiang
|
ExHall D Poster #131 | |
Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation
Poster Session 6
Tanner Schmidt · Richard Newcombe
|
ExHall D Poster #313 | |
Hyperbolic Uncertainty-Aware Few-Shot Incremental Point Cloud Segmentation
Poster Session 3
Tanuj Sur · Samrat Mukherjee · Kaizer Rahaman · Subhasis Chaudhuri · Muhammad Haris Khan · Biplab Banerjee
|
ExHall D Poster #113 | |
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
Poster Session 3
Stefan Andreas Baumann · Felix Krause · Michael Neumayr · Nick Stracke · Melvin Sevi · Vincent Tao Hu · Björn Ommer
|
ExHall D Poster #246 | |
SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving
Poster Session 3
Xuesong Chen · Linjiang Huang · Tao Ma · Rongyao Fang · Shaoshuai Shi · Hongsheng Li
|
ExHall D Poster #137 | |
LSNet: See Large, Focus Small
Poster Session 2
Ao Wang · Hui Chen · Zijia Lin · Jungong Han · Guiguang Ding
|
ExHall D Poster #414 | |
A Flag Decomposition for Hierarchical Datasets
Poster Session 4
Nathan Mankovich · Ignacio Santamaria · Gustau Camps-Valls · Tolga Birdal
|
ExHall D Poster #282 | |
PhysAnimator: Physics-Guided Generative Cartoon Animation
Poster Session 3
Tianyi Xie · Yiwei Zhao · Ying Jiang · Chenfanfu Jiang
|
ExHall D Poster #13 | |
TANGO: Training-free Embodied AI Agents for Open-world Tasks
Poster Session 5
Filippo Ziliotto · Tommaso Campari · Luciano Serafini · Lamberto Ballan
|
ExHall D Poster #342 | |
Detecting Out-of-Distribution Through the Lens of Neural Collapse
Poster Session 3
Litian Liu · Yao Qin
|
ExHall D Poster #457 | |
Focal Split: Untethered Snapshot Depth from Differential Defocus
Poster Session 6
Junjie Luo · John Mamish · Alan Fu · Thomas Concannon · Josiah Hester · Emma Alexander · Qi Guo
|
ExHall D Poster #78 | |
Task-Aware Clustering for Prompting Vision-Language Models
Poster Session 3
Fusheng Hao · Fengxiang He · Fuxiang Wu · Tichao Wang · Chengqun Song · Jun Cheng
|
ExHall D Poster #392 | |
Forensic Self-Descriptions Are All You Need for Zero-Shot Detection, Open-Set Source Attribution, and Clustering of AI-generated Images
Poster Session 1
Tai Nguyen · Aref Azizpour · Matthew Stamm
|
ExHall D Poster #275 | |
Evaluating Model Perception of Color Illusions in Photorealistic Scenes
Poster Session 2
Lingjun Mao · Zineng Tang · Alane Suhr
|
ExHall D Poster #233 | |
CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images
Poster Session 4
Jungho Lee · Suhwan Cho · Taeoh Kim · Ho-Deok Jang · Minhyeok Lee · Geonho Cha · Dongyoon Wee · Dogyoon Lee · Sangyoun Lee
|
ExHall D Poster #24 | |
CARL: A Framework for Equivariant Image Registration
Poster Session 5
Hastings Greer · Lin Tian · François-Xavier Vialard · Roland Kwitt · Raúl San José Estépar · Marc Niethammer
|
ExHall D Poster #478 | |
Autoregressive Distillation of Diffusion Transformers
Poster Session 4
Yeongmin Kim · Sotiris Anagnostidis · Yuming Du · Edgar Schoenfeld · Jonas Kohler · Markos Georgopoulos · Albert Pumarola · Ali Thabet · Artsiom Sanakoyeu
|
ExHall D Poster #230 | |
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute
Sotiris Anagnostidis · Gregor Bachmann · Yeongmin Kim · Jonas Kohler · Markos Georgopoulos · Artsiom Sanakoyeu · Yuming Du · Albert Pumarola · Ali Thabet · Edgar Schoenfeld
|
ExHall D Poster #205 | |
Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models
Poster Session 2
Kartik Thakral · Tamar Glaser · Tal Hassner · Mayank Vatsa · Richa Singh
|
ExHall D Poster #358 | |
FSboard: Over 3 Million Characters of ASL Fingerspelling Collected via Smartphones
Poster Session 3
Manfred Georg · Garrett Tanzer · Esha Uboweja · Saad Hassan · Maximus Shengelia · Sam Sepah · Sean Forbes · Thad Starner
|
ExHall D Poster #310 | |
Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space
Poster Session 2
Leonhard Sommer · Olaf Dünkel · Christian Theobalt · Adam Kortylewski
|
ExHall D Poster #104 | |
Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields
Poster Session 3
Navami Kairanda · Marc Habermann · Shanthika Shankar Naik · Christian Theobalt · Vladislav Golyanik
|
ExHall D Poster #68 | |
LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty
Poster Session 2
Christoforos N. Spartalis · Theodoros Semertzidis · Efstratios Gavves · Petros Daras
|
ExHall D Poster #445 | |
Pattern Analogies: Learning to Perform Programmatic Image Edits by Analogy
Poster Session 6
Aditya Ganeshan · Thibault Groueix · Paul Guerrero · Radomir Mech · Matthew Fisher · Daniel Ritchie
|
ExHall D Poster #243 | |
RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges
Poster Session 6
Thibaut Loiseau · Guillaume Bourmaud
|
ExHall D Poster #88 | |
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics
Lee Chae-Yeon · Hyun-Bin Oh · EunGi Han · Kim Sung-Bin · Suekyeong Nam · Tae-Hyun Oh
|
ExHall D Poster #2 | |
ProbeSDF: Light Field Probes For Neural Surface Reconstruction
Poster Session 3
Briac Toussaint · Diego Thomas · Jean-Sébastien Franco
|
ExHall D Poster #36 | |
Finsler Multi-Dimensional Scaling: Manifold Learning for Asymmetric Dimensionality Reduction and Embedding
Poster Session 5
Thomas Dagès · Simon Weber · Ya-Wei Eileen Lin · Ronen Talmon · Daniel Cremers · Michael Lindenbaum · Alfred M. Bruckstein · Ron Kimmel
|
ExHall D Poster #462 | |
Graph Neural Network Combining Event Stream and Periodic Aggregation for Low-Latency Event-based Vision
Manon Dampfhoffer · Thomas Mesquida · Damien Joubert · Thomas Dalgaty · Pascal Vivet · Christoph Posch
|
ExHall D Poster #147 | |
A Theory of Learning Unified Model via Knowledge Integration from Label Space Varying Domains
Poster Session 2
Dexuan Zhang · Thomas Westfechtel · Tatsuya Harada
|
ExHall D Poster #454 | |
The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition
Poster Session 2
Otto Brookes · Maksim Kukushkin · Majid Mirmehdi · Colleen Stephens · Paula Dieguez · Thurston Cleveland Hicks · Sorrel CZ Jones · Kevin C. Lee · Maureen S. McCarthy · Amelia C. Meier · NORMAND E. · Erin G. Wessling · Roman M. Wittig · Kevin Langergraber · Klaus Zuberbühler · Lukas Boesch · Thomas Schmid · Mimi Arandjelovic · Hjalmar S. Kühl · Tilo Burghardt
|
ExHall D Poster #277 | |
CrossSDF: 3D Reconstruction of Thin Structures From Cross-Sections
Poster Session 6
Thomas Walker · Salvatore Esposito · Daniel Rebain · Amir Vaxman · Arno Onken · Changjian Li · Oisin Mac Aodha
|
ExHall D Poster #457 | |
Geometry-guided Online 3D Video Synthesis with Multi-View Temporal Consistency
Poster Session 3
Hyunho Ha · Lei Xiao · Christian Richardt · Thu Nguyen-Phuoc · Changil Kim · Min H. Kim · Douglas Lanman · Numair Khan
|
ExHall D Poster #59 | |
Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing
Poster Session 5
Bingliang Zhang · Wenda Chu · Julius Berner · Chenlin Meng · Anima Anandkumar · Yang Song
|
ExHall D Poster #200 | |
POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation
Poster Session 3
Jian Wang · Tianhong Dai · Bingfeng Zhang · Siyue Yu · Eng Gee Lim · Jimin Xiao
|
ExHall D Poster #422 | |
Perceptual Inductive Bias Is What You Need Before Contrastive Learning
Poster Session 2
Junru Zhao · Tianqin Li · Dunhan Jiang · Shenghao Wu · Alan Ramirez · Tai Sing Lee
|
ExHall D Poster #405 | |
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
Poster Session 1
Shuming Liu · Chen Zhao · Tianqi Xu · Bernard Ghanem
|
ExHall D Poster #301 | |
Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection
Poster Session 2
Ting Li · Mao Ye · Tianwen Wu · Nianxin Li · Shuaifeng Li · Song Tang · Luping Ji
|
ExHall D Poster #128 | |
Degradation-Aware Feature Perturbation for All-in-One Image Restoration
Poster Session 6
Xiangpeng Tian · Xiangyu Liao · Xiao Liu · Meng Li · Chao Ren
|
ExHall D Poster #191 | |
KMD: Koopman Multi-modality Decomposition for Generalized Brain Tumor Segmentation under Incomplete Modalities
Poster Session 3
Tianyi Liu · Haochuan Jiang · Kaizhu Huang
|
ExHall D Poster #480 | |
Enhancing Few-Shot Class-Incremental Learning via Training-Free Bi-Level Modality Calibration
Poster Session 2
Yiyang Chen · Tianyu Ding · Lei Wang · Jing Huo · Yang Gao · Wenbin Li
|
ExHall D Poster #429 | |
Adaptive Keyframe Sampling for Long Video Understanding
Poster Session 6
Xi Tang · Jihao Qiu · Lingxi Xie · Yunjie Tian · Jianbin Jiao · Qixiang Ye
|
ExHall D Poster #284 | |
Revisiting Backdoor Attacks against Large Vision-Language Models from Domain Shift
Poster Session 2
Siyuan Liang · Jiawei Liang · Tianyu Pang · Chao Du · Aishan Liu · Mingli Zhu · Xiaochun Cao · Dacheng Tao
|
ExHall D Poster #391 | |
MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views
Antoine Guédon · Tomoki Ichikawa · Kohei Yamashita · Ko Nishino
|
ExHall D Poster #53 | |
Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis
Poster Session 1
Tim Büchner · Christoph Anders · Orlando Guntinas-Lichius · Joachim Denzler
|
ExHall D Poster #5 | |
CLOC: Contrastive Learning for Ordinal Classification with Multi-Margin N-pair Loss
Poster Session 3
Dileepa Pitawela · Gustavo Carneiro · Hsiang-Ting Chen
|
ExHall D Poster #468 | |
Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
Seil Kang · Jinyeong Kim · Junhyeok Kim · Seong Jae Hwang
|
ExHall D Poster #378 | |
Unsupervised Foundation Model-Agnostic Slide-Level Representation Learning
Poster Session 6
Tim Lenz · Peter Neidlinger · Marta Ligero · Georg Wölflein · Marko van Treeck · Jakob Nikolas Kather
|
ExHall D Poster #446 | |
Theory-Inspired Deep Multi-View Multi-Label Learning with Incomplete Views and Noisy Labels
Poster Session 4
Quanjiang Li · Tingjin Luo · Jiahui Liao
|
ExHall D Poster #466 | |
Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Poster Session 5
Zhangqi Jiang · Junkai Chen · Beier Zhu · Tingjin Luo · Yankun Shen · Xu Yang
|
ExHall D Poster #379 | |
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition
Poster Session 2
Jiawei Lin · Shizhao Sun · Danqing Huang · Ting Liu · Ji Li · Jiang Bian
|
ExHall D Poster #263 | |
Hyperspectral Pansharpening via Diffusion Models with Iteratively Zero-Shot Guidance
Poster Session 3
Jin-Liang Xiao · Ting-Zhu Huang · Liang-Jian Deng · Guang Lin · Zihan Cao · Chao Li · Qibin Zhao
|
ExHall D Poster #193 | |
Charm: The Missing Piece in ViT Fine-Tuning for Image Aesthetic Assessment
Poster Session 2
Fatemeh Behrad · Tinne Tuytelaars · Johan Wagemans
|
ExHall D Poster #234 | |
TinyFusion: Diffusion Transformers Learned Shallow
Gongfan Fang · Kunjun Li · Xinyin Ma · Xinchao Wang
|
ExHall D Poster #223 | |
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
Poster Session 3
Saksham Kushwaha · Yapeng Tian
|
ExHall D Poster #275 | |
EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
Poster Session 3
Dong In Lee · Hyeongcheol Park · Jiyoung Seo · Eunbyung Park · Hyunje Park · Ha Dam Baek · Shin Sangheon · Sangmin Kim · Sangpil Kim
|
ExHall D Poster #46 | |
Your ViT is Secretly an Image Segmentation Model
Poster Session 5
Tommie Kerssies · Niccolò Cavagnero · Alexander Hermans · Narges Norouzi · Giuseppe Averta · Bastian Leibe · Gijs Dubbelman · Daan de Geus
|
ExHall D Poster #407 | |
Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception
Poster Session 5
Luke Chen · Junyao Wang · Trier Mortlock · Pramod Khargonekar · Mohammad Al Faruque
|
ExHall D Poster #120 | |
NSD-Imagery: A Benchmark Dataset for Extending fMRI Vision Decoding Methods to Mental Imagery
Reese Kneeland · Paul Scotti · Ghislain St-Yves · Jesse L Breedlove · Kendrick N Kay · Thomas Naselaris
|
ExHall D Poster #256 | |
ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models
Poster Session 6
Ozgur Kara · Krishna Kumar Singh · Feng Liu · Duygu Ceylan · James Rehg · Tobias Hinz
|
ExHall D Poster #213 | |
Hyperbolic Safety-Aware Vision-Language Models
Tobia Poppi · Tejaswi Kasarla · Pascal Mettes · Lorenzo Baraldi · Rita Cucchiara
|
ExHall D Poster #387 | |
GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion
Poster Session 2
Jiapeng Tang · Davide Davoli · Tobias Kirschstein · Liam Schoneveld · Matthias Nießner
|
ExHall D Poster #10 | |
SoundVista: Novel-View Ambient Sound Synthesis via Visual-Acoustic Binding
Mingfei Chen · Israel D. Gebru · Ishwarya Ananthabhotla · Christian Richardt · Dejan Markovic · Steven Krenn · Todd Keebler · Jacob Sandakly · Alexander Richard · Eli Shlizerman
|
ExHall D Poster #283 | |
Interpretable Generative Models through Post-hoc Concept Bottlenecks
Poster Session 2
Akshay R. Kulkarni · Ge Yan · Chung-En Sun · Tuomas Oikarinen · Tsui-Wei Weng
|
ExHall D Poster #266 | |
RelationField: Relate Anything in Radiance Fields
Poster Session 5
Sebastian Koch · Johanna Wald · Mirco Colosi · Narunas Vaskevicius · Pedro Hermosilla · Federico Tombari · Timo Ropinski
|
ExHall D Poster #62 | |
Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation
Poster Session 2
Reza Qorbani · Gianluca Villani · Theodoros Panagiotakopoulos · Marc Botet Colomer · Linus Härenstam-Nielsen · Mattia Segu · Pier Luigi Dovesi · Jussi Karlgren · Daniel Cremers · Federico Tombari · Matteo Poggi
|
ExHall D Poster #422 | |
Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)
Poster Session 1
Tomer Garber · Tom Tirer
|
ExHall D Poster #210 | |
Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models
Poster Session 4
Reza Shirkavand · Peiran Yu · Shangqian Gao · Gowthami Somepalli · Tom Goldstein · Heng Huang
|
ExHall D Poster #271 | |
Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Poster Session 2
Jiange Yang · Haoyi Zhu · Yating Wang · Gangshan Wu · Tong He · Limin Wang
|
ExHall D Poster #152 | |
Towards Scalable Human-aligned Benchmark for Text-guided Image Editing
Poster Session 4
Suho Ryu · Kihyun Kim · Eugene Baek · Dongsoo Shin · Joonseok Lee
|
ExHall D Poster #238 | |
Gaussian Splatting Feature Fields for (Privacy-Preserving) Visual Localization
Poster Session 1
Maxime Pietrantoni · Gabriela Csurka · Torsten Sattler
|
ExHall D Poster #85 | |
EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering
Poster Session 6
Toshiya Yura · Ashkan Mirzaei · Igor Gilitschenski
|
ExHall D Poster #70 | |
Unsupervised Discovery of Facial Landmarks and Head Pose
Poster Session 5
Satyajit Tourani · Siddharth Tourani · Arif Mahmood · Muhammad Haris Khan
|
ExHall D Poster #14 | |
Video Depth without Video Models
Poster Session 2
Bingxin Ke · Dominik Narnhofer · Shengyu Huang · Lei Ke · Torben Peters · Katerina Fragkiadaki · Anton Obukhov · Konrad Schindler
|
ExHall D Poster #179 | |
CoLLM: A Large Language Model for Composed Image Retrieval
Poster Session 1
Chuong Huynh · Jinyu Yang · Ashish Tawari · Mubarak Shah · Son Dinh Tran · Raffay Hamid · Trishul Chilimbi · Abhinav Shrivastava
|
ExHall D Poster #364 | |
ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On
Poster Session 6
Ji Woo Hong · Tri Ton · Trung X. Pham · Gwanhyeong Koo · Sunjae Yoon · Chang D. Yoo
|
ExHall D Poster #202 | |
MESC-3D: Mining Effective Semantic Cues for 3D Reconstruction from a Single Image
Poster Session 4
Shaoming Li · Qing Cai · Songqi KONG · Runqing Tan · Heng Tong · Shiji Qiu · Yongguo Jiang · Zhi Liu
|
ExHall D Poster #105 | |
Hybrid Reciprocal Transformer with Triplet Feature Alignment for Scene Graph Generation
Poster Session 2
Jiawei Fu · ZHANG Tiantian · Kai Chen · Qi Dou
|
ExHall D Poster #343 | |
Poly-Autoregressive Prediction for Modeling Interactions
Poster Session 3
Neerja Thakkar · Tara Sadjadpour · Jathushan Rajasegaran · Shiry Ginosar · Jitendra Malik
|
ExHall D Poster #167 | |
Multi-Group Proportional Representations for Text-to-Image Models
Poster Session 5
Sangwon Jung · Alex Oesterling · Claudio Mayrink Verdun · Sajani Vithana · Taesup Moon · Flavio Calmon
|
ExHall D Poster #261 | |
Quaffure: Real-Time Quasi-Static Neural Hair Simulation
Poster Session 1
Tuur Stuyck · Gene Wei-Chin Lin · Egor Larionov · Hsiaoyu Chen · Aljaž Božič · Nikolaos Sarafianos · Doug Roble
|
ExHall D Poster #7 | |
PGC: Physics-Based Gaussian Cloth from a Single Pose
Michelle Guo · Matt Jen-Yuan Chiang · Igor Santesteban · Nikolaos Sarafianos · Hsiaoyu Chen · Oshri Halimi · Aljaž Božič · Shunsuke Saito · Jiajun Wu · Karen Liu · Tuur Stuyck · Egor Larionov
|
ExHall D Poster #16 | |
Dynamic Camera Poses and Where to Find Them
Poster Session 3
Chris Rockwell · Joseph Tung · Tsung-Yi Lin · Ming-Yu Liu · David Fouhey · Chen-Hsuan Lin
|
ExHall D Poster #171 | |
Advancing Manga Analysis: Comprehensive Segmentation Annotations for the Manga109 Dataset
Poster Session 2
Minshan XIE · Jian Lin · Hanyuan Liu · Chengze Li · Tien-Tsin Wong
|
ExHall D Poster #335 | |
One Diffusion to Generate Them All
Poster Session 1
Duong H. Le · Tuan Pham · Sangho Lee · Christopher Clark · Aniruddha Kembhavi · Stephan Mandt · Ranjay Krishna · Jiasen Lu
|
ExHall D Poster #240 | |
EditAR: Unified Conditional Generation with Autoregressive Models
Poster Session 2
Jiteng Mu · Nuno Vasconcelos · Xiaolong Wang
|
ExHall D Poster #242 | |
CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models
Poster Session 2
Felix Taubner · Ruihang Zhang · Mathieu Tuli · David B. Lindell
|
ExHall D Poster #9 | |
Causal Composition Diffusion Model for Closed-loop Traffic Generation
Poster Session 6
Haohong Lin · Xin Huang · Tung Phan-Minh · David S Hayden · Huan Zhang · Ding Zhao · Siddhartha Srinivasa · Eric M. Wolff · Hongge Chen
|
ExHall D Poster #132 | |
ExpertAF: Expert Actionable Feedback from Video
Poster Session 3
Kumar Ashutosh · Tushar Nagarajan · Georgios Pavlakos · Kris Kitani · Kristen Grauman
|
ExHall D Poster #280 | |
Rethinking Training for De-biasing Text-to-Image Generation: Unlocking the Potential of Stable Diffusion
Poster Session 3
Eunji Kim · Siwon Kim · Minjun Park · Rahim Entezari · Sungroh Yoon
|
ExHall D Poster #258 | |
ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images
Poster Session 2
Yanqing Shen · Turcan Tuna · Marco Hutter · Cesar Cadena · Nanning Zheng
|
ExHall D Poster #123 | |
CALICO: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models
Poster Session 1
Kiet A. Nguyen · Adheesh Juvekar · Tianjiao Yu · Muntasir Wahed · Ismini Lourentzou
|
ExHall D Poster #420 | |
Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights
Poster Session 3
Ondrej Tybl · Lukas Neumann
|
ExHall D Poster #404 | |
LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living
Poster Session 5
Dominick Reilly · Rajatsubhra Chakraborty · Arkaprava Sinha · Manish Kumar Govind · Pu Wang · Francois Bremond · Le Xue · Srijan Das
|
ExHall D Poster #313 | |
Noise Calibration and Spatial-Frequency Interactive Network for STEM Image Enhancement
Poster Session 5
Hesong Li · Ziqi Wu · Ruiwen Shao · Tao Zhang · Ying Fu
|
ExHall D Poster #23 | |
Blurred LiDAR for Sharper 3D: Robust Handheld 3D Scanning with Diffuse LiDAR and RGB
Nikhil Behari · Aaron Young · Siddharth Somasundaram · Tzofi Klinghoffer · Akshat Dave · Ramesh Raskar
|
ExHall D Poster #77 | |
EZSR: Event-based Zero-Shot Recognition
Poster Session 1
Yan Yang · Liyuan Pan · Dongxu Li · Liu Liu
|
ExHall D Poster #427 | |
Towards Source-Free Machine Unlearning
Poster Session 1
Sk Miraj Ahmed · Umit Basaran · Dripta S. Raychaudhuri · Arindam Dutta · Rohit Kundu · Fahim Faisal Niloy · Basak Guler · Amit K. Roy-Chowdhury
|
ExHall D Poster #457 | |
Instance-wise Supervision-level Optimization in Active Learning
Poster Session 1
Shinnosuke Matsuo · Riku Togashi · Ryoma Bise · Seiichi Uchida · Masahiro Nomura
|
ExHall D Poster #456 | |
Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
Poster Session 3
Sayak Nag · Udita Ghosh · Calvin-Khang Ta · Sarosij Bose · Jiachen Li · Amit K. Roy-Chowdhury
|
ExHall D Poster #99 | |
Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Large Language Models through Egocentric Instruction Tuning
Poster Session 3
JiHyeok Jung · EunTae Kim · SeoYeon Kim · Joo Ho Lee · Bumsoo Kim · Buru Chang
|
ExHall D Poster #345 | |
Efficient Data Driven Mixture-of-Expert Extraction from Trained Networks
Poster Session 4
Uranik Berisha · Jens Mehnert · Alexandru Paul Condurache
|
ExHall D Poster #408 | |
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression
Poster Session 3
Uri Gadot · Shie Mannor · Assaf Shocher · Gal Chechik · Assaf Hallak
|
ExHall D Poster #179 | |
MAD: Memory-Augmented Detection of 3D Objects
Poster Session 1
Ben Agro · Sergio Casas · Patrick Wang · Thomas Gilles · Raquel Urtasun
|
ExHall D Poster #120 | |
DIO: Decomposable Implicit 4D Occupancy-Flow World Model
Poster Session 6
Christopher Diehl · Quinlan Sykora · Ben Agro · Thomas Gilles · Sergio Casas · Raquel Urtasun
|
ExHall D Poster #124 | |
CaMuViD: Calibration-Free Multi-View Detection
Poster Session 1
Amir Etefaghi Daryani · M. Usman Maqbool Bhutta · Byron Hernandez · Henry Medeiros
|
ExHall D Poster #98 | |
Boltzmann Attention Sampling for Image Analysis with Small Objects
Poster Session 5
Theodore Zhao · Sid Kiblawi · Mu Wei · Ho Hin Lee · J. Samuel Preston · Naoto Usuyama · Hoifung Poon
|
ExHall D Poster #472 | |
Faster Parameter-Efficient Tuning with Token Redundancy Reduction
Poster Session 6
Kwonyoung Kim · Jungin Park · Jin Kim · Hyeongjun Kwon · Kwanghoon Sohn
|
ExHall D Poster #387 | |
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding
Poster Session 3
Thanh-Dat Truong · Utsav Prabhu · Bhiksha Raj · Jackson Cothren · Khoa Luu
|
ExHall D Poster #423 | |
SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction
Poster Session 6
Zhengyuan Li · Kai Cheng · Anindita Ghosh · Uttaran Bhattacharya · Liangyan Gui · Aniket Bera
|
ExHall D Poster #158 | |
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Poster Session 1
Roberto Henschel · Levon Khachatryan · Hayk Poghosyan · Daniil Hayrapetyan · Vahram Tadevosyan · Zhangyang Wang · Shant Navasardyan · Humphrey Shi
|
ExHall D Poster #230 | |
Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds
Poster Session 5
Mohamed Abdelsamad · Michael Ulrich · Claudius Glaeser · Abhinav Valada
|
ExHall D Poster #113 | |
Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models
Poster Session 4
Shuyang Hao · Bryan Hooi · Jun Liu · Kai-Wei Chang · Zi Huang · Yujun Cai
|
ExHall D Poster #389 | |
MARBLE: Material Recomposition and Blending in CLIP-Space
Poster Session 3
Ta-Ying Cheng · Prafull Sharma · Mark Boss · Varun Jampani
|
ExHall D Poster #230 | |
MIRE: Matched Implicit Neural Representations
Poster Session 2
Dhananjaya Jayasundara · Heng Zhao · Demetrio Labate · Vishal M. Patel
|
ExHall D Poster #278 | |
PolarFree: Polarization-based Reflection-Free Imaging
Poster Session 3
Mingde Yao · Menglu Wang · King Man Tam · Lingen Li · Tianfan Xue · Jinwei Gu
|
ExHall D Poster #22 | |
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Poster Session 4
Chan Hee Song · Valts Blukis · Jonathan Tremblay · Stephen Tyree · Yu Su · Stan Birchfield
|
ExHall D Poster #146 | |
3D-MVP: 3D Multiview Pretraining for Manipulation
Poster Session 5
Shengyi Qian · Kaichun Mo · Valts Blukis · David Fouhey · Dieter Fox · Ankit Goyal
|
ExHall D Poster #140 | |
UnCommon Objects in 3D
Poster Session 3
Xingchen Liu · Piyush Tayal · Jianyuan Wang · Jesus Zarzar · Tom Monnier · Konstantinos Tertikas · Jiali Duan · Antoine Toisoul · Jason Y. Zhang · Natalia Neverova · Andrea Vedaldi · Roman Shapovalov · David Novotny
|
ExHall D Poster #331 | |
Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion
Jona Ballé · Luca Versari · Emilien Dupont · Hyunjik Kim · Matthias Bauer
|
ExHall D Poster #210 | |
MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World
Poster Session 3
Ankit Dhiman · Manan Shah · R. Venkatesh Babu
|
ExHall D Poster #56 | |
Compass Control: Multi Object Orientation Control for Text-to-Image Generation
Poster Session 1
Rishubh Parihar · Vaibhav Agrawal · Sachidanand VS · R. Venkatesh Babu
|
ExHall D Poster #252 | |
Composing Parts for Expressive Object Generation
Poster Session 3
Harsh Rangwani · Aishwarya Agarwal · Kuldeep Kulkarni · R. Venkatesh Babu · Srikrishna Karanam
|
ExHall D Poster #244 | |
MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection
Poster Session 2
Rishubh Parihar · Srinjay Sarkar · Sarthak Vora · Jogendra Kundu · R. Venkatesh Babu
|
ExHall D Poster #110 | |
Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis
Poster Session 4
Muhammad Hamza Mughal · Rishabh Dabral · Merel CJ Scholman · Vera Demberg · Christian Theobalt
|
ExHall D Poster #71 | |
A Bias-Free Training Paradigm for More General AI-generated Image Detection
Poster Session 4
Fabrizio Guillaro · Giada Zingarini · Ben Usman · Avneesh Sud · Davide Cozzolino · Luisa Verdoliva
|
ExHall D Poster #277 | |
Geometry Field Splatting with Gaussian Surfels
Poster Session 2
Kaiwen Jiang · Venkataram Sivaram · Cheng Peng · Ravi Ramamoorthi
|
ExHall D Poster #29 | |
Taxonomy-Aware Evaluation of Vision-Language Models
Poster Session 2
Vésteinn Snæbjarnarson · Kevin Du · Niklas Stoehr · Serge Belongie · Ryan Cotterell · Nico Lang · Stella Frank
|
ExHall D Poster #357 | |
ERUPT: Efficient Rendering with Unposed Patch Transformer
Poster Session 2
Maxim Shugaev · Vincent Chen · Maxim Karrenbach · Kyle Ashley · Bridget Kennedy · Naresh Cuntoor
|
ExHall D Poster #59 | |
TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction
Aishwarya Agarwal · Srikrishna Karanam · Vineet Gandhi
|
ExHall D Poster #389 | |
Image Reconstruction from Readout-Multiplexed Single-Photon Detector Arrays
Shashwath Bharadwaj · Ruangrawee Kitichotkul · Akshay Agarwal · Vivek K Goyal
|
ExHall D Poster #71 | |
PARC: A Quantitative Framework Uncovering the Symmetries within Vision Language Models
Poster Session 5
Jenny Schmalfuss · Nadine Chang · Vibashan VS · Maying Shen · Andrés Bruhn · Jose M. Alvarez
|
ExHall D Poster #386 | |
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation
Poster Session 2
Rohith Peddi · Saurabh . · Ayush Abhay Shrivastava · Parag Singla · Vibhav Giridhar Gogate
|
ExHall D Poster #313 | |
InterDyn: Controllable Interactive Dynamics with Video Diffusion Models
Poster Session 3
Rick Akkerman · Haiwen Feng · Michael J. Black · Dimitrios Tzionas · Victoria Abrevaya
|
ExHall D Poster #173 | |
Disentangling Safe and Unsafe Image Corruptions via Anisotropy and Locality
Poster Session 2
Ramchandran Muthukumar · Ambar Pal · Jeremias Sulam · Rene Vidal
|
ExHall D Poster #436 | |
Concept Lancet: Image Editing with Compositional Representation Transplant
Poster Session 6
Jinqi Luo · Tianjiao Ding · Kwan Ho Ryan Chan · Hancheng Min · Chris Callison-Burch · Rene Vidal
|
ExHall D Poster #223 | |
Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding
Poster Session 2
Atharv Mahesh Mane · Dulanga Weerakoon · Vigneshwaran Subbaraju · Sougata Sen · Sanjay Sarma · Archan Misra
|
ExHall D Poster #349 | |
Higher-Order Ratio Cycles for Fast and Globally Optimal Shape Matching
Poster Session 5
Paul Roetzer · Viktoria Ehm · Daniel Cremers · Zorah Lähner · Florian Bernard
|
ExHall D Poster #71 | |
Practical Solutions to the Relative Pose of Three Calibrated Cameras
Poster Session 5
Charalambos Tzamos · Viktor Kocur · Yaqing Ding · Daniel Barath · Zuzana Berger Haladova · Torsten Sattler · Zuzana Kukelova
|
ExHall D Poster #82 | |
Dense Match Summarization for Faster Two-view Estimation
Poster Session 1
Jonathan Astermark · Anders Heyden · Viktor Larsson
|
ExHall D Poster #86 | |
A Regularization-Guided Equivariant Approach for Image Restoration
Poster Session 1
Yulu Bai · Jiahong Fu · Qi Xie · Deyu Meng
|
ExHall D Poster #201 | |
From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport
Poster Session 5
Quentin Bouniot · Ievgen Redko · Anton Mallasto · Charlotte Laclau · Oliver Struckmeier · Karol Arndt · Markus Heinonen · Ville Kyrki · Samuel Kaski
|
ExHall D Poster #402 | |
Bias for Action: Video Implicit Neural Representations with Bias Modulation
Poster Session 6
Alper Kayabasi · Anil Kumar Vadathya · Guha Balakrishnan · Vishwanath Saragadam
|
ExHall D Poster #174 | |
Camouflage Anything: Learning to Hide using Controlled Out-painting and Representation Engineering
Poster Session 1
Biplab Das · Viswanath Gopalakrishnan
|
ExHall D Poster #327 | |
Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models
Poster Session 5
Ronghuan Wu · Wanchao Su · Jing Liao
|
ExHall D Poster #254 | |
Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
Poster Session 1
Vitor Guizilini · Zubair Irshad · Dian Chen · Greg Shakhnarovich · Rares Andrei Ambrus
|
ExHall D Poster #56 | |
ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping
Poster Session 4
Shun Iwase · Zubair Irshad · Katherine Liu · Vitor Guizilini · Robert Lee · Takuya Ikeda · Ayako Amma · Koichi Nishiwaki · Kris Kitani · Rares Andrei Ambrus · Sergey Zakharov
|
ExHall D Poster #154 | |
Segmenting Maxillofacial Structures in CBCT Volumes
Poster Session 1
Federico Bolelli · Kevin Marchesini · Niels van Nistelrooij · Luca Lumetti · Vittorio Pipoli · Elisa Ficarra · Shankeeth Vinayahalingam · Costantino Grana
|
ExHall D Poster #485 | |
Zero-Shot Styled Text Image Generation, but Make It Autoregressive
Poster Session 2
Vittorio Pippi · Fabio Quattrini · Silvia Cascianelli · Alessio Tonioni · Rita Cucchiara
|
ExHall D Poster #243 | |
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion
Poster Session 5
Trong-Tung Nguyen · Quang Nguyen · Khoi Nguyen · Anh Tran · Cuong Pham
|
ExHall D Poster #42 | |
Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking
Poster Session 1
Phuc Nguyen · Minh Luu · Anh Tran · Cuong Pham · Khoi Nguyen
|
ExHall D Poster #330 | |
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM
Poster Session 2
Vladimir Yugay · Theo Gevers · Martin R. Oswald
|
ExHall D Poster #131 | |
h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform
Poster Session 6
Toan Nguyen · Kien Do · Duc Kieu · Thin Nguyen
|
ExHall D Poster #222 | |
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Poster Session 1
Florinel Croitoru · Vlad Hondru · Radu Tudor Ionescu · Nicu Sebe · Mubarak Shah
|
ExHall D Poster #255 | |
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Poster Session 5
Dahyun Kang · Piotr Bojanowski · Huy V. Vo · Théo Moutakanni · Cijo Jose · Federico Baldassarre · Patrick Labatut · Michael Ramamonjisoa · Maxime Oquab · Timothée Darcet · Hu Xu · Shang-Wen Li · Oriane Simeoni · Marc Szafraniec
|
ExHall D Poster #370 | |
Satellite to GroundScape - Large-scale Consistent Ground View Generation from Satellite Views
Poster Session 2
Ningli Xu · Rongjun Qin
|
ExHall D Poster #60 | |
The Power of Context: How Multimodality Improves Image Super-Resolution
Poster Session 5
Kangfu Mei · Vishal M. Patel · Mojtaba Sahraee-Ardakan · Hossein Talebi · Peyman Milanfar · Mauricio Delbracio
|
ExHall D Poster #198 | |
STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models
Koushik Srivatsan · Fahad Shamshad · Muzammal Naseer · Vishal M. Patel · Karthik Nandakumar
|
ExHall D Poster #263 | |
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
Jiacong Xu · Shao-Yuan Lo · Bardia Safaei · Vishal M. Patel · Isht Dwivedi
|
ExHall D Poster #435 | |
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
Poster Session 6
Sudarshan Rajagopalan · Nithin Gopalakrishnan Nair · Jay Paranjape · Vishal M. Patel
|
ExHall D Poster #189 | |
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
Bardia Safaei · Faizan Siddiqui · Jiacong Xu · Vishal M. Patel · Shao-Yuan Lo
|
ExHall D Poster #344 | |
SINR: Sparsity Driven Compressed Implicit Neural Representations
Poster Session 1
Dhananjaya Jayasundara · Sudarshan Rajagopalan · Yasiru Ranasinghe · Trac Tran · Vishal M. Patel
|
ExHall D Poster #277 | |
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Adaptation
Poster Session 4
Arnav Mohanty Das · Gantavya Bhatt · Lilly Kumari · Sahil Verma · Jeff Bilmes
|
ExHall D Poster #450 | |
Black Swan: Abductive and Defeasible Video Reasoning in Unpredictable Events
Poster Session 5
Aditya Chinchure · Sahithya Ravi · Raymond Ng · Vered Shwartz · Boyang Li · Leonid Sigal
|
ExHall D Poster #304 | |
Low-Rank Adaptation in Multilinear Operator Networks for Security-Preserving Incremental Learning
Poster Session 5
Huu Binh Ta · Duc Nguyen · Quyen Tran · Toan Tran · Tung Pham
|
ExHall D Poster #317 | |
ProHOC: Probabilistic Hierarchical Out-of-Distribution Classification via Multi-Depth Networks
Poster Session 4
Erik Wallin · Fredrik Kahl · Lars Hammarstrand
|
ExHall D Poster #457 | |
VideoGEM: Training-free Action Grounding in Videos
Poster Session 1
Felix Vogel · Walid Bousselham · Anna Kukleva · Nina Shvetsova · Hilde Kuehne
|
ExHall D Poster #306 | |
Noise-Resistant Video Anomaly Detection via RGB Error-Guided Multiscale Predictive Coding and Dynamic Memory
Poster Session 4
Han Hu · Wenli Du · Peng Liao · Bing Wang · Siyuan Fan
|
ExHall D Poster #316 | |
Investigating the Role of Weight Decay in Enhancing Nonconvex SGD
Poster Session 3
Tao Sun · Yuhao Huang · Li Shen · Kele Xu · Bao Wang
|
ExHall D Poster #444 | |
Exposure-slot: Exposure-centric Representations Learning with Slot-in-Slot Attention for Region-aware Exposure Correction
Poster Session 4
Donggoo Jung · Daehyun Kim · Guanghui Wang · Tae Hyun Kim
|
ExHall D Poster #199 | |
NoiseCtrl: A Sampling-Algorithm-Agnostic Conditional Generation Method for Diffusion Models
Poster Session 4
Longquan Dai · He Wang · Jinhui Tang
|
ExHall D Poster #218 | |
PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation
Poster Session 4
Qihan Huang · Weilong Dai · Jinlong Liu · Wanggui He · Hao Jiang · Mingli Song · Jie Song
|
ExHall D Poster #245 | |
SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations
Poster Session 1
Krispin Wandel · Hesheng Wang
|
ExHall D Poster #90 | |
Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models
Poster Session 4
Jiuming Liu · Jinru Han · Lihao Liu · Angelica I Aviles-Rivero · Chaokang Jiang · Zhe Liu · Hesheng Wang
|
ExHall D Poster #174 | |
FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video
Andrea Boscolo Camiletto · Jian Wang · Eduardo Alvarado · Rishabh Dabral · Thabo Beeler · Marc Habermann · Christian Theobalt
|
ExHall D Poster #162 | |
Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection
Poster Session 4
Jiangyi Wang · Na Zhao
|
ExHall D Poster #431 | |
Omnidirectional Multi-Object Tracking
Poster Session 5
Kai Luo · Hao Shi · Sheng Wu · Fei Teng · Mengfei Duan · Chang Huang · Yuhang Wang · Kaiwei Wang · Kailun Yang
|
ExHall D Poster #87 | |
Generative Zero-Shot Composed Image Retrieval
Poster Session 6
Lan Wang · Wei Ao · Vishnu Naresh Boddeti · Ser-Nam Lim
|
ExHall D Poster #340 | |
Vision-Language Model IP Protection via Prompt-based Learning
Poster Session 2
Lianyu Wang · Meng Wang · Huazhu Fu · Daoqiang Zhang
|
ExHall D Poster #393 | |
MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts
Poster Session 4
Peijie Wang · Zhong-Zhi Li · Fei Yin · Dekang Ran · Cheng-Lin Liu
|
ExHall D Poster #356 | |
MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration
Poster Session 2
Boyun Li · Haiyu Zhao · Wenxin Wang · Peng Hu · Yuanbiao Gou · Xi Peng
|
ExHall D Poster #203 | |
DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering
Poster Session 1
Yihao Wang · Marcus Klasson · Matias Turkulainen · Shuzhe Wang · Juho Kannala · Arno Solin
|
ExHall D Poster #52 | |
EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation
Poster Session 1
Daikun Liu · Lei Cheng · Teng Wang · Changyin Sun
|
ExHall D Poster #168 | |
VideoDirector: Precise Video Editing via Text-to-Video Models
Poster Session 1
Yukun Wang · Longguang Wang · Zhiyuan Ma · Qibin Hu · Kai Xu · Yulan Guo
|
ExHall D Poster #232 | |
Deep Fair Multi-View Clustering with Attention KAN
HaiMing Xu · Qianqian Wang · Boyue Wang · Quanxue Gao
|
ExHall D Poster #468 | |
OmniStereo: Real-time Omnidirectional Depth Estimation with Multiview Fisheye Cameras
Poster Session 1
Jiaxi Deng · Yushen Wang · Haitao Meng · Zuoxun Hou · Yi Chang · Gang Chen
|
ExHall D Poster #78 | |
Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Poster Session 4
He Zhu · Quyu Kong · Kechun Xu · Xunlong Xia · Bing Deng · Jieping Ye · Rong Xiong · Yue Wang
|
ExHall D Poster #148 | |
iG-6DoF: Model-free 6DoF Pose Estimation for Unseen Object via Iterative 3D Gaussian Splatting
Poster Session 2
Tuo Cao · Fei Luo · Jiongming Qin · Yu Jiang · Yusen Wang · Chunxia Xiao
|
ExHall D Poster #101 | |
Reducing Class-wise Confusion for Incremental Learning with Disentangled Manifolds
Poster Session 2
Huitong Chen · Yu Wang · Yan Fan · Guosong Jiang · Qinghua Hu
|
ExHall D Poster #452 | |
COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation
Poster Session 2
Fanding Huang · Jingyan Jiang · Qinting Jiang · Li Hebei · Faisal Nadeem Khan · Zhi Wang
|
ExHall D Poster #419 | |
Dual Prompting Image Restoration with Diffusion Transformers
Poster Session 3
Dehong Kong · Fan Li · Zhixin Wang · Jiaqi Xu · Renjing Pei · Wenbo Li · Wenqi Ren
|
ExHall D Poster #206 | |
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Poster Session 1
Yingmao Miao · Zhanpeng Huang · Rui Han · Zibin Wang · Chenhao Lin · Chao Shen
|
ExHall D Poster #18 | |
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection
Divya Velayudhan · Abdelfatah Ahmed · Mohamad Alansari · Neha Gour · Abderaouf Behouch · Taimur Hassan · Syed Talal Wasim · Nabil Maalej · Muzammal Naseer · Jürgen Gall · Mohammed Bennamoun · Ernesto Damiani · Naoufel Werghi
|
ExHall D Poster #472 | |
GroupMamba: Efficient Group-Based Visual State Space Model
Poster Session 3
Abdelrahman Shaker · Syed Talal Wasim · Salman Khan · Jürgen Gall · Fahad Shahbaz Khan
|
ExHall D Poster #407 | |
ReWind: Understanding Long Videos with Instructed Learnable Memory
Poster Session 3
Anxhelo Diko · Tinghuai Wang · Wassim Swaileh · Shiyan Sun · Ioannis Patras
|
ExHall D Poster #295 | |
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
Poster Session 2
Aashish Rai · Dilin Wang · Mihir Jain · Nikolaos Sarafianos · Kefan Chen · Srinath Sridhar · Aayush Prakash
|
ExHall D Poster #46 | |
IceDiff: High Resolution and High-Quality Arctic Sea Ice Forecasting with Generative Diffusion Prior
Poster Session 3
Jingyi Xu · Siwei Tu · Weidong Yang · Ben Fei · Shuhao Li · Keyi Liu · Yeqi Luo · Lipeng Ma · Lei Bai
|
ExHall D Poster #184 | |
Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2D
Poster Session 5
Jiawei Tan · Hongxing Wang · Junwu Weng · Jiaxin Li · Zhilong Ou · Kang Dang
|
ExHall D Poster #302 | |
Where's the Liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content
Poster Session 6
Haoyue Bai · Yiyou Sun · Wei Cheng · Haifeng Chen
|
ExHall D Poster #253 | |
VisionArena: 230k Real World User-VLM Conversations with Preference Labels
Poster Session 1
Christopher Chou · Lisa Dunlap · Wei-Lin Chiang · Koki Mashita · Krishna Mandal · Trevor Darrell · Ion Stoica · Joseph Gonzalez
|
ExHall D Poster #353 | |
Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation
Poster Session 2
Yudi Shi · Shangzhe Di · Qirui Chen · Weidi Xie
|
ExHall D Poster #300 | |
Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction
Poster Session 5
Dongxu Wei · Zhiqi Li · Peidong Liu
|
ExHall D Poster #121 | |
Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion
Poster Session 2
Xiangfeng Xu · Pinyi Zhang · Wenxuan Huang · Yunhang Shen · Haosheng Chen · Jingzhong Lin · Wei Li · Gaoqi He · Jiao Xie · Shaohui Lin
|
ExHall D Poster #424 | |
F-LMM: Grounding Frozen Large Multimodal Models
Poster Session 5
Size Wu · Sheng Jin · Wenwei Zhang · Lumin Xu · Wentao Liu · Wei Li · Chen Change Loy
|
ExHall D Poster #352 | |
Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video
Poster Session 3
Hoang Chuong Nguyen · Wei Mao · Jose M. Alvarez · Miaomiao Liu
|
ExHall D Poster #79 | |
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos
Poster Session 4
Felix Wimbauer · Weirong Chen · Dominik Muhle · Christian Rupprecht · Daniel Cremers
|
ExHall D Poster #85 | |
Image Generation Diversity Issues and How to Tame Them
Poster Session 1
Mischa Dombrowski · Weitong Zhang · Hadrien Reynaud · Sarah Cechnicka · Bernhard Kainz
|
ExHall D Poster #274 | |
Detail-Preserving Latent Diffusion for Stable Shadow Removal
Poster Session 2
Jiamin Xu · Yuxin Zheng · Zelong Li · Chi Wang · Renshu Gu · Weiwei Xu · Gang Xu
|
ExHall D Poster #212 | |
FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training
Poster Session 1
Anjia Cao · Xing Wei · Zhiheng Ma
|
ExHall D Poster #373 | |
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
Poster Session 2
Hongyu Li · Jinyu Chen · Ziyu Wei · Shaofei Huang · Tianrui Hui · Jialin Gao · Xiaoming Wei · Si Liu
|
ExHall D Poster #307 | |
OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
Hui Li · Mingwang Xu · Qingkun Su · Shan Mu · Li jiaye · Kaihui Cheng · Chen Yuxuan · Tan Chen · Mao Ye · Jingdong Wang · Siyu Zhu
|
ExHall D Poster #228 | |
AMO Sampler: Enhancing Text Rendering with Overshooting
Poster Session 3
Xixi Hu · Keyang Xu · Bo Liu · Hongliang Fei · Qiang Liu
|
ExHall D Poster #239 | |
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution
Poster Session 1
Huan Zheng · Wencheng Han · Jianbing Shen
|
ExHall D Poster #73 | |
TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions
Poster Session 3
Wang Yu-Hang · Junkang Guo · Aolei Liu · Kaihao Wang · Zaitong Wu · Zhenyu Liu · Wenfei Yin · Jian Liu
|
ExHall D Poster #462 | |
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
Poster Session 5
Hou-I Liu · Christine Wu · Jen-Hao Cheng · Wenhao Chai · Shian-yun Wang · Gaowen Liu · Hugo Latapie · Jhih-Ciang Wu · Jenq-Neng Hwang · Hong-Han Shuai · Wen-Huang Cheng
|
ExHall D Poster #116 | |
Geometric Knowledge-Guided Localized Global Distribution Alignment for Federated Learning
Poster Session 5
Yanbiao Ma · Wei Dai · Wenke Huang · Jiayi Chen
|
ExHall D Poster #443 | |
SEAL: Semantic Attention Learning for Long Video Representation
Poster Session 6
Lan Wang · Yujia Chen · Wen-Sheng Chu · Vishnu Naresh Boddeti · Du Tran
|
ExHall D Poster #268 | |
MERGE: Multi-faceted Hierarchical Graph-based GNN for Gene Expression Prediction from Whole Slide Histopathology Images
Poster Session 3
Aniruddha Ganguly · Debolina Chatterjee · Wentao Huang · Jie Zhang · Alisa Yurovsky · Travis Steele Johnson · Chao Chen
|
ExHall D Poster #475 | |
FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs
Poster Session 2
Xiaoqin Wang · Xusen Ma · Xianxu Hou · Meidan Ding · Yudong Li · Junliang Chen · Wenting Chen · Xiaoyang Peng · Linlin Shen
|
ExHall D Poster #361 | |
ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models
Poster Session 4
Yahan Tu · Rui Hu · Jitao Sang
|
ExHall D Poster #384 | |
Distilling Monocular Foundation Model for Fine-grained Depth Completion
Poster Session 5
Yingping Liang · Yutao Hu · Wenqi Shao · Ying Fu
|
ExHall D Poster #115 | |
RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting
Poster Session 4
Qiyu Dai · Xingyu Ni · Qianfan Shen · Mengyu Chu · Wenzheng Chen · Baoquan Chen
|
ExHall D Poster #29 | |
JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data
Poster Session 2
Runjian Chen · Wenqi Shao · Bo Zhang · Shaoshuai Shi · Li Jiang · Ping Luo
|
ExHall D Poster #136 | |
Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation
Poster Session 2
Hongmei Yin · Tingliang Feng · Fan Lyu · Fanhua Shang · Hongying Liu · Wei Feng · Liang Wan
|
ExHall D Poster #425 | |
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?
Poster Session 3
Fengxiang Wang · Hongzhen Wang · Zonghao Guo · Di Wang · Yulin Wang · Mingshuo Chen · Qiang Ma · Long Lan · Wenjing Yang · Jing Zhang · Zhiyuan Liu · Maosong Sun
|
ExHall D Poster #351 | |
Efficient Personalization of Quantized Diffusion Model without Backpropagation
Poster Session 2
Hoigi Seo · Wongi Jeong · Kyungryeol Lee · Se Young Chun
|
ExHall D Poster #225 | |
MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation
Poster Session 3
Aviral Chharia · Wenbo Gou · Haoye Dong
|
ExHall D Poster #91 | |
SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes
Poster Session 5
Weixiao Gao · Liangliang Nan · Hugo Ledoux
|
ExHall D Poster #329 | |
Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues
Poster Session 2
Youngjoon Jang · Haran Raajesh · Liliane Momeni · Gul Varol · Andrew Zisserman
|
ExHall D Poster #322 | |
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering
Poster Session 6
Md Mohaiminul Islam · Tushar Nagarajan · Huiyu Wang · Gedas Bertasius · Lorenzo Torresani
|
ExHall D Poster #282 | |
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
Poster Session 4
Xiang Gao · Shuai Yang · Jiaying Liu
|
ExHall D Poster #233 | |
Multi-subject Open-set Personalization in Video Generation
Poster Session 2
Tsai-Shien Chen · Aliaksandr Siarohin · Willi Menapace · Yuwei Fang · Kwot Sin Lee · Ivan Skorokhodov · Kfir Aberman · Jun-Yan Zhu · Ming-Hsuan Yang · Sergey Tulyakov
|
ExHall D Poster #63 | |
4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
Chaoyang Wang · Peiye Zhuang · Tuan Duc Ngo · Willi Menapace · Aliaksandr Siarohin · Michael Vasilkovsky · Ivan Skorokhodov · Sergey Tulyakov · Peter Wonka · Hsin-Ying Lee
|
ExHall D Poster #183 | |
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Poster Session 5
Sherwin Bahmani · Ivan Skorokhodov · Guocheng Qian · Aliaksandr Siarohin · Willi Menapace · Andrea Tagliasacchi · David B. Lindell · Sergey Tulyakov
|
ExHall D Poster #173 | |
Temporal Alignment-Free Video Matching for Few-shot Action Recognition
Poster Session 2
SuBeen Lee · WonJun Moon · Hyun Seok Seong · Jae-Pil Heo
|
ExHall D Poster #302 | |
Foveated Instance Segmentation
Poster Session 5
Hongyi Zeng · Wenxuan Liu · Tianhua Xia · Jinhui Chen · Ziyun Li · Sai Qian Zhang
|
ExHall D Poster #331 | |
Zero-Shot Head Swapping in Real-World Scenarios
Poster Session 3
Sohyun Jeong · Taewoong Kang · Hyojin Jang · Jaegul Choo
|
ExHall D Poster #14 | |
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Poster Session 4
Fei Xie · Jiahao Nie · Yujin Tang · Wenkang Zhang · Hongshen Zhao
|
ExHall D Poster #412 | |
VERA: Explainable Video Anomaly Detection via Verbalized Learning of Vision-Language Models
Poster Session 2
Muchao Ye · Weiyang Liu · Pan He
|
ExHall D Poster #316 | |
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis
Poster Session 5
Hyojun Go · Byeongjun Park · Jiho Jang · Jin-Young Kim · Soonwoo Kwon · Changick Kim
|
ExHall D Poster #45 | |
DRAWER: Digital Reconstruction and Articulation With Environment Realism
Poster Session 5
Hongchi Xia · Entong Su · Marius Memmel · Arhan Jain · Raymond Yu · Numfor Mbiziwo-Tiapo · Ali Farhadi · Abhishek Gupta · Shenlong Wang · Wei-Chiu Ma
|
ExHall D Poster #68 | |
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations
Poster Session 3
Weixi Feng · Chao Liu · Sifei Liu · William Yang Wang · Arash Vahdat · Weili Nie
|
ExHall D Poster #223 | |
Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds
Poster Session 1
Eitan Shaar · Ariel Shaulov · Gal Chechik · Lior Wolf
|
ExHall D Poster #285 | |
Learned Binocular-Encoding Optics for RGBD Imaging Using Joint Stereo and Focus Cues
Poster Session 4
Yuhui Liu · Liangxun Ou · Qiang Fu · Hadi Amata · Wolfgang Heidrich · Yifan Peng
|
ExHall D Poster #22 | |
Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Poster Session 1
Inho Kim · Youngkil Song · Jicheol Park · Won Hwa Kim · Suha Kwak
|
ExHall D Poster #283 | |
Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
Poster Session 5
Weimin Qiu · Jieke Wang · Meng Tang
|
ExHall D Poster #236 | |
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors
Poster Session 1
Wonbong Jang · Philippe Weinzaepfel · Vincent Leroy · Lourdes Agapito · Jerome Revaud
|
ExHall D Poster #84 | |
DropGaussian: Structural Regularization for Sparse-view Gaussian Splatting
Poster Session 5
Hyunwoo Park · Gun Ryu · Wonjun Kim
|
ExHall D Poster #52 | |
Open-World Amodal Appearance Completion
Poster Session 2
Jiayang Ao · Yanbei Jiang · Qiuhong Ke · Krista A. Ehinger
|
ExHall D Poster #106 | |
Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI
Poster Session 5
Won Jun Kim · Hyungjin Chung · Jaemin Kim · Sangmin Lee · Byeongsu Sim · Jong Chul Ye
|
ExHall D Poster #266 | |
ABBSPO: Adaptive Bounding Box Scaling and Symmetric Prior based Orientation Prediction for Detecting Aerial Image Objects
Poster Session 2
Woojin Lee · Hyugjae Chang · Jaeho Moon · Jaehyup Lee · Munchurl Kim
|
ExHall D Poster #332 | |
GeoAvatar: Geometrically-Consistent Multi-Person Avatar Reconstruction from Sparse Multi-View Videos
Poster Session 5
SooHyun Lee · SeoYeon Kim · HeeKyung Lee · Won-Sik Cheong · Joo Ho Lee
|
ExHall D Poster #9 | |
MoDec-GS: Global-to-Local Motion Decomposition and Temporal Interval Adjustment for Compact Dynamic 3D Gaussian Splatting
Poster Session 3
Sangwoon Kwak · Joonsoo Kim · Jun Young Jeong · Won-Sik Cheong · Jihyong Oh · Munchurl Kim
|
ExHall D Poster #65 | |
HOT: Hadamard-based Optimized Training
Poster Session 1
Seonggon Kim · Juncheol Shin · Seung-taek Woo · Eunhyeok Park
|
ExHall D Poster #442 | |
Sampling Innovation-Based Adaptive Compressive Sensing
Poster Session 1
Zhifu Tian · Tao Hu · Chaoyang Niu · Di Wu · Shu Wang
|
ExHall D Poster #209 | |
Scalable Autoregressive Monocular Depth Estimation
Poster Session 2
Jinhong Wang · Jintai Chen · Jian Liu · Dongqi Tang · Wentong Li · Weiqiang Wang · Danny Chen · Jian Wu
|
ExHall D Poster #79 | |
DiskVPS: Vanishing Point Detector via Hough Transform in a Disk Region
Poster Session 6
Jianping Wu
|
ExHall D Poster #86 | |
Minimal Interaction Separated Tuning: A New Paradigm for Visual Adaptation
Poster Session 5
Ningyuan Tang · Minghao Fu · Jianxin Wu
|
ExHall D Poster #398 | |
FreeScene: Mixed Graph Diffusion for 3D Scene Synthesis from Free Prompts
Poster Session 2
Tongyuan Bai · Wangyuanfan Bai · Dong Chen · Tieru Wu · Manyi Li · Rui Ma
|
ExHall D Poster #43 | |
DriveScape: High-Resolution Driving Video Generation by Multi-View Feature Fusion
Poster Session 4
Wei Wu · Xi Guo · Weixuan Tang · Tingxuan Huang · Chiyu Wang · Chenjing Ding
|
ExHall D Poster #131 | |
DistinctAD: Distinctive Audio Description Generation in Contexts
Bo Fang · Wenhao Wu · Qiangqiang Wu · YuXin Song · Antoni B. Chan
|
ExHall D Poster #279 | |
Towards Autonomous Micromobility through Scalable Urban Simulation
Wayne Wu · Honglin He · Chaoyuan Zhang · Jack He · Seth Z. Zhao · Ran Gong · Quanyi Li · Bolei Zhou
|
ExHall D Poster #133 | |
Synthetic Data is an Elegant GIFT for Continual Vision-Language Models
Poster Session 1
Bin Wu · Wuxuan Shi · Jinqiao Wang · Mang Ye
|
ExHall D Poster #254 | |
Event-Equalized Dense Video Captioning
Poster Session 2
Kangyi Wu · Pengna Li · Jingwen Fu · Yizhe Li · Yang Wu · Yuhan Liu · Jinjun Wang · Sanping Zhou
|
ExHall D Poster #291 | |
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Poster Session 5
Jingcheng Ni · Yuxin Guo · Yichen Liu · Rui Chen · Lewei Lu · Zehuan Wu
|
ExHall D Poster #127 | |
dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis
Poster Session 2
Luyuan Xie · Tianyu Luan · Wenyuan Cai · Guochen Yan · Zhaoyu Chen · Nan Xi · Yuejian Fang · Qingni Shen · Zhonghai Wu · Junsong Yuan
|
ExHall D Poster #460 | |
Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding
Poster Session 3
Andong Deng · Zhongpai Gao · Anwesa Choudhuri · Benjamin Planche · Meng Zheng · Bin Wang · Terrence Chen · Chen Chen · Ziyan Wu
|
ExHall D Poster #298 | |
3D-AVS: LiDAR-based 3D Auto-Vocabulary Segmentation
Poster Session 2
Weijie Wei · Osman Ülger · Fatemeh Karimi Nejadasl · Theo Gevers · Martin R. Oswald
|
ExHall D Poster #339 | |
SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks
Shining Wang · Yunlong Wang · Ruiqi Wu · Bingliang Jiao · Wenxuan Wang · Peng Wang
|
ExHall D Poster #102 | |
Learning Conditional Space-Time Prompt Distributions for Video Class-Incremental Learning
Xiaohan Zou · Wenchao Ma · Shu Zhao
|
ExHall D Poster #449 | |
Dual-Agent Optimization Framework for Cross-Domain Few-Shot Segmentation
Poster Session 2
Zhaoyang Li · Yuan Wang · Wangkai Li · Tianzhu Zhang · Xiang Liu
|
ExHall D Poster #426 | |
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction
Yuan Wang · Yali Li · Lixiang Li · Shengjin Wang
|
ExHall D Poster #171 | |
DFM: Differentiable Feature Matching for Anomaly Detection
Poster Session 3
Wu Sheng · Yimi Wang · Xudong Liu · Yuguang Yang · Runqi Wang · Guodong Guo · David Doermann · Baochang Zhang
|
ExHall D Poster #438 | |
Towards Precise Embodied Dialogue Localization via Causality Guided Diffusion
Poster Session 3
Haoyu Wang · Le Wang · Sanping Zhou · Jingyi Tian · Zheng Qin · Yabing Wang · Gang Hua · Wei Tang
|
ExHall D Poster #257 | |
CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation
Poster Session 5
Kai Fang · Anqi Zhang · Guangyu Gao · Jianbo Jiao · Chi Harold Liu · Yunchao Wei
|
ExHall D Poster #442 | |
BiM-VFI: Bidirectional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions
Poster Session 2
Wonyong Seo · Jihyong Oh · Munchurl Kim
|
ExHall D Poster #180 | |
BimArt: A Unified Approach for the Synthesis of 3D Bimanual Interaction with Articulated Objects
Poster Session 6
Wanyue Zhang · Rishabh Dabral · Vladislav Golyanik · Vasileios Choutas · Eduardo Alvarado · Thabo Beeler · Marc Habermann · Christian Theobalt
|
ExHall D Poster #146 | |
Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding
Poster Session 5
Jiaxin Shi · Mingyue Xiang · Hao Sun · Yixuan Huang · Zhi Weng
|
ExHall D Poster #338 | |
Link-based Contrastive Learning for One-Shot Unsupervised Domain Adaptation
Poster Session 1
Yue Zhang · Mingyue Bin · Yuyang Zhang · Zhongyuan Wang · Zhen Han · Chao Liang
|
ExHall D Poster #454 | |
MEGA: Masked Generative Autoencoder for Human Mesh Recovery
Poster Session 2
Guénolé Fiche · Simon Leglaive · Xavier Alameda-Pineda · Francesc Moreno-Noguer
|
ExHall D Poster #90 | |
Scene Map-based Prompt Tuning for Navigation Instruction Generation
Poster Session 2
Sheng Fan · Rui Liu · Wenguan Wang · Yi Yang
|
ExHall D Poster #146 | |
MMRL: Multi-Modal Representation Learning for Vision-Language Models
Poster Session 5
Yuncheng Guo · Xiaodong Gu
|
ExHall D Poster #380 | |
Label Shift Meets Online Learning: Ensuring Consistent Adaptation with Universal Dynamic Regret
Yucong Dai · Shilin Gu · Ruidong Fan · Chao Xu · Chenping Hou
|
ExHall D Poster #454 | |
ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation
Poster Session 4
Ali Athar · Xueqing Deng · Liang-Chieh Chen
|
ExHall D Poster #308 | |
All-directional Disparity Estimation for Real-world QPD Images
Hongtao Yu · Shaohui Song · Lihu Sun · Wenkai Su · Xiaodong Yang · Chengming Liu
|
ExHall D Poster #75 | |
Quad-Pixel Image Defocus Deblurring: A New Benchmark and Model
Poster Session 2
Hang Chen · Yin Xie · Xiaoxiu Peng · Lihu Sun · Wenkai Su · Xiaodong Yang · Chengming Liu
|
ExHall D Poster #25 | |
Simplification Is All You Need against Out-of-Distribution Overconfidence
Poster Session 1
Keke Tang · Chao Hou · Weilong Peng · Xiang Fang · Zhize Wu · Yongwei Nie · Wenping Wang · Zhihong Tian
|
ExHall D Poster #465 | |
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Poster Session 4
Jingfeng Yao · Bin Yang · Xinggang Wang
|
ExHall D Poster #371 | |
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation
Poster Session 3
Yongkang Li · Tianheng Cheng · Bin Feng · Wenyu Liu · Xinggang Wang
|
ExHall D Poster #416 | |
Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment
Poster Session 2
Huakai Lai · Guoxin Xiong · Huayu Mai · Xiang Liu · Tianzhu Zhang
|
ExHall D Poster #368 | |
MC^2: Multi-concept Guidance for Customized Multi-concept Generation
Poster Session 1
Jiaxiu Jiang · Yabo Zhang · Kailai Feng · Xiaohe Wu · Wenbo Li · Renjing Pei · Fan Li · Wangmeng Zuo
|
ExHall D Poster #253 | |
KAC: Kolmogorov-Arnold Classifier for Continual Learning
Yusong Hu · Zichen Liang · Fei Yang · Qibin Hou · Xialei Liu · Ming-Ming Cheng
|
ExHall D Poster #445 | |
Spherical Manifold Guided Diffusion Model for Panoramic Image Generation
Poster Session 2
Xiancheng Sun · Mai Xu · Shengxi Li · Senmao Ma · Xin Deng · Lai Jiang · Shen gang
|
ExHall D Poster #36 | |
A Unified Image-Dense Annotation Generation Model for Underwater Scenes
Poster Session 1
Hongkai Lin · Dingkang Liang · Zhenghao Qi · Xiang Bai
|
ExHall D Poster #74 | |
Decouple Distortion from Perception: Region Adaptive Diffusion for Extreme-low Bitrate Perception Image Compression
Poster Session 4
Jinchang Xu · Shaokang Wang · Jintao Chen · Zhe Li · Peidong Jia · Fei Zhao · Guoqing Xiang · Zhijian Hao · Shanghang Zhang · Xiaodong Xie
|
ExHall D Poster #214 | |
SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation
Poster Session 4
Leigang Qu · Haochuan Li · Wenjie Wang · Xiang Liu · Juncheng Li · Liqiang Nie · Tat-seng Chua
|
ExHall D Poster #260 | |
Towards Universal AI-Generated Image Detection by Variational Information Bottleneck Network
Poster Session 5
Haifeng Zhang · Qinghui He · Xiuli Bi · Weisheng Li · Bo Liu · Bin Xiao
|
ExHall D Poster #269 | |
VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding
Poster Session 5
Yujie Liang · Xiaobin Hu · Boyuan Jiang · Donghao Luo · Xu Peng · Kai Wu · Chengming Xu · Wenhui Han · Taisong Jin · Chengjie Wang · Rongrong Ji
|
ExHall D Poster #148 | |
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
Poster Session 4
Xiaoding Yuan · Shitao Tang · Kejie Li · Peng Wang
|
ExHall D Poster #54 | |
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Poster Session 6
Pengcheng Xu · Boyuan Jiang · Xiaobin Hu · Donghao Luo · Qingdong He · Jiangning Zhang · Chengjie Wang · Yunsheng Wu · Charles Ling · Boyu Wang
|
ExHall D Poster #221 | |
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Poster Session 1
Haoyang He · Jiangning Zhang · Yuxuan Cai · Hongxu Chen · Xiaobin Hu · Zhenye Gan · Yabiao Wang · Chengjie Wang · Yunsheng Wu · Lei Xie
|
ExHall D Poster #415 | |
SDBF: Steep-Decision-Boundary Fingerprinting for Hard-Label Tampering Detection of DNN Models
Poster Session 6
Xiaofan Bai · Shixin Li · Xiaojing Ma · Bin Benjamin Zhu · Dongmei Zhang · Linchen Yu
|
ExHall D Poster #299 | |
An Image-like Diffusion Method for Human-Object Interaction Detection
Poster Session 3
Xiaofei Hui · Haoxuan Qu · Hossein Rahmani · Jun Liu
|
ExHall D Poster #321 | |
M-LLM Based Video Frame Selection for Efficient Video Understanding
Poster Session 3
Kai Hu · Feng Gao · Xiaohan Nie · Peng Zhou · Son Dinh Tran · Tal Neiman · Lingyun Wang · Mubarak Shah · Raffay Hamid · Bing Yin · Trishul Chilimbi
|
ExHall D Poster #292 | |
Samba: A Unified Mamba-based Framework for General Salient Object Detection
Jiahao He · Keren Fu · Xiaohong Liu · Qijun Zhao
|
ExHall D Poster #408 | |
Activating Sparse Part Concepts for 3D Class Incremental Learning
Poster Session 6
Zhenya Tian · Jun Xiao · Liu Lupeng · Haiyong Jiang
|
ExHall D Poster #402 | |
Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning
Poster Session 5
Lei-Lei Ma · Shuo Xu · Ming-Kun Xie · Lei Wang · Dengdi Sun · Haifeng Zhao
|
ExHall D Poster #419 | |
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete
Poster Session 1
Yuheng Ji · Huajie Tan · Jiayu Shi · Xiaoshuai Hao · Yuan Zhang · Hengyuan Zhang · Pengwei Wang · Mengdi Zhao · Yao Mu · Pengju An · Xinda Xue · Qinghang Su · Huaihai Lyu · Xiaolong Zheng · Jiaming Liu · Zhongyuan Wang · Shanghang Zhang
|
ExHall D Poster #145 | |
Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction
Poster Session 5
Xiaolu Liu · Ruizi Yang · Song Wang · Wentong Li · Junbo Chen · Jianke Zhu
|
ExHall D Poster #125 | |
Robotic Visual Instruction
Poster Session 3
Yanbang Li · ZiYang Gong · Haoyang Li · Xiaoqi Huang · Haolan Kang · Guangpingbai · Xianzheng Ma
|
ExHall D Poster #145 | |
From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning
Poster Session 6
Ziang Li · Hongguang Zhang · Juan Wang · Meihui Chen · Hongxin Hu · Wenzhe Yi · Xiaoyang Xu · Mengda Yang · Chenjun Ma
|
ExHall D Poster #300 | |
From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration
Poster Session 2
Mingyang Song · Xiaoye Qu · Jiawei Zhou · Yu Cheng
|
ExHall D Poster #387 | |
TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering
Poster Session 1
Chun Gu · Xiaofei Wei · Li Zhang · Xiatian Zhu
|
ExHall D Poster #31 | |
Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels
Poster Session 1
Qiming Xia · Wenkai Lin · Haoen Xiang · Xun Huang · Siheng Chen · Zhen Dong · Cheng Wang · Chenglu Wen
|
ExHall D Poster #116 | |
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching
Poster Session 4
Bin Wang · Fan Wu · Linke Ouyang · Zhuangcheng Gu · Rui Zhang · Renqiu Xia · Botian Shi · Bo Zhang · Conghui He
|
ExHall D Poster #369 | |
RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories
Poster Session 4
Huiyang Shao · Xin Xia · Yuhong Yang · Ren Yuxi · Xing Wang · Xuefeng Xiao
|
ExHall D Poster #220 | |
Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks
Poster Session 6
Wei-Jin Huang · Yuan-Ming Li · Zhi-Wei Xia · Yu-Ming Tang · Kun-Yu Lin · Jian-Fang Hu · Wei-Shi Zheng
|
ExHall D Poster #155 | |
Rotation-Equivariant Self-Supervised Method in Image Denoising
Poster Session 3
Hanze Liu · Jiahong Fu · Qi Xie · Deyu Meng
|
ExHall D Poster #198 | |
MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities
Poster Session 6
Bizhu Wu · Jinheng Xie · Keming Shen · Zhe Kong · Jianfeng Ren · Ruibin Bai · Rong Qu · Linlin Shen
|
ExHall D Poster #160 | |
Convex Combination Star Shape Prior for Data-driven Image Semantic Segmentation
Poster Session 3
Xinyu Zhao · Jun Xie · Shengzhe Chen · Jun Liu
|
ExHall D Poster #327 | |
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
Poster Session 5
Junxi Chen · Junhao Dong · Xiaohua Xie
|
ExHall D Poster #265 | |
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
Poster Session 4
Fu Feng · Yucheng Xie · Xu Yang · Jing Wang · Xin Geng
|
ExHall D Poster #255 | |
WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models
Poster Session 1
Fu Feng · Yucheng Xie · Jing Wang · Xin Geng
|
ExHall D Poster #445 | |
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting
Poster Session 5
Jingyi Xu · Xieyuanli Chen · Junyi Ma · Jiawei Huang · Jintao Xu · Yue Wang · Ling Pei
|
ExHall D Poster #123 | |
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification
Poster Session 1
Jingwei Zhang · Anh Tien Nguyen · Xi Han · Vincent Quoc-Huy Trinh · Hong Qin · Dimitris Samaras · Mahdi Hosseini
|
ExHall D Poster #325 | |
TopoCellGen: Generating Histopathology Cell Topology with a Diffusion Model
Poster Session 5
Meilong Xu · Saumya Gupta · Xiaoling Hu · Chen Li · Shahira Abousamra · Dimitris Samaras · Prateek Prasanna · Chao Chen
|
ExHall D Poster #458 | |
MOS-Attack: A Scalable Multi-objective Adversarial Attack Framework
Poster Session 1
Ping Guo · Cheng Gong · Fei Liu · Xi Lin · Zhichao Lu · Qingfu Zhang · Zhenkun Wang
|
ExHall D Poster #466 | |
MambaOut: Do We Really Need Mamba for Vision?
Poster Session 1
Weihao Yu · Xinchao Wang
|
ExHall D Poster #414 | |
3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement
Poster Session 4
Yihang Luo · Shangchen Zhou · Yushi Lan · Xingang Pan · Chen Change Loy
|
ExHall D Poster #56 | |
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Poster Session 1
Dubing Chen · Huan Zheng · Jin Fang · Xingping Dong · Xianfei Li · Wenlong Liao · Tao He · Pai Peng · Jianbing Shen
|
ExHall D Poster #125 | |
Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation
Poster Session 6
Chuandong Liu · Xingxing Weng · Shuguo Jiang · Pengcheng Li · Lei Yu · Gui-Song Xia
|
ExHall D Poster #117 | |
HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models
Poster Session 2
Mingzhen Huang · Fu-Jen Chu · Bugra Tekin · Kevin Liang · Haoyu Ma · Weiyao Wang · Xingyu Chen · Pierre Gleize · Hongfei Xue · Siwei Lyu · Kris Kitani · Matt Feiszli · Hao Tang
|
ExHall D Poster #170 | |
Science-T2I: Addressing Scientific Illusions in Image Synthesis
Poster Session 1
Jialuo Li · Wenhao Chai · Xingyu Fu · Haiyang Xu · Saining Xie
|
ExHall D Poster #247 | |
EAP-GS: Efficient Augmentation of Pointcloud for 3D Gaussian Splatting in Few-shot Scene Reconstruction
Poster Session 4
Dongrui Dai · Yuxiang Xing
|
ExHall D Poster #63 | |
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment
Poster Session 5
Yan Li · Yifei Xing · Xiangyuan Lan · Xin Li · Haifeng Chen · Dongmei Jiang
|
ExHall D Poster #358 | |
V2X-R: Cooperative LiDAR-4D Radar Fusion with Denoising Diffusion for 3D Object Detection
Poster Session 6
Xun Huang · Jinlong Wang · Qiming Xia · Siheng Chen · Bisheng Yang · Xin Li · Cheng Wang · Chenglu Wen
|
ExHall D Poster #118 | |
CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-Resolution
Poster Session 4
Xin Liu · Jie Liu · Jie Tang · Gangshan Wu
|
ExHall D Poster #200 | |
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning
Poster Session 5
Yuheng Xu · Shijie Yang · Xin Liu · Jie Liu · Jie Tang · Gangshan Wu
|
ExHall D Poster #197 | |
DyMO: Training-Free Diffusion Model Alignment with Dynamic Multi-Objective Scheduling
Poster Session 3
Xin Xie · Dong Gong
|
ExHall D Poster #245 | |
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye · Burhaneddin Yaman · Sheng Cheng · Feng Tao · Abhirup Mallik · Liu Ren
|
ExHall D Poster #124 | |
DNF: Unconditional 4D Generation with Dictionary-based Neural Fields
Poster Session 6
Xinyi Zhang · Naiqi Li · Angela Dai
|
ExHall D Poster #12 | |
Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model
Poster Session 5
Shuyun Wang · Hu Zhang · Xin Shen · Dadong Wang · Xin Yu
|
ExHall D Poster #182 | |
M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings
Poster Session 3
Qingzheng Xu · Ru Cao · Xin Shen · Heming Du · Sen Wang · Xin Yu
|
ExHall D Poster #157 | |
AKiRa: Augmentation Kit on Rays for Optical Video Generation
Poster Session 1
Xi Wang · Robin Courant · Marc Christie · Vicky Kalogeiton
|
ExHall D Poster #234 | |
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval
Poster Session 3
Yuanmin Tang · Jue Zhang · Xiaoting Qin · Jing Yu · Gaopeng Gou · Gang Xiong · Qingwei Lin · Saravan Rajmohan · Dongmei Zhang · Qi Wu
|
ExHall D Poster #359 | |
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Poster Session 1
Hongjun Wang · Wonmin Byeon · Jiarui Xu · Jinwei Gu · Ka Chun Cheung · Jan Kautz · Xiaolong Wang · Kai Han · Sifei Liu
|
ExHall D Poster #413 | |
PointSR: Self-Regularized Point Supervision for Drone-View Object Detection
Poster Session 3
Weizhuo Li · Yue Xi · Wenjing Jia · Zehao Zhang · Fei Li · Xiangzeng Liu · Qiguang Miao
|
ExHall D Poster #102 | |
Towards RAW Object Detection in Diverse Conditions
Zhong-Yu Li · Xin Jin · Bo-Yuan Sun · Chun-Le Guo · Ming-Ming Cheng
|
ExHall D Poster #333 | |
SCAP: Transductive Test-Time Adaptation via Supportive Clique-based Attribute Prompting
Poster Session 6
Chenyu Zhang · Kunlun Xu · Zichen Liu · Yuxin Peng · Jiahuan Zhou
|
ExHall D Poster #371 | |
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Poster Session 6
Vikash Sehwag · Xianghao Kong · Jingtao Li · Michael Spranger · Lingjuan Lyu
|
ExHall D Poster #232 | |
WonderWorld: Interactive 3D Scene Generation from a Single Image
Hong-Xing Yu · Haoyi Duan · Charles Herrmann · William Freeman · Jiajun Wu
|
ExHall D Poster #45 | |
POMP: Physics-constrainable Motion Generative Model through Phase Manifolds
Poster Session 5
Bin Ji · Ye Pan · Zhimeng Liu · Shuai Tan · Xiaogang Jin · Xiaokang Yang
|
ExHall D Poster #155 | |
Hearing Anywhere in Any Environment
Poster Session 2
Xiulong Liu · Anurag Kumar · Paul Calamia · Sebastia Vicenc Amengual Gari · Calvin Murdock · Ishwarya Ananthabhotla · Philip W Robinson · Eli Shlizerman · Vamsi Krishna Ithapu · Ruohan Gao
|
ExHall D Poster #27 | |
ICP: Immediate Compensation Pruning for Mid-to-high Sparsity
Xin Luo · Fu Xueming · Zihang Jiang · S Kevin Zhou
|
ExHall D Poster #392 | |
Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation
Poster Session 4
Yueru Jia · Jiaming Liu · Sixiang Chen · Chenyang Gu · Zhilve Wang · Xiaoqi Li · Longzan Luo · Pengwei Wang · Renrui Zhang · Zhongyuan Wang · Shanghang Zhang
|
ExHall D Poster #149 | |
PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes
Bin Tan · Rui Yu · Yujun Shen · Nan Xue
|
ExHall D Poster #95 | |
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Yiheng Li · RuiBing Hou · Hong Chang · Shiguang Shan · Xilin Chen
|
ExHall D Poster #156 | |
GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing
Poster Session 6
Tong Wang · Ting Liu · Xiaochao Qu · Wu Chengjing · Luoqi Liu · Xiaolin Hu
|
ExHall D Poster #225 | |
UniNet: A Contrastive Learning-guided Unified Framework with Feature Selection for Anomaly Detection
Poster Session 2
Shun Wei · Jielin Jiang · Xiaolong Xu
|
ExHall D Poster #440 | |
Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map
Xinyuan Chang · Maixuan Xue · Xinran Liu · Zheng Pan · Xing Wei
|
ExHall D Poster #139 | |
Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification
Jiayu Jiang · Changxing Ding · Wentao Tan · Junhong Wang · Jin Tao · Xiangmin Xu
|
ExHall D Poster #367 | |
Optimizing for the Shortest Path in Denoising Diffusion Model
Ping Chen · Xingpeng Zhang · Zhaoxiang Liu · Huan Hu · Xiang Liu · Kai Wang · Min Wang · Yanlin Qian · Shiguo Lian
|
ExHall D Poster #211 | |
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
Poster Session 1
Datao Tang · Xiangyong Cao · Xuan Wu · Jialin Li · Jing Yao · Xueru Bai · Dongsheng Jiang · Yin Li · Deyu Meng
|
ExHall D Poster #328 | |
Learning Partonomic 3D Reconstruction from Image Collections
Poster Session 6
Xiaoqian Ruan · Pei Yu · Dian Jia · Hyeonjeong Park · Peixi Xiong · Wei Tang
|
ExHall D Poster #56 | |
MDP: Multidimensional Vision Model Pruning with Latency Constraint
Poster Session 4
Xinglong Sun · Barath Lakshmanan · Maying Shen · Shiyi Lan · Jingde Chen · Jose M. Alvarez
|
ExHall D Poster #411 | |
3D Dental Model Segmentation with Geometrical Boundary Preserving
Poster Session 2
Shufan Xi · Zexian Liu · Junlin Chang · Hongyu Wu · Xiaogang Wang · Aimin Hao
|
ExHall D Poster #486 | |
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Poster Session 2
Ruicheng Wang · Sicheng Xu · Cassie Lee Dai · Jianfeng Xiang · Yu Deng · Xin Tong · Jiaolong Yang
|
ExHall D Poster #113 | |
Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries
Poster Session 1
Wei Xu · Charlie Wagner · Junjie Luo · Qi Guo
|
ExHall D Poster #25 | |
Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment
Poster Session 6
Ziteng Cui · Xuangeng Chu · Tatsuya Harada
|
ExHall D Poster #27 | |
PMNI: Pose-free Multi-view Normal Integration for Reflective and Textureless Surface Reconstruction
Poster Session 6
Mingzhi Pei · Xu Cao · Xiangyi Wang · Heng Guo · Zhanyu Ma
|
ExHall D Poster #66 | |
Segment Any Motion in Videos
Poster Session 1
Nan Huang · Wenzhao Zheng · Chenfeng Xu · Kurt Keutzer · Shanghang Zhang · Angjoo Kanazawa · Qianqian Wang
|
ExHall D Poster #309 | |
Bridging Viewpoint Gaps: Geometric Reasoning Boosts Semantic Correspondence
Poster Session 3
Qiyang Qian · Hansheng Chen · Masayoshi Tomizuka · Kurt Keutzer · Qianqian Wang · Chenfeng Xu
|
ExHall D Poster #90 | |
SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures
Poster Session 6
Hui Liu · Chen Jia · Fan Shi · Xu Cheng · Shengyong Chen
|
ExHall D Poster #311 | |
GenFusion: Closing the Loop between Reconstruction and Generation via Videos
Poster Session 2
Sibo Wu · Congrong Xu · Binbin Huang · Andreas Geiger · Anpei Chen
|
ExHall D Poster #61 | |
Image Quality Assessment: Investigating Causal Perceptual Effects with Abductive Counterfactual Inference
Poster Session 4
Wenhao Shen · Mingliang Zhou · Yu Chen · Xuekai Wei · Yong Feng · Huayan Pu · Weijia Jia
|
ExHall D Poster #208 | |
MIMO: A Medical Vision Language Model with Visual Referring Multimodal Input and Pixel Grounding Multimodal Output
Poster Session 5
Yanyuan Chen · Dexuan Xu · Yu Huang · Songkun Zhan · Hanpin Wang · Dongxue Chen · Xueping Wang · Meikang Qiu · Hang Li
|
ExHall D Poster #354 | |
StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
Poster Session 5
Mingkun Lei · Xue Song · Beier Zhu · Hao Wang · Chi Zhang
|
ExHall D Poster #228 | |
GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction
Poster Session 5
Jinguang Tong · Xuesong Li · Fahira Afzal Maken · Sundaram Muthu · Lars Petersson · Chuong Nguyen · Hongdong Li
|
ExHall D Poster #47 | |
Coherent 3D Portrait Video Reconstruction via Triplane Fusion
Poster Session 3
Shengze Wang · Xueting Li · Chao Liu · Matthew Chan · Michael Stengel · Henry Fuchs · Shalini De Mello · Koki Nagano
|
ExHall D Poster #6 | |
CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation
Poster Session 4
Jiahao Li · Weijian Ma · Xueyang Li · Yunzhong Lou · Guichun Zhou · Xiangdong Zhou
|
ExHall D Poster #266 | |
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Poster Session 5
Tianwei Yin · Qiang Zhang · Richard Zhang · William Freeman · Fredo Durand · Eli Shechtman · Xun Huang
|
ExHall D Poster #181 | |
Logits DeConfusion with CLIP for Few-Shot Learning
Poster Session 5
Shuo Li · Fang Liu · Zehua Hao · Xinyi Wang · Lingling Li · Xu Liu · Puhua Chen · Wenping Ma
|
ExHall D Poster #417 | |
Bridging Gait Recognition and Large Language Models Sequence Modeling
Poster Session 1
Shaopeng Yang · Jilong Wang · Saihui Hou · Xu Liu · Chunshui Cao · Liang Wang · Yongzhen Huang
|
ExHall D Poster #314 | |
Unbiased Video Scene Graph Generation via Visual and Semantic Dual Debiasing
Poster Session 4
Yanjun Li · Zhaoyang Li · Honghui Chen · Li'Zhi Xu
|
ExHall D Poster #310 | |
DiffCAM: Data-Driven Saliency Maps by Capturing Feature Differences
Xingjian Li · Qiming Zhao · Neelesh Bisht · Mostofa Uddin Uddin · Jin Yu Kim · Bryan Zhang · Min Xu
|
ExHall D Poster #472 | |
G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation
Poster Session 1
Tianxing Chen · Yao Mu · Zhixuan Liang · Zanxin Chen · Shijia Peng · Qiangyu Chen · Mingkun Xu · Ruizhen Hu · Hongyuan Zhang · Xuelong Li · Ping Luo
|
ExHall D Poster #146 | |
HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset
Poster Session 1
Zedong Chu · Feng Xiong · Meiduo Liu · Jinzhi Zhang · Mingqi Shao · Zhaoxu Sun · Di Wang · Mu Xu
|
ExHall D Poster #13 | |
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching
Poster Session 5
Hualie Jiang · Zhiqiang Lou · Laiyan Ding · Rui Xu · Minglang Tan · jerett · Rui Huang
|
ExHall D Poster #77 | |
Language-Guided Audio-Visual Learning for Long-Term Sports Assessment
Poster Session 5
Huangbiao Xu · Xiao Ke · Huanqi Wu · Rui Xu · Yuezhou Li · Wenzhong Guo
|
ExHall D Poster #282 | |
VidTwin: Video VAE with Decoupled Structure and Dynamics
Poster Session 5
Yuchi Wang · Junliang Guo · Xinyi Xie · Tianyu He · Xu Sun · Jiang Bian
|
ExHall D Poster #177 | |
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Poster Session 6
Jianing Yang · Xuweiyi Chen · Nikhil Madaan · Madhavan Iyengar · Shengyi Qian · David Fouhey · Joyce Chai
|
ExHall D Poster #320 | |
REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning
Poster Session 2
Jihyun Lee · Weipeng Xu · Alexander Richard · Shih-En Wei · Shunsuke Saito · Shaojie Bai · Te-Li Wang · Minhyuk Sung · Tae-Kyun Kim · Jason Saragih
|
ExHall D Poster #166 | |
EventFly: Event Camera Perception from Ground to the Sky
Poster Session 1
Lingdong Kong · Dongyue Lu · Xiang Xu · Lai Xing Ng · Wei Tsang Ooi · Benoit Cottereau
|
ExHall D Poster #122 | |
Self-Learning Hyperspectral and Multispectral Image Fusion via Adaptive Residual Guided Subspace Diffusion Model
Poster Session 4
Jian Zhu · He Wang · Yang Xu · Zebin Wu · Zhihui Wei
|
ExHall D Poster #196 | |
Semantic and Sequential Alignment for Referring Video Object Segmentation
Poster Session 4
Feiyu Pan · Hao Fang · Fangkai Li · Yanyu Xu · Yawei Li · Luca Benini · Xiankai Lu
|
ExHall D Poster #312 | |
Minding Fuzzy Regions: A Data-driven Alternating Learning Paradigm for Stable Lesion Segmentation
Poster Session 2
Lexin Fang · Yunyang Xu · Xiang Ma · Xuemei Li · Caiming Zhang
|
ExHall D Poster #481 | |
DropoutGS: Dropping Out Gaussians for Better Sparse-view Rendering
Poster Session 1
Yexing Xu · Longguang Wang · Minglin Chen · Sheng Ao · Li Li · Yulan Guo
|
ExHall D Poster #50 | |
Adversarial Domain Prompt Tuning and Generation for Single Domain Generalization
Poster Session 4
Zhipeng Xu · De Cheng · Xinyang Jiang · Nannan Wang · Dongsheng Li · Xinbo Gao
|
ExHall D Poster #268 | |
AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
Poster Session 1
Li Lin · Santosh Santosh · Mingyang Wu · Xin Wang · Shu Hu
|
ExHall D Poster #318 | |
3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations
Poster Session 5
Yating Wang · Xuan Wang · Ran Yi · Yanbo Fan · Jichen Hu · Jingcheng Zhu · Lizhuang Ma
|
ExHall D Poster #7 | |
Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction
Poster Session 2
Xia Wenke · Ruoxuan Feng · Dong Wang · Di Hu
|
ExHall D Poster #154 | |
Reconstructing Humans with a Biomechanically Accurate Skeleton
Poster Session 2
Yan Xia · Xiaowei Zhou · Etienne Vouga · Qixing Huang · Georgios Pavlakos
|
ExHall D Poster #91 | |
Glossy Object Reconstruction with Cost-effective Polarized Acquisition
Bojian Wu · Yifan Peng · Ruizhen Hu · Xiaowei Zhou
|
ExHall D Poster #24 | |
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
Poster Session 2
Weiyu Li · Jiarui Liu · Hongyu Yan · Rui Chen · Yixun Liang · Xuelin Chen · Ping Tan · Xiaoxiao Long
|
ExHall D Poster #40 | |
Generative Sparse-View Gaussian Splatting
Poster Session 6
Hanyang Kong · Xingyi Yang · Xinchao Wang
|
ExHall D Poster #58 | |
ProjAttacker: A Configurable Physical Adversarial Attack for Face Recognition via Projector
Poster Session 5
Yuanwei Liu · Hui Wei · Chengyu Jia · Ruqi Xiao · Weijian Ruan · Xingxing Wei · Joey Tianyi Zhou · Zheng Wang
|
ExHall D Poster #19 | |
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems
Poster Session 5
Xiangyuan Xue · Zeyu Lu · Di Huang · ZiDong Wang · Wanli Ouyang · Lei Bai
|
ExHall D Poster #343 | |
Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features
Yuanbo Xiangli · Ruojin Cai · Hanyu Chen · Jeffrey Byrne · Noah Snavely
|
ExHall D Poster #97 | |
ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting
Poster Session 6
Guo Junfu · Yu Xin · Gaoyi Liu · Kai Xu · Ligang Liu · Ruizhen Hu
|
ExHall D Poster #95 | |
Hash3D: Training-free Acceleration for 3D Generation
Poster Session 5
Xingyi Yang · Songhua Liu · Xinchao Wang
|
ExHall D Poster #41 | |
Progressive Focused Transformer for Single Image Super-Resolution
Poster Session 1
Wei Long · Xingyu Zhou · Leheng Zhang · Shuhang Gu
|
ExHall D Poster #199 | |
Learned Image Compression with Dictionary-based Entropy Model
Poster Session 3
Jingbo Lu · Leheng Zhang · Xingyu Zhou · Mu Li · Wen Li · Shuhang Gu
|
ExHall D Poster #210 | |
Towards Understanding How Knowledge Evolves in Large Vision-Language Models
Poster Session 6
Sudong Wang · Yunjian Zhang · Yao Zhu · Jianing Li · Zizhe Wang · Yanwei Liu · Xiangyang Ji
|
ExHall D Poster #355 | |
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution
Poster Session 4
Shian Du · Menghan Xia · Chang Liu · Xintao Wang · Jing Wang · Pengfei Wan · Di Zhang · Xiangyang Ji
|
ExHall D Poster #190 | |
Dynamic Group Normalization: Spatio-Temporal Adaptation to Evolving Data Statistics
Poster Session 6
Yair Smadar · Assaf Hoogi
|
ExHall D Poster #385 | |
SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance
Poster Session 4
Peishan Cong · Ziyi Wang · Yuexin Ma · Xiangyu Yue
|
ExHall D Poster #168 | |
RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models
Poster Session 3
Haoran Hao · Jiaming Han · Changsheng Li · Yu-Feng Li · Xiangyu Yue
|
ExHall D Poster #371 | |
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
Poster Session 6
Chen Tang · Xinzhu Ma · Encheng Su · Xiufeng Song · Xiaohong Liu · Wei-Hong Li · Lei Bai · Wanli Ouyang · Xiangyu Yue
|
ExHall D Poster #293 | |
AMR-Transformer: Enabling Efficient Long-range Interaction for Complex Neural Fluid Simulation
Poster Session 2
Zeyi Xu · Jinfan Liu · Kuangxu Chen · Ye Chen · Zhangli Hu · Bingbing Ni
|
ExHall D Poster #34 | |
AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward
Poster Session 5
Haonan Han · Xiangzuo Wu · Huan Liao · Zunnan Xu · Zhongyuan Hu · Ronghui Li · Yachao Zhang · Xiu Li
|
ExHall D Poster #160 | |
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
Poster Session 1
David Junhao Zhang · Roni Paiss · Shiran Zada · Nikhil Karnad · David E. Jacobs · Yael Pritch · Inbar Mosseri · Mike Zheng Shou · Neal Wadhwa · Nataniel Ruiz
|
ExHall D Poster #177 | |
SketchAgent: Language-Driven Sequential Sketch Generation
Poster Session 5
Yael Vinker · Tamar Rott Shaham · Kristine Zheng · Alex Zhao · Judith Fan · Antonio Torralba
|
ExHall D Poster #220 | |
HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting
Poster Session 6
Xinpeng Liu · Zeyi Huang · Fumio Okura · Yasuyuki Matsushita
|
ExHall D Poster #54 | |
CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization
Poster Session 2
Junhao Xu · Yanan Zhang · Zhi Cai · Di Huang
|
ExHall D Poster #140 | |
MNE-SLAM: Multi-Agent Neural SLAM for Mobile Robots
Poster Session 1
Tianchen Deng · Guole Shen · Chen Xun · Shenghai Yuan · Tongxing Jin · Hongming Shen · Yanbo Wang · Jingchuan Wang · Hesheng Wang · Danwei Wang · Weidong Chen
|
ExHall D Poster #123 | |
HuMoCon: Concept Discovery for Human Motion Understanding
Poster Session 2
Qihang Fang · Chengcheng Tang · Bugra Tekin · Shugao Ma · Yanchao Yang
|
ExHall D Poster #174 | |
Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic
Poster Session 1
Jianwei Tang · Hong Yang · Tengyue Chen · Jian-Fang Hu
|
ExHall D Poster #159 | |
Hierarchical Flow Diffusion for Efficient Frame Interpolation
Poster Session 5
Yang Hai · Guo Wang · Tan Su · jerett · Yinlin Hu
|
ExHall D Poster #179 | |
Multi-party Collaborative Attention Control for Image Customization
Poster Session 2
Han Yang · Chuanguang Yang · Qiuli Wang · Zhulin An · Weilun Feng · Libo Huang · Yongjun Xu
|
ExHall D Poster #246 | |
Seek Common Ground While Reserving Differences: Semi-Supervised Image-Text Sentiment Recognition
Poster Session 6
Wuyou Xia · Guoli Jia · Sicheng Zhao · Jufeng Yang
|
ExHall D Poster #330 | |
ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning
Haoyuan Yang · Xiaoou Li · Jiaming Lv · Xianjun Cheng · Qilong Wang · Peihua Li
|
ExHall D Poster #370 | |
TraF-Align: Trajectory-aware Feature Alignment for Asynchronous Multi-agent Perception
Poster Session 3
Zhiying Song · Lei Yang · Fuxi Wen · Jun Li
|
ExHall D Poster #135 | |
ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object
Poster Session 1
Zhe Shan · Yang Liu · Lei Zhou · Cheng Yan · Heng Wang · Xia Xie
|
ExHall D Poster #329 | |
Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model
Poster Session 2
Longrong Yang · Dong Shen · Chaoxiang Cai · Kaibing Chen · Fan Yang · Tingting Gao · Di Zhang · Xi Li
|
ExHall D Poster #384 | |
SEC-Prompt: SEmantic Complementary Prompting for Few-Shot Class-Incremental Learning
Poster Session 5
Ye Liu · Meng Yang
|
ExHall D Poster #440 | |
GaussianSpa: An “Optimizing-Sparsifying” Simplification Framework for Compact and High-Quality 3D Gaussian Splatting
Poster Session 6
Yangming Zhang · Wenqi Jia · Wei Niu · Miao Yin
|
ExHall D Poster #49 | |
Person De-reidentification: A Variation-guided Identity Shift Modeling
Poster Session 6
Yi-Xing Peng · Yu-Ming Tang · Kun-Yu Lin · Qize Yang · Jingke Meng · Xihan Wei · Wei-Shi Zheng
|
ExHall D Poster #304 | |
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects
Poster Session 5
Lei Fan · Dongdong Fan · Zhiguang Hu · Yiwen Ding · Donglin Di · Kai Yi · Maurice Pagnucco · Yang Song
|
ExHall D Poster #428 | |
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
Poster Session 4
Keda Tao · Can Qin · Haoxuan You · Yang Sui · Huan Wang
|
ExHall D Poster #305 | |
RORem: Training a Robust Object Remover with Human-in-the-Loop
Poster Session 3
Ruibin Li · Tao Yang · Song Guo · Lei Zhang
|
ExHall D Poster #323 | |
Leveraging Perturbation Robustness to Enhance Out-of-Distribution Detection
Poster Session 1
Wenxi Chen · Raymond A. Yeh · Shaoshuai Mou · Yan Gu
|
ExHall D Poster #436 | |
MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning
Poster Session 5
Wenhao Gu · Li Gu · Ching Suen · Yang Wang
|
ExHall D Poster #233 | |
Progress-Aware Video Frame Captioning
Poster Session 3
Zihui Xue · Joungbin An · Xitong Yang · Kristen Grauman
|
ExHall D Poster #285 | |
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
Poster Session 5
Luo · Xue Yang · Wenhan Dou · Zhaokai Wang · Jiawen Liu · Jifeng Dai · Yu Qiao · Xizhou Zhu
|
ExHall D Poster #375 | |
Rectification-specific Supervision and Constrained Estimator for Online Stereo Rectification
Poster Session 5
Rui Gong · Kim-Hui Yap · Weide Liu · Xulei Yang · Jun Cheng
|
ExHall D Poster #124 | |
SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
Poster Session 1
Rong Li · Shijie Li · Lingdong Kong · Xulei Yang · Junwei Liang
|
ExHall D Poster #337 | |
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Poster Session 1
You Li · Fan Ma · Yi Yang
|
ExHall D Poster #363 | |
DiffVsgg: Diffusion-Driven Online Video Scene Graph Generation
Poster Session 6
Mu Chen · Liulei Li · Wenguan Wang · Yi Yang
|
ExHall D Poster #288 | |
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
Poster Session 5
Chao Wang · Hehe Fan · Huichen Yang · Sarvnaz Karimi · Lina Yao · Yi Yang
|
ExHall D Poster #237 | |
Multimodal Autoregressive Pre-training of Large Vision Encoders
Enrico Fini · Mustafa Shukor · Xiujun Li · Philipp Dufter · Michal Klein · David Haldimann · Sai Aitharaju · Victor Guilherme Turrisi da Costa · Louis Béthune · Zhe Gan · Alexander Toshev · Marcin Eichner · Moin Nabi · Yinfei Yang · Joshua Susskind · Alaaeldin El-Nouby
|
ExHall D Poster #407 | |
Efficient Diffusion as Low Light Enhancer
Poster Session 5
Guanzhou Lan · Qianli Ma · Yuqi Yang · Zhigang Wang · Dong Wang · Xuelong Li · Bin Zhao
|
ExHall D Poster #22 | |
ADD: Attribution-Driven Data Augmentation Framework for Boosting Image Super-Resolution
Poster Session 5
Zeyu Mi · Yu-Bin Yang
|
ExHall D Poster #194 | |
MaRI: Material Retrieval Integration across Domains
Poster Session 2
Jianhui Wang · Zhifei Yang · Yangfan He · Huixiong Zhang · Yuxuan Chen · Jingwei Huang
|
ExHall D Poster #35 | |
Gain from Neighbors: Boosting Model Robustness in the Wild via Adversarial Perturbations Toward Neighboring Classes
Poster Session 5
Zhou Yang · Mingtao Feng · Tao Huang · Fangfang Wu · Weisheng Dong · Xin Li · Guangming Shi
|
ExHall D Poster #426 | |
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
Poster Session 2
Wenyi Hong · Yean Cheng · Zhuoyi Yang · Weihan Wang · Lefan Wang · Xiaotao Gu · Shiyu Huang · Yuxiao Dong · Jie Tang
|
ExHall D Poster #294 | |
Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs
Youyi Zhan · Tianjia Shao · Yin Yang · Kun Zhou
|
ExHall D Poster #9 | |
High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model
Yiyang Shen · Kun Zhou · He Wang · Yin Yang · Tianjia Shao
|
ExHall D Poster #48 | |
EnliveningGS: Active Locomotion of 3DGS
Poster Session 1
Siyuan Shen · Tianjia Shao · Kun Zhou · Chenfanfu Jiang · Yin Yang
|
ExHall D Poster #68 | |
ARM: Appearance Reconstruction Model for Relightable 3D Generation
Xiang Feng · Chang Yu · Zoubin Bi · Yintong Shang · Feng Gao · Hongzhi Wu · Kun Zhou · Chenfanfu Jiang · Yin Yang
|
ExHall D Poster #36 | |
RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments
Poster Session 6
Haisheng Su · Feixiang Song · CONG MA · Wei Wu · Junchi Yan
|
ExHall D Poster #123 | |
UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection
Poster Session 1
Xin Jin · Haisheng Su · Kai Liu · CONG MA · Wei Wu · Fei HUI · Junchi Yan
|
ExHall D Poster #115 | |
High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model
Poster Session 1
Mingtao Guo · Guanyu Xing · Yanli Liu
|
ExHall D Poster #6 | |
RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training
Raktim Gautam Goswami · Prashanth Krishnamurthy · Yann LeCun · Farshad Khorrami
|
ExHall D Poster #149 | |
Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation
Poster Session 4
Tal Zeevi · Ravid Shwartz-Ziv · Yann LeCun · Lawrence Staib · John A Onofrey
|
ExHall D Poster #471 | |
AA-CLIP: Enhancing Zero-Shot Anomaly Detection via Anomaly-Aware CLIP
Poster Session 1
wenxin ma · Xu Zhang · Qingsong Yao · Fenghe Tang · Chenxu Wu · Yingtai Li · Rui Yan · Zihang Jiang · S Kevin Zhou
|
ExHall D Poster #438 | |
NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics
Poster Session 1
Kun Yang · Yuxiang Liu · Zeyu Cui · Yu Liu · Maojun Zhang · Shen Yan · Qing Wang
|
ExHall D Poster #49 | |
Deep Change Monitoring: A Hyperbolic Representative Learning Framework and a Dataset for Long-term Fine-grained Tree Change Detection
Yante Li · Hanwen Qi · Haoyu Chen · Liang Xinlian · Guoying Zhao
|
ExHall D Poster #114 | |
Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering
Poster Session 2
Liang Chen · Zhe Xue · Yawen Li · Meiyu Liang · Yan Wang · Anton van den Hengel · Yuankai Qi
|
ExHall D Poster #469 | |
Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization
Poster Session 2
lingyun zhang · Yu Xie · Yanwei Fu · Ping Chen
|
ExHall D Poster #267 | |
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training
Jierun Chen · Dongting Hu · Xijie Huang · Huseyin Coskun · Arpit Sahni · Aarush Gupta · Anujraaj Goyal · Dishani Lahiri · Rajesh Singh · Yerlan Idelbayev · Junli Cao · Yanyu Li · Kwang-Ting Cheng · Mingming Gong · S.-H. Gary Chan · Sergey Tulyakov · Anil Kag · Yanwu Xu · Jian Ren
|
ExHall D Poster #251 | |
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images
Poster Session 1
Jiuchen Chen · Xinyu Yan · Qizhi Xu · Kaiqi Li
|
ExHall D Poster #197 | |
Incomplete Multi-View Multi-label Learning via Disentangled Representation and Label Semantic Embedding
Poster Session 6
Xu Yan · Jun Yin · Jie Wen
|
ExHall D Poster #438 | |
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Poster Session 2
Haoyu Guo · He Zhu · Sida Peng · Haotong Lin · Yunzhi Yan · Tao Xie · Wenguan Wang · Xiaowei Zhou · Hujun Bao
|
ExHall D Poster #80 | |
Explicit Depth-Aware Blurry Video Frame Interpolation Guided by Differential Curves
Poster Session 1
yan zaoming · pengcheng lei · Tingting Wang · Faming Fang · Junkang Zhang · Yaomin Huang · Haichuan Song
|
ExHall D Poster #170 | |
Learnable Infinite Taylor Gaussian for Dynamic View Rendering
Poster Session 6
Bingbing Hu · Yanyan Li · rui xie · Bo Xu · Haoye Dong · Junfeng Yao · Gim Hee Lee
|
ExHall D Poster #67 | |
Consistency Posterior Sampling for Diverse Image Synthesis
Poster Session 6
Vishal Purohit · Matthew Repasky · Jianfeng Lu · Qiang Qiu · Yao Xie · Xiuyuan Cheng
|
ExHall D Poster #206 | |
ChatHuman: Chatting about 3D Humans with Tools
Poster Session 2
Jing Lin · Yao Feng · Weiyang Liu · Michael J. Black
|
ExHall D Poster #265 | |
Language Guided Concept Bottleneck Models for Interpretable Continual Learning
Poster Session 3
Lu Yu · HaoYu Han · Zhe Tao · Hantao Yao · Changsheng Xu
|
ExHall D Poster #414 | |
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Poster Session 5
Hao Wen · Zehuan Huang · Yaohui Wang · Xinyuan Chen · Lu Sheng
|
ExHall D Poster #55 | |
Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
Poster Session 1
Jin Wang · Chenghui Lv · Xian Li · Shichao Dong · Huadong Li · kelu Yao · Chao Li · Wenqi Shao · Ping Luo
|
ExHall D Poster #388 | |
Hierarchical Gaussian Mixture Model Splatting for Efficient and Part Controllable 3D Generation
Poster Session 3
Qitong Yang · Mingtao Feng · Zijie Wu · Weisheng Dong · Fangfang Wu · Yaonan Wang · Ajmal Mian
|
ExHall D Poster #43 | |
Feature Information Driven Position Gaussian Distribution Estimation for Tiny Object Detection
Poster Session 6
Jinghao Bian · Mingtao Feng · Weisheng Dong · Fangfang Wu · Jianqiao Luo · Yaonan Wang · Guangming Shi
|
ExHall D Poster #405 | |
Open-World Objectness Modeling Unifies Novel Object Detection
Poster Session 6
Shan Zhang · Yao Ni · Jinhao Du · Yuan Xue · Philip H.S. Torr · Piotr Koniusz · Anton van den Hengel
|
ExHall D Poster #401 | |
NN-Former: Rethinking Graph Structure in Neural Architecture Representation
Poster Session 2
Ruihan Xu · Haokui Zhang · Yaowei Wang · Wei Zeng · Shiliang Zhang
|
ExHall D Poster #441 | |
Video Language Model Pretraining with Spatio-temporal Masking
Poster Session 2
Yue Wu · Zhaobo Qi · Junshu Sun · Yaowei Wang · Qingming Huang · Shuhui Wang
|
ExHall D Poster #304 | |
Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation
Poster Session 6
Xiaoqi Li · Lingyun Xu · Mingxu Zhang · Jiaming Liu · Yan Shen · Iaroslav Ponomarenko · Jiahui Xu · Liang Heng · Siyuan Huang · Shanghang Zhang · Hao Dong
|
ExHall D Poster #141 | |
FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images
Poster Session 1
Rong Wang · Fabian Prada · Ziyan Wang · Zhongshi Jiang · Chengxiang Yin · Junxuan Li · Shunsuke Saito · Igor Santesteban · Javier Romero · Rohan Joshi · Hongdong Li · Jason Saragih · Yaser Sheikh
|
ExHall D Poster #11 | |
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Poster Session 5
Mariam Hassan · Sebastian Stapf · Ahmad Rahimi · Pedro M B Rezende · Yasaman Haghighi · David Brüggemann · Isinsu Katircioglu · Lin Zhang · Xiaoran Chen · Suman Saha · Marco Cannici · Elie Aljalbout · Botao Ye · Xi Wang · Aram Davtyan · Mathieu Salzmann · Davide Scaramuzza · Marc Pollefeys · Paolo Favaro · Alex Alahi
|
ExHall D Poster #129 | |
Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attack on Breast Ultrasound Images
Poster Session 6
Yasamin Medghalchi · Moein Heidari · Clayton Allard · Leonid Sigal · Ilker Hacihaliloglu
|
ExHall D Poster #229 | |
Harnessing Frozen Unimodal Encoders for Flexible Multimodal Alignment
Poster Session 6
Mayug Maniparambil · Raiymbek Akshulakov · YASSER ABDELAZIZ DAHOU DJILALI · Sanath Narayan · Ankit Singh · Noel O'Connor
|
ExHall D Poster #354 | |
NoT: Federated Unlearning via Weight Negation
Poster Session 5
Yasser Khalil · Leo Maxime Brunswic · Soufiane Lamghari · Xu Li · Mahdi Beitollahi · Xi Chen
|
ExHall D Poster #452 | |
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models
Poster Session 5
Yassir Bendou · Amine Ouasfi · Vincent Gripon · Adnane Boukhayma
|
ExHall D Poster #387 | |
Gromov–Wasserstein Problem with Cyclic Symmetry
Poster Session 5
Shoichiro Takeda · Yasunori Akagi
|
ExHall D Poster #69 | |
MeshArt: Generating Articulated Meshes with Structure-Guided Transformers
Poster Session 1
Daoyi Gao · Mohd Yawar Nihal Siddiqui · Lei Li · Angela Dai
|
ExHall D Poster #42 | |
A3: Few-shot Prompt Learning of Unlearnable Examples with Cross-Modal Adversarial Feature Alignment
Poster Session 2
Wang Xuan · Xitong Gao · Dongping Liao · Tianrui Qin · Yu-liang Lu · Cheng-Zhong Xu
|
ExHall D Poster #394 | |
Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment
Poster Session 1
Yang Bai · Yucheng Ji · Min Cao · Jinqiao Wang · Mang Ye
|
ExHall D Poster #360 | |
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images
Poster Session 5
Kazi Sajeed Mehrab · M. Maruf · Arka Daw · Abhilash Neog · Harish Babu Manogaran · Mridul Khurana · Zhenyang Feng · Bahadir Altintas · Yasin Bakis · Elizabeth Campolongo · Matthew Thompson · Xiaojun Wang · Hilmar Lapp · Tanya Berger-Wolf · Paula Mabee · Henry Bart · Wei-Lun Chao · Wasla Dahdul · Anuj Karpatne
|
ExHall D Poster #311 | |
A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation
Poster Session 6
Andrew Z Wang · Songwei Ge · Tero Karras · Ming-Yu Liu · Yogesh Balaji
|
ExHall D Poster #230 | |
Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning
Poster Session 4
Huajie Jiang · Zhengxian Li · Xiaohan Yu · Yongli Hu · Baocai Yin · Jian Yang · Yuankai Qi
|
ExHall D Poster #426 | |
Exploring Historical Information for RGBE Visual Tracking with Mamba
Poster Session 2
Chuanyu Sun · Jiqing Zhang · Yang Wang · Huilin Ge · qianchen xia · Baocai Yin · Xin Yang
|
ExHall D Poster #107 | |
MetricGrids: Arbitrary Nonlinear Approximation with Elementary Metric Grids based Implicit Neural Representation
Shu Wang · Yanbo Gao · Shuai Li · Chong Lv · Xun Cai · chuankun Li · Hui Yuan · jinglin zhang
|
ExHall D Poster #32 | |
Active Hyperspectral Imaging Using an Event Camera
Bohan Yu · Jinxiu Liang · Zhuofeng Wang · Bin Fan · Art Subpaasa · Boxin Shi · Imari Sato
|
ExHall D Poster #71 | |
EventPSR: Surface Normal and Reflectance Estimation from Photometric Stereo Using an Event Camera
Bohan Yu · Jin Han · Boxin Shi · Imari Sato
|
ExHall D Poster #73 | |
SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything Model
Poster Session 4
Chongkai Yu · Ting Liu · Li Anqi · Xiaochao Qu · WU CHENGJING · Luoqi Liu · Xiaolin Hu
|
ExHall D Poster #339 | |
Generative Omnimatte: Learning to Decompose Video into Layers
Poster Session 3
Yao-Chih Lee · Erika Lu · Sarah Rumbley · Michal Geyer · Jia-Bin Huang · Tali Dekel · Forrester Cole
|
ExHall D Poster #178 | |
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Poster Session 1
Hongjie Wang · Chih-Yao Ma · Yen-Cheng Liu · Ji Hou · Tao Xu · Jialiang Wang · Felix Juefei-Xu · Yaqiao Luo · Peizhao Zhang · Tingbo Hou · Peter Vajda · Niraj Jha · Xiaoliang Dai
|
ExHall D Poster #231 | |
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Poster Session 4
Kuang Wu · Chuan Yang · Zhanbin Li
|
ExHall D Poster #130 | |
Can't Slow Me Down: Learning Robust and Hardware-Adaptive Object Detectors against Latency Attacks for Edge Devices
Poster Session 4
Tianyi Wang · Zichen Wang · Cong Wang · Yuanchao Shu · Ruilong Deng · Peng Cheng · Jiming Chen
|
ExHall D Poster #327 | |
Scaling Inference Time Compute for Diffusion Models
Nanye Ma · Shangyuan Tong · Haolin Jia · Hexiang Hu · Yu-Chuan Su · Mingda Zhang · Xuan Yang · Yandong Li · Tommi Jaakkola · Xuhui Jia · Saining Xie
|
ExHall D Poster #226 | |
Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis
Poster Session 1
Hongyu Sun · Qiuhong Ke · Ming Cheng · Yongcai Wang · Deying Li · Chenhui Gou · Jianfei Cai
|
ExHall D Poster #102 | |
FluxSpace: Disentangled Semantic Editing in Rectified Flow Models
Poster Session 3
Yusuf Dalva · Kavana Venkatesh · Pinar Yanardag
|
ExHall D Poster #232 | |
SKE-Layout: Spatial Knowledge Enhanced Layout Generation with LLMs
Poster Session 4
Junsheng Wang · Nieqing Cao · Yan Ding · Mengying Xie · Fuqiang Gu · Chao Chen
|
ExHall D Poster #344 | |
Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation
Poster Session 5
Yuanqi Yao · Siao Liu · Haoming Song · Delin Qu · Qizhi Chen · Yan Ding · Bin Zhao · Zhigang Wang · Dong Wang · Xuelong Li
|
ExHall D Poster #144 | |
A Hubness Perspective on Representation Learning for Graph-Based Multi-View Clustering
Poster Session 3
Zheming Xu · He Liu · Congyan Lang · Tao Wang · Yidong Li · Michael C. Kampffmeyer
|
ExHall D Poster #467 | |
Anatomical Consistency and Adaptive Prior-informed Transformation for Multi-contrast MR Image Synthesis via Diffusion Model
Poster Session 6
Yejee Shin · Yeeun Lee · Hanbyol Jang · Geonhui Son · Hyeongyu Kim · Dosik Hwang
|
ExHall D Poster #456 | |
Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning
Poster Session 1
Jeongryong Lee · Yejee Shin · Geonhui Son · Dosik Hwang
|
ExHall D Poster #369 | |
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
Poster Session 1
Ying Chen · Guoan Wang · Yuanfeng Ji · Yanjun Li · Jin Ye · Tianbin Li · Ming Hu · Rongshan Yu · Yu Qiao · Junjun He
|
ExHall D Poster #475 | |
Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration
Poster Session 6
Zilong Huang · Jun He · Junyan Ye · Lihan Jiang · Weijia Li · Yiping Chen · Ting Han
|
ExHall D Poster #55 | |
Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics
Poster Session 1
Chen Liu · Liying Yang · Peike Li · Dadong Wang · Lincheng Li · Xin Yu
|
ExHall D Poster #284 | |
Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment
Poster Session 6
Chen Liu · Peike Li · Liying Yang · Dadong Wang · Lincheng Li · Xin Yu
|
ExHall D Poster #262 | |
Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior
Poster Session 3
Chanhui Lee · Yeonghwan Song · Jeany Son
|
ExHall D Poster #311 | |
SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing
Poster Session 6
Xueting Li · Ye Yuan · Shalini De Mello · Miles Macklin · Jonathan Leaf · Gilles Daviet · Jan Kautz · Umar Iqbal
|
ExHall D Poster #13 | |
BLADE: Single-view Body Mesh Estimation through Accurate Depth Estimation
Poster Session 5
Shengze Wang · Jiefeng Li · Tianye Li · Ye Yuan · Henry Fuchs · Koki Nagano · Shalini De Mello · Michael Stengel
|
ExHall D Poster #90 | |
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao liang · Baoquan Zhang · Zhiyuan Wen · Junteng Zhao · Yunming Ye · Guangming Ye · Yao He
|
ExHall D Poster #371 | |
AlphaPre: Amplitude-Phase Disentanglement Model for Precipitation Nowcasting
Poster Session 4
Kenghong Lin · Baoquan Zhang · Demin Yu · Wenzhi Feng · Shidong Chen · Feifan Gao · Xutao Li · Yunming Ye
|
ExHall D Poster #194 | |
Sensitivity-Aware Efficient Fine-Tuning via Compact Dynamic-Rank Adaptation
Poster Session 2
Tianran Chen · Jiarui Chen · Baoquan Zhang · Zhehao Yu · Shidong Chen · Rui Ye · Xutao Li · Yunming Ye
|
ExHall D Poster #408 | |
Overcoming Shortcut Problem in VLM for Robust Out-of-Distribution Detection
Zhuo Xu · Xiang Xiang · Yifan Liang
|
ExHall D Poster #455 | |
Divide and Conquer: Heterogeneous Noise Integration for Diffusion-based Adversarial Purification
Poster Session 6
Gaozheng Pei · Shaojie Lyu · Gong Chen · Ke Ma · Qianqian Xu · Yingfei Sun · Qingming Huang
|
ExHall D Poster #298 | |
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Poster Session 2
Fanqi Pu · Yifan Wang · Jiru Deng · Wenming Yang
|
ExHall D Poster #109 | |
PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors
Poster Session 6
Guangshun Wei · Yuan Feng · Long Ma · Chen Wang · Yuanfeng Zhou · Changjian Li
|
ExHall D Poster #104 | |
Autoregressive Sequential Pretraining for Visual Tracking
Poster Session 2
Shiyi Liang · Yifan Bai · Yihong Gong · Xing Wei
|
ExHall D Poster #181 | |
Asynchronous Collaborative Graph Representation for Frames and Events
Poster Session 1
Dianze Li · Jianing Li · Xu Liu · Xiaopeng Fan · Yonghong Tian
|
ExHall D Poster #139 | |
VLMs-Guided Representation Distillation for Efficient Vision-Based Reinforcement Learning
Poster Session 6
Haoran Xu · Peixi Peng · Guang Tan · Yiqian Chang · Luntong Li · Yonghong Tian
|
ExHall D Poster #323 | |
Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
Zeqing Wang · Qingyang Ma · Wentao Wan · Haojie Li · Keze Wang · Yonghong Tian
|
ExHall D Poster #17 | |
DreamRelation: Bridging Customization and Relation Generation
Poster Session 4
Qingyu Shi · Lu Qi · Jianzong Wu · Jinbin Bai · Jingbo Wang · Yunhai Tong · Xiangtai Li
|
ExHall D Poster #251 | |
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
Poster Session 4
Wenrui Cai · Qingjie Liu · Yunhong Wang
|
ExHall D Poster #100 | |
Perceptual Video Compression with Neural Wrapping
Poster Session 4
Muhammad Umar Karim Khan · Aaron Chadha · Mohammad Ashraful Anam · Yiannis Andreopoulos
|
ExHall D Poster #185 | |
Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation
Poster Session 3
Jingxi Chen · Brandon Y. Feng · Haoming Cai · Tianfu Wang · Levi Burner · Dehao Yuan · Cornelia Fermuller · Christopher Metzler · Yiannis Aloimonos
|
ExHall D Poster #172 | |
Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows
Poster Session 6
Shentong Mo · Yibing Song
|
ExHall D Poster #261 | |
Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization
Poster Session 4
Sihao Liu · Yibo Yang · Xiaojie Li · David A. Clifton · Bernard Ghanem
|
ExHall D Poster #447 | |
Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses
Poster Session 2
Yongfan Liu · Hyoukjun Kwon
|
ExHall D Poster #78 | |
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors
Poster Session 4
Yifan Yu · Shaohui Liu · Rémi Pautrat · Marc Pollefeys · Viktor Larsson
|
ExHall D Poster #84 | |
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Poster Session 6
Hongbin Lin · Zilu Guo · Yifan Zhang · Shuaicheng Niu · Yafeng Li · Ruimao Zhang · Shuguang Cui · Zhen Li
|
ExHall D Poster #128 | |
pFedMxF: Personalized Federated Class-Incremental Learning with Mixture of Frequency Aggregation
Poster Session 6
Yifei Zhang · Hao Zhu · Alysa Ziying Tan · Dianzhi Yu · Longtao Huang · Han Yu
|
ExHall D Poster #430 | |
BiLoRA: Almost-Orthogonal Parameter Spaces for Continual Learning
Poster Session 5
Hao Zhu · Yifei Zhang · Junhao Dong · Piotr Koniusz
|
ExHall D Poster #437 | |
BrepGiff: Lightweight Generation of Complex B-rep with 3D GAT Diffusion
Poster Session 6
Hao Guo · Xiaoshui Huang · Hao jiacheng · Yunpeng Bai · Hongping Gan · Yilei Shi
|
ExHall D Poster #41 | |
Q-PART: Quasi-Periodic Adaptive Regression with Test-time Training for Pediatric Left Ventricular Ejection Fraction Regression
Poster Session 3
Jie Liu · Tiexin Qin · Hui Liu · Yilei Shi · Lichao Mou · Xiao Xiang Zhu · Shiqi Wang · Haoliang Li
|
ExHall D Poster #470 | |
Split Adaptation for Pre-trained Vision Transformers
Poster Session 4
Lixu Wang · Bingqi Shang · Yi Li · Payal Mohapatra · Wei Dong · Xiao Wang · Qi Zhu
|
ExHall D Poster #409 | |
TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition
Poster Session 1
yilong wang · Zilin Gao · Qilong Wang · Zhaofeng Chen · Peihua Li · Qinghua Hu
|
ExHall D Poster #313 | |
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation
Poster Session 4
Qi Lv · Hao Li · Xiang Deng · Rui Shao · Yinchuan Li · Jianye Hao · Longxiang Gao · MICHAEL YU WANG · Liqiang Nie
|
ExHall D Poster #153 | |
De^2Gaze: Deformable and Decoupled Representation Learning for 3D Gaze Estimation
Poster Session 1
Yunfeng Xiao · Xiaowei Bai · Baojun Chen · Hao Su · Hao He · Liang Xie · Erwei Yin
|
ExHall D Poster #280 | |
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
Bencheng Liao · Shaoyu Chen · haoran yin · Bo Jiang · Cheng Wang · Sixu Yan · xinbang zhang · Xiangyu Li · ying zhang · Qian Zhang · Xinggang Wang
|
ExHall D Poster #134 | |
Hyperbolic Category Discovery
Poster Session 2
Yuanpei Liu · Zhenqi He · Kai Han
|
ExHall D Poster #430 | |
Exploring Simple Open-Vocabulary Semantic Segmentation
Poster Session 6
Zihang Lai
|
ExHall D Poster #390 | |
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Poster Session 6
Alice Heiman · Xiaoman Zhang · Emma Chen · Sung Eun Kim · Pranav Rajpurkar
|
ExHall D Poster #444 | |
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World
Poster Session 4
Bangyan Liao · Zhenjun Zhao · Haoang Li · Yi Zhou · Yingping Zeng · Hao Li · Peidong Liu
|
ExHall D Poster #102 | |
DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation
Poster Session 3
Wang Zhao · Yan-Pei Cao · Jiale Xu · Yue-Jiang Dong · Ying Shan
|
ExHall D Poster #39 | |
UniVAD: A Training-free Unified Model for Few-shot Visual Anomaly Detection
Poster Session 3
Zhaopeng Gu · Bingke Zhu · Guibo Zhu · Yingying Chen · Ming Tang · Jinqiao Wang
|
ExHall D Poster #435 | |
EchoONE: Segmenting Multiple Echocardiography Planes in One Model
Poster Session 1
Jiongtong Hu · Wei Zhuo · Jun Cheng · YINGYING LIU · Wufeng Xue · Dong Ni
|
ExHall D Poster #482 | |
Mimic In-Context Learning for Multimodal Tasks
Poster Session 6
Yuchu Jiang · Jiale Fu · chenduo hao · Xinting Hu · Yingzhe Peng · Xin Geng · Xu Yang
|
ExHall D Poster #352 | |
ROD-MLLM: Towards More Reliable Object Detection in Multimodal Large Language Models
Poster Session 3
Heng Yin · Yuqiang Ren · Ke Yan · Shouhong Ding · Yongtao Hao
|
ExHall D Poster #354 | |
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences
Poster Session 1
Hongyan Zhi · Peihao Chen · Junyan Li · Shuailei Ma · Xinyu Sun · Tianhang Xiang · Yinjie Lei · Mingkui Tan · Chuang Gan
|
ExHall D Poster #342 | |
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation
Poster Session 3
Lijun Li · Zhelun Shi · Xuhao Hu · Bowen Dong · Yiran Qin · Xihui Liu · Lu Sheng · Jing Shao
|
ExHall D Poster #260 | |
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Poster Session 1
Yiran Xu · Taesung Park · Richard Zhang · Yang Zhou · Eli Shechtman · Feng Liu · Jia-Bin Huang · Difan Liu
|
ExHall D Poster #185 | |
STINR: Deciphering Spatial Transcriptomics via Implicit Neural Representation
Poster Session 5
Yisi Luo · Xile Zhao · Kai Ye · Deyu Meng
|
ExHall D Poster #470 | |
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise
Poster Session 1
Ryan Burgert · Yuancheng Xu · Wenqi Xian · Oliver Pilarski · Pascal Clausen · Mingming He · Li Ma · Yitong Deng · Lingxiao Li · Mohsen Mousavi · Michael Ryoo · Paul Debevec · Ning Yu
|
ExHall D Poster #174 | |
MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation
Poster Session 2
Zilong Chen · Yikai Wang · Wenqiang Sun · Feng Wang · Yiwen Chen · Huaping Liu
|
ExHall D Poster #37 | |
EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis
Poster Session 3
Sheng Miao · Jiaxin Huang · Dongfeng Bai · Xu Yan · Hongyu Zhou · Yue Wang · Bingbing Liu · Andreas Geiger · Yiyi Liao
|
ExHall D Poster #60 | |
High Dynamic Range Video Compression: A Large-Scale Benchmark Dataset and A Learned Bit-depth Scalable Compression Algorithm
Poster Session 2
Zhaoyi Tian · Feifeng Wang · Shiwei Wang · Zihao Zhou · Yao Zhu · Liquan Shen
|
ExHall D Poster #187 | |
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
Poster Session 6
Hmrishav Bandyopadhyay · Yi-Zhe Song
|
ExHall D Poster #212 | |
Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
Poster Session 5
Xiangjun Gao · Xiaoyu Li · Yiyu Zhuang · Qi Zhang · Wenbo Hu · Chaopeng Zhang · Yao Yao · Ying Shan · Long Quan
|
ExHall D Poster #33 | |
A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
Poster Session 4
Wangbo Zhao · Yizeng Han · Jiasheng Tang · Zhikai Li · Yibing Song · Kai Wang · Zhangyang Wang · Yang You
|
ExHall D Poster #382 | |
Adaptive Parameter Selection for Tuning Vision-Language Models
Poster Session 1
Yi Zhang · Yi-Xuan Deng · Meng-Hao Guo · Shi-Min Hu
|
ExHall D Poster #392 | |
EchoMatch: Partial-to-Partial Shape Matching via Correspondence Reflection
Poster Session 3
Yizheng Xie · Viktoria Ehm · Paul Roetzer · Nafie El Amrani · Maolin Gao · Florian Bernard · Daniel Cremers
|
ExHall D Poster #98 | |
RestorGS: Depth-aware Gaussian Splatting for Efficient 3D Scene Restoration
Poster Session 3
Yuanjian Qiao · Mingwen Shao · Lingzhuang Meng · Kai Xu
|
ExHall D Poster #50 | |
SF2T: Self-supervised Fragment Finetuning of Video-LLMs for Fine-Grained Understanding
Poster Session 6
Yangliu Hu · Zikai Song · Na Feng · Yawei Luo · Junqing Yu · Yi-Ping Phoebe Chen · Wei Yang
|
ExHall D Poster #283 | |
TCFG: Tangential Damping Classifier-free Guidance
Poster Session 1
Mingi Kwon · Shin seong Kim · Jaeseok Jeong · Yi-Ting Hsiao · Youngjung Uh
|
ExHall D Poster #235 | |
Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model
Poster Session 1
Yuhan Wang · Suzhi Bi · Ying-Jun Angela Zhang · Xiaojun Yuan
|
ExHall D Poster #208 | |
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
Poster Session 2
Vladan Stojnić · Yannis Kalantidis · Jiri Matas · Giorgos Tolias
|
ExHall D Poster #421 | |
DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers
Poster Session 6
Mert Bülent Sarıyıldız · Philippe Weinzaepfel · Thomas Lucas · Pau de Jorge · Diane Larlus · Yannis Kalantidis
|
ExHall D Poster #376 | |
FLAVC: Learned Video Compression with Feature Level Attention
Poster Session 6
Chun Zhang · Heming Sun · Jiro Katto
|
ExHall D Poster #176 | |
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Poster Session 2
Lunhao Duan · Shanshan Zhao · Wenjun Yan · Yinglun Li · Qing-Guo Chen · Zhao Xu · Weihua Luo · Kaifu Zhang · Mingming Gong · Gui-Song Xia
|
ExHall D Poster #248 | |
V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents
Poster Session 1
Zhengrong Yue · Shaobin Zhuang · Kunchang Li · Yanbo Ding · Yali Wang
|
ExHall D Poster #290 | |
HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery
Poster Session 2
Yuto Matsubara · Ko Nishino
|
ExHall D Poster #99 | |
ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning
Poster Session 6
Quanxing Zha · Xin Liu · Shu-Juan Peng · Yiu-ming Cheung · Xing Xu · Nannan Wang
|
ExHall D Poster #338 | |
AniDoc: Animation Creation Made Easier
Poster Session 4
Yihao Meng · Hao Ouyang · Hanlin Wang · Qiuyu Wang · Wen Wang · Ka Leong Cheng · Zhiheng Liu · Yujun Shen · Huamin Qu
|
ExHall D Poster #227 | |
Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios
Poster Session 3
Hang Shao · lei luo · Jianjun Qian · Mengkai Yan · Shuo Chen · Jian Yang
|
ExHall D Poster #19 | |
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?
Poster Session 5
Jianyang Xie · Yitian Zhao · Yanda Meng · He Zhao · Anh Nguyen · Yalin Zheng
|
ExHall D Poster #314 | |
SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis
Poster Session 1
Junho Kim · Hyunjun Kim · Hosu Lee · Yong Man Ro
|
ExHall D Poster #304 | |
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
Poster Session 6
Byung-Kwan Lee · Ryo Hachiuma · Yu-Chiang Frank Wang · Yong Man Ro · Yueh-Hua Wu
|
ExHall D Poster #324 | |
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Ashmal Vayani · Dinura Dissanayake · Hasindri Watawana · Noor Ahsan · Nevasini Sasikumar · Omkar Thawakar · Henok Biadglign Ademtew · Yahya Hmaiti · Amandeep Kumar · Kartik Kuckreja · Mykola Maslych · Wafa Al Ghallabi · Mihail Minkov Mihaylov · Chao Qin · Abdelrahman Shaker · Mike Zhang · Mahardika Krisna Ihsani · Amiel Gian Esplana · Monil Gokani · Shachar Mirkin · Harsh Singh · Ashay Srivastava · Endre Hamerlik · Fathinah Asma Izzati · Fadillah Adamsyah Maani · Sebastian Cavada · Jenny Chim · Rohit Gupta · Sanjay Manjunath · Kamila Zhumakhanova · Feno Heriniaina Rabevohitra · Azril Hafizi Amirudin · Muhammad Ridzuan · Daniya Najiha Abdul Kareem · Ketan Pravin More · Kunyang Li · Pramesh Shakya · Muhammad Saad · Amirpouya Ghasemaghaei · Amirbek Djanibekov · Dilshod Azizov · Branislava Jankovic · Naman Bhatia · Alvaro Cabrera Berobide · Johan Obando-Ceron · Olympiah Otieno · Fabian Farestam · Muztoba Rabbani · Sanoojan Baliah · Santosh Sanjeev · Abduragim Shtanchaev · Maheen Fatima · Thao Nguyen · Amrin Kareem · Toluwani Aremu · Nathan Augusto Zacarias Xavier · Amit Bhatkal · Hawau Olamide Toyin · Aman Chadha · Hisham Cholakkal · Rao Anwer · Michael Felsberg · Jorma Laaksonen · Thamar Solorio · Monojit Choudhury · Ivan Laptev · Mubarak Shah · Salman Khan · Fahad Shahbaz Khan
|
ExHall D Poster #358 | |
MUSt3R: Multi-view Network for Stereo 3D Reconstruction
Yohann Cabon · Lucas Stoffl · Leonid Antsfeld · Gabriela Csurka · Boris Chidlovskii · Jerome Revaud · Vincent Leroy
|
ExHall D Poster #82 | |
Open-Canopy: Towards Very High Resolution Forest Monitoring
Fajwel Fogel · Yohann PERRON · Nikola Besic · Laurent Saint-André · Agnès Pellissier-Tanon · Thomas Boudras · Martin Schwartz · Ibrahim Fayad · Alexandre d'Aspremont · Loic Landrieu · Philippe Ciais
|
ExHall D Poster #114 | |
Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark
Poster Session 6
Hao Guo · Xugong Qin · Jun Jie Ou Yang · peng zhang · Gangyan Zeng · Yubo Li · Hailun Lin
|
ExHall D Poster #343 | |
Articulated Kinematics Distillation from Video Diffusion Models
Poster Session 4
Xuan Li · Qianli Ma · Tsung-Yi Lin · Yongxin Chen · Chenfanfu Jiang · Ming-Yu Liu · Donglai Xiang
|
ExHall D Poster #169 | |
ControlFace: Harnessing Facial Parametric Control for Face Rigging
Poster Session 2
Wooseok Jang · Youngjun Hong · Geonho Cha · Seungryong Kim
|
ExHall D Poster #16 | |
FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance
Poster Session 1
Dian Shao · Mingfei Shi · Shengda Xu · Haodong Chen · Yongle Huang · Binglu Wang
|
ExHall D Poster #161 | |
VEU-Bench: Towards Comprehensive Understanding of Video Editing
Bozheng Li · Yongliang Wu · YI LU · Jiashuo Yu · Licheng Tang · Jiawang Cao · Wenqing Zhu · Yuyang Sun · Jay Wu · Wenbo Zhu
|
ExHall D Poster #289 | |
TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation
Poster Session 2
Yabiao Wang · Shuo Wang · Jiangning Zhang · Ke Fan · Jiafu Wu · Xuezhucun Xue · Yong Liu
|
ExHall D Poster #173 | |
Action Detail Matters: Refining Video Recognition with Local Action Queries
Poster Session 4
Mengmeng Wang · Zeyi Huang · Xiangjie Kong · Guojiang Shen · Guang Dai · Jingdong Wang · Yong Liu
|
ExHall D Poster #318 | |
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
Poster Session 3
Tai-Yu Daniel Pan · Sooyoung Jeon · Mengdi Fan · Jinsu Yoo · Zhenyang Feng · Mark Campbell · Kilian Q Weinberger · Bharath Hariharan · Wei-Lun Chao
|
ExHall D Poster #133 | |
Vision-Language Models Do Not Understand Negation
Poster Session 6
Kumail Alhamoud · Shaden Alshammari · Yonglong Tian · Guohao Li · Philip H.S. Torr · Yoon Kim · Marzyeh Ghassemi
|
ExHall D Poster #331 | |
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models
Poster Session 2
Yoojin Jung · Byung Cheol Song
|
ExHall D Poster #412 | |
Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
Tomoya Yoshida · Shuhei Kurita · Taichi Nishimura · Shinsuke Mori
|
ExHall D Poster #151 | |
Leveraging Temporal Cues for Semi-Supervised Multi-View 3D Object Detection
Poster Session 6
Jinhyung Park · Navyata Sanghvi · Hiroki Adachi · Yoshihisa Shibata · Shawn Hunt · Shinya Tanaka · Hironobu Fujiyoshi · Kris Kitani
|
ExHall D Poster #119 | |
HyperPose: Hypernetwork-Infused Camera Pose Localization and an Extended Cambridge Landmarks Dataset
Poster Session 3
Ron Ferens · Yosi Keller
|
ExHall D Poster #86 | |
Novel View Synthesis with Pixel-Space Diffusion Models
Poster Session 6
Noam Elata · Bahjat Kawar · Yaron Ostrovsky-Berman · Miriam Farber · Ron Sokolovsky
|
ExHall D Poster #59 | |
VladVA: Discriminative Fine-tuning of LVLMs
Poster Session 1
Yassine Ouali · Adrian Bulat · ALEXANDROS XENOS · Anestis Zaganidis · Ioannis Maniadis Metaxas · Brais Martinez · Georgios Tzimiropoulos
|
ExHall D Poster #375 | |
RENO: Real-Time Neural Compression for 3D LiDAR Point Clouds
Poster Session 5
Kang You · Tong Chen · Dandan Ding · M. Salman Asif · Zhan Ma
|
ExHall D Poster #107 | |
FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy
Poster Session 1
Xingchao Yang · Takafumi Taketomi · Yuki Endo · Yoshihiro Kanamori
|
ExHall D Poster #15 | |
Test-Time Fine-Tuning of Image Compression Models for Multi-Task Adaptability
Poster Session 1
Unki Park · Seongmoon Jeong · Jang Youngchan · Gyeong-Moon Park · Jong Hwan Ko
|
ExHall D Poster #409 | |
CLIP-driven Coarse-to-fine Semantic Guidance for Fine-grained Open-set Semi-supervised Learning
Poster Session 6
Xiaokun Li · Yaping Huang · Qingji Guan
|
ExHall D Poster #399 | |
Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
Poster Session 3
Yiping Wang · Xuehai He · Kuan Wang · Luyao Ma · Jianwei Yang · Shuohang Wang · Simon Shaolei Du · yelong shen
|
ExHall D Poster #284 | |
A Selective Re-learning Mechanism for Hyperspectral Fusion Imaging
Poster Session 2
Yuanye Liu · jinyang liu · Renwei Dian · Shutao Li
|
ExHall D Poster #198 | |
Directional Label Diffusion Model for Learning from Noisy Labels
Poster Session 5
Senyu Hou · Gaoxia Jiang · Jia Zhang · Shangrong Yang · Husheng Guo · Yaqing Guo · Wenjian Wang
|
ExHall D Poster #450 | |
DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models
Poster Session 2
Yongqi Huang · Peng Ye · Chenyu Huang · Jianjian Cao · Lin Zhang · Baopu Li · Gang Yu · Tao Chen
|
ExHall D Poster #446 | |
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins
Yao Mu · Tianxing Chen · Zanxin Chen · Shijia Peng · Zhiqian Lan · Zeyu Gao · Zhixuan Liang · Qiaojun Yu · Yude Zou · Mingkun Xu · Lunkai Lin · Zhiqiang Xie · Mingyu Ding · Ping Luo
|
ExHall D Poster #142 | |
PI-HMR: Towards Robust In-bed Temporal Human Shape Reconstruction with Contact Pressure Sensing
Poster Session 6
Ziyu Wu · Yufan Xiong · Mengting Niu · Fangting Xie · Quan Wan · Qijun Ying · Boyan Liu · Xiaohui Cai
|
ExHall D Poster #150 | |
Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks
Poster Session 3
Peng Xie · Yequan Bie · Jianda Mao · Yangqiu Song · Yang Wang · Hao Chen · Kani Chen
|
ExHall D Poster #386 | |
UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation
Poster Session 3
Yinqiao Wang · Hao Xu · Pheng-Ann Heng · Chi-Wing Fu
|
ExHall D Poster #152 | |
Hiding Images in Diffusion Models by Editing Learned Score Functions
Poster Session 4
Haoyu Chen · Yunqiao Yang · Nan Zhong · Kede Ma
|
ExHall D Poster #275 | |
Mind the Gap: Confidence Discrepancy Can Guide Federated Semi-Supervised Learning Across Pseudo-Mismatch
Poster Session 2
Yijie Liu · Xinyi Shang · Yiqun Zhang · Yang Lu · Chen Gong · Jing-Hao Xue · Hanzi Wang
|
ExHall D Poster #457 | |
BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing
Yunqi Gu · Ian Huang · Jihyeon Je · Guandao Yang · Leonidas Guibas
|
ExHall D Poster #267 | |
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Poster Session 5
Ho-Joong Kim · Yearang Lee · Jung-Ho Hong · Seong-Whan Lee
|
ExHall D Poster #312 | |
DefMamba: Deformable Visual State Space Model
Poster Session 2
Leiye Liu · Miao Zhang · Jihao Yin · Tingwei Liu · Wei Ji · Yongri Piao · Huchuan Lu
|
ExHall D Poster #331 | |
Relation-Rich Visual Document Generator for Visual Information Extraction
Poster Session 3
Zi-Han Jiang · Chien-Wei Lin · WeiHua Li · Hsuan-Tung Liu · Yi-Ren Yeh · Chu-Song Chen
|
ExHall D Poster #363 | |
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation
Poster Session 4
Guy Yariv · Yuval Kirstain · Amit Zohar · Shelly Sheynin · Yaniv Taigman · Yossi Adi · Sagie Benaim · Adam Polyak
|
ExHall D Poster #228 | |
SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction
Poster Session 6
Yutao Tang · Yuxiang Guo · Deming Li · Cheng Peng
|
ExHall D Poster #64 | |
ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts
Poster Session 3
Dmitrii M Petrov · Pradyumn Goyal · Divyansh Shivashok · Yuanming Tao · Melinos Averkiou · Evangelos Kalogerakis
|
ExHall D Poster #253 | |
Enhancing Facial Privacy Protection via Weakening Diffusion Purification
Poster Session 2
Ali Salar · Qing Liu · Yingli Tian · Guoying Zhao
|
ExHall D Poster #273 | |
ABC-Former: Auxiliary Bimodal Cross-domain Transformer with Interactive Channel Attention for White Balance
Poster Session 5
Yu-Cheng Chiu · GUAN-RONG CHEN · Zihao Chen · Yan-Tsung Peng
|
ExHall D Poster #20 | |
PSHuman: Photorealistic Single-image 3D Human Reconstruction using Cross-Scale Multiview Diffusion and Explicit Remeshing
Poster Session 4
Peng Li · Wangguandong Zheng · Yuan Liu · Tao Yu · Yangguang Li · Xingqun Qi · Xiaowei Chi · Siyu Xia · Yan-Pei Cao · Wei Xue · Wenhan Luo · Yike Guo
|
ExHall D Poster #14 | |
Matrix3D: Large Photogrammetry Model All-in-One
Yuanxun Lu · Jingyang Zhang · Tian Fang · Jean-Daniel Nahmias · Yanghai Tsin · Long Quan · Xun Cao · Yao Yao · Shiwei Li
|
ExHall D Poster #57 | |
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception
Poster Session 3
Yuanchen Wu · Lu Zhang · Hang Yao · Junlong Du · Ke Yan · Shouhong Ding · Yunsheng Wu · Xiaoqiang Li
|
ExHall D Poster #383 | |
MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning
Poster Session 2
Xu Han · Yuan Tang · Jinfeng Xu · Xianzhi Li
|
ExHall D Poster #116 | |
Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
Poster Session 6
Lei Chen · Yuan Meng · Chen Tang · Xinzhu Ma · Jingyan Jiang · Xin Wang · Zhi Wang · Wenwu Zhu
|
ExHall D Poster #204 | |
Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather
Poster Session 1
Longyu Yang · Ping Hu · Shangbo Yuan · Lu Zhang · Jun Liu · Heng Tao Shen · Xiaofeng Zhu
|
ExHall D Poster #117 | |
Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted
Poster Session 2
Shuaiwei Yuan · Junyu Dong · Yuezun Li
|
ExHall D Poster #324 | |
Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation
Poster Session 1
Qiao Yu · Xianzhi Li · Yuan Tang · Xu Han · Long Hu · yixue Hao · Min Chen
|
ExHall D Poster #40 | |
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
Poster Session 1
Liao Qu · Huichao Zhang · Yiheng Liu · Xu Wang · Yi Jiang · Yiming Gao · Hu Ye · Daniel Kang Du · Zehuan Yuan · Xinglong Wu
|
ExHall D Poster #228 | |
ATA: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting
Poster Session 4
Yizhe Tang · Zhimin Sun · Yuzhen Du · Ran Yi · Guangben Lu · Teng Hu · LUYING LI · Lizhuang Ma · FangYuan Zou
|
ExHall D Poster #242 | |
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation
Poster Session 1
Ruineng Li · Daitao Xing · Huiming Sun · Yuanzhou Ha · Jinglin Shen · Chiuman Ho
|
ExHall D Poster #165 | |
Neural Motion Simulator: Pushing the Limit of World Models in Reinforcement Learning
Poster Session 6
Chenjie Hao · Weyl Lu · Yifan Xu · Yubei Chen
|
ExHall D Poster #138 | |
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing
Jingxuan Wei · Cheng Tan · Qi Chen · Gaowei Wu · Siyuan Li · Zhangyang Gao · Linzhuang Sun · Bihui Yu · Ruifeng Guo
|
ExHall D Poster #254 | |
SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model
Poster Session 5
Yucheng Mao · Boyang Wang · Nilesh Kulkarni · Jeong Joon Park
|
ExHall D Poster #54 | |
LaVin-DiT: Large Vision Diffusion Transformer
Poster Session 4
Zhaoqing Wang · Xiaobo Xia · Runnan Chen · Dongdong Yu · Changhu Wang · Mingming Gong · Tongliang Liu
|
ExHall D Poster #406 | |
HERA: Hybrid Explicit Representation for Ultra-Realistic Head Avatars
Poster Session 1
Hongrui Cai · Yuting Xiao · Xuan Wang · Jiafei Li · Yudong Guo · Yanbo Fan · Shenghua Gao · Juyong Zhang
|
ExHall D Poster #9 | |
Towards Effective and Sparse Adversarial Attack on Spiking Neural Networks via Breaking Invisible Surrogate Gradients
Poster Session 1
Li Lun · Kunyu Feng · Qinglong Ni · Ling Liang · Yuan Wang · Ying Li · dunshan yu · Xiaoxin CUI
|
ExHall D Poster #321 | |
Learning Physics-Based Full-Body Human Reaching and Grasping from Brief Walking References
Poster Session 6
Yitang Li · Mingxian Lin · Zhuo Lin · Yipeng Deng · Yue Cao · Li Yi
|
ExHall D Poster #144 | |
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Poster Session 4
Yang Yue · Yulin Wang · Chenxin Tao · Pan Liu · Shiji Song · Gao Huang
|
ExHall D Poster #473 | |
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Poster Session 5
Yang Yue · Yulin Wang · Haojun Jiang · Pan Liu · Shiji Song · Gao Huang
|
ExHall D Poster #476 | |
Scene-agnostic Pose Regression for Visual Localization
Poster Session 6
Junwei Zheng · Ruiping Liu · Yufan Chen · Zhenfang Chen · Kailun Yang · Jiaming Zhang · Rainer Stiefelhagen
|
ExHall D Poster #90 | |
Enhancing Testing-Time Robustness for Trusted Multi-View Classification in the Wild
Poster Session 3
Wei Liu · Yufei Chen · Xiaodong Yue
|
ExHall D Poster #465 | |
Advancing Multiple Instance Learning with Continual Learning for Whole Slide Imaging
Xianrui Li · Yufei Cui · Jun Li · Antoni B. Chan
|
ExHall D Poster #475 | |
Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition
Poster Session 4
Zhiyuan Chen · Keyi Li · Yifan Jia · Le Ye · Yufei Ma
|
ExHall D Poster #210 | |
EventGPT: Event Stream Understanding with Multimodal Large Language Models
Poster Session 6
shaoyu liu · Jianing Li · guanghui zhao · Yunjian Zhang · Xin Meng · Fei Richard Yu · Xiangyang Ji · Ming Li
|
ExHall D Poster #286 | |
SuperLightNet: Lightweight Parameter Aggregation Network for Multimodal Brain Tumor Segmentation
Poster Session 1
Feng Yu · Jiacheng Cao · Li Liu · Minghua Jiang
|
ExHall D Poster #481 | |
Spiking Transformer with Spatial-Temporal Attention
Poster Session 3
Donghyun Lee · Yuhang Li · Youngeun Kim · Shiting Xiao · Priyadarshini Panda
|
ExHall D Poster #315 | |
Multi-Modal Synergistic Implicit Image Enhancement for Efficient Optical Flow Estimation
Poster Session 1
Weichen Dai · wu hexing · xiaoyang weng · Yuxin Zheng · Yuhang Ming · Wanzeng Kong
|
ExHall D Poster #188 | |
Pay Attention to the Foreground in Object-Centric Learning
Poster Session 6
Pinzhuo Tian · Shengjie Yang · Hang Yu · Alex C. Kot
|
ExHall D Poster #396 | |
The Devil is in Low-Level Features for Cross-Domain Few-Shot Segmentation
Poster Session 1
Yuhan Liu · Yixiong Zou · Yuhua Li · Ruixuan Li
|
ExHall D Poster #426 | |
Spiking Transformer: Introducing Accurate Addition-Only Spiking Self-Attention for Transformer
Poster Session 5
Yufei Guo · Xiaode Liu · Yuanpei Chen · Weihang Peng · Yuhan Zhang · Zhe Ma
|
ExHall D Poster #322 | |
FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting
Poster Session 3
Fangyu Wu · Yuhao Chen
|
ExHall D Poster #38 | |
Language-Guided Salient Object Ranking
Poster Session 6
Fang Liu · Yuhao Liu · Ke Xu · Shuquan Ye · Gerhard Hancke · Rynson W.H. Lau
|
ExHall D Poster #350 | |
VODiff: Controlling Object Visibility Order in Text-to-Image Generation
Poster Session 4
Dong Liang · Jinyuan Jia · Yuhao Liu · Zhanghan Ke · Hongbo Fu · Rynson W.H. Lau
|
ExHall D Poster #246 | |
Multirate Neural Image Compression with Adaptive Lattice Vector Quantization
Hao Xu · Xiaolin Wu · Xi Zhang
|
ExHall D Poster #216 | |
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
Poster Session 4
Alejandro Lozano · Min Woo Sun · James Burgess · Liangyu Chen · Jeffrey J Nirschl · Jeffrey Gu · Ivan Lopez · Josiah Aklilu · Austin Wolfgang Katzer · Collin Chiu · Anita Rau · Xiaohan Wang · Yuhui Zhang · Alfred Seunghoon Song · Robert Tibshirani · Serena Yeung
|
ExHall D Poster #374 | |
Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution
Qihao Liu · Xi Yin · Alan L. Yuille · Andrew Brown · Mannat Singh
|
ExHall D Poster #249 | |
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
Wufei Ma · Luoxin Ye · Nessa McWeeney · Celso M. de Melo · Alan L. Yuille · Jieneng Chen
|
ExHall D Poster #137 | |
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models
Xingrui Wang · Wufei Ma · Tiezheng Zhang · Celso M. de Melo · Jieneng Chen · Alan L. Yuille
|
ExHall D Poster #348 | |
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
Poster Session 1
Bingjie Gao · Xinyu Gao · Xiaoxue Wu · yujie zhou · Yu Qiao · Li Niu · Xinyuan Chen · Yaohui Wang
|
ExHall D Poster #288 | |
DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Post-Capture Refocusing, Defocus Rendering and Blur Removal
Poster Session 5
Yujie Wang · Praneeth Chakravarthula · Baoquan Chen
|
ExHall D Poster #24 | |
One-shot 3D Object Canonicalization based on Geometric and Semantic Consistency
Poster Session 4
Li Jin · Yujie Wang · Wenzheng Chen · Qiyu Dai · Qingzhe Gao · Xueying Qin · Baoquan Chen
|
ExHall D Poster #98 | |
BG-Triangle: Bézier Gaussian Triangle for 3D Vectorization and Rendering
Poster Session 4
Minye Wu · Haizhao Dai · Kaixin Yao · Jingyi Yu · Tinne Tuytelaars
|
ExHall D Poster #33 | |
SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Poster Session 1
Chunlin Yu · Hanqing Wang · Ye Shi · Haoyang Luo · Sibei Yang · Jingyi Yu · Jingya Wang
|
ExHall D Poster #142 | |
HandOS: 3D Hand Reconstruction in One Stage
Poster Session 4
Xingyu Chen · Zhuheng Song · Xiaoke Jiang · Yaoqing Hu · Junzhi Yu · Lei Zhang
|
ExHall D Poster #142 | |
ArtiFade: Learning to Generate High-quality Subject from Blemished Images
Poster Session 3
Shuya Yang · Shaozhe Hao · Yukang Cao · Kwan-Yee K. Wong
|
ExHall D Poster #240 | |
VisionZip: Longer is Better but Not Necessary in Vision Language Models
Poster Session 4
Senqiao Yang · Yukang Chen · Zhuotao Tian · Chengyao Wang · Jingyao Li · Bei Yu · Jiaya Jia
|
ExHall D Poster #380 | |
GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding
Poster Session 1
Yuki Kawana · Shintaro Shiba · Quan Kong · Norimasa Kobori
|
ExHall D Poster #279 | |
RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network
Poster Session 2
Van-Tin Luu · Yong-Lin Cai · Vu-Hoang Tran · Wei-Chen Chiu · Yi-Ting Chen · Ching-Chun Huang
|
ExHall D Poster #127 | |
H-MoRe: Learning Human-centric Motion Representation for Action Analysis
Zhanbo Huang · Xiaoming Liu · Yu Kong
|
ExHall D Poster #156 | |
Reasoning Mamba: Hypergraph-Guided Region Relation Calculating for Weakly Supervised Affordance Grounding
Poster Session 6
Yuxuan Wang · Aming Wu · Muli Yang · Yukuan Min · Yihang Zhu · Cheng Deng
|
ExHall D Poster #139 | |
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Poster Session 3
Junjie Wang · BIN CHEN · Yulin Li · Bin Kang · Yichi Chen · Zhuotao Tian
|
ExHall D Poster #399 | |
DnLUT: Ultra-Efficient Color Image Denoising via Channel-Aware Lookup Tables
Poster Session 2
Sidi Yang · Binxiao Huang · Yulun Zhang · Dahai Yu · Yujiu Yang · Ngai Wong
|
ExHall D Poster #211 | |
PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution
Poster Session 3
Zhu Li Bo · Jianze Li · Haotong Qin · Wenbo Li · Yulun Zhang · Yong Guo · Xiaokang Yang
|
ExHall D Poster #203 | |
Proximal Algorithm Unrolling: Flexible and Efficient Reconstruction Networks for Single-Pixel Imaging
Poster Session 1
Ping Wang · Lishun Wang · Gang Qu · Xiaodong Wang · Yulun Zhang · Xin Yuan
|
ExHall D Poster #23 | |
FrugalNeRF: Fast Convergence for Extreme Few-shot Novel View Synthesis without Learned Priors
Poster Session 3
Chin-Yang Lin · Chung-Ho Wu · Changhan Yeh · Shih Han Yen · Cheng Sun · Yu-Lun Liu
|
ExHall D Poster #55 | |
GCC: Generative Color Constancy via Diffusing a Color Checker
Poster Session 3
Chen-Wei Chang · Cheng-De Fan · Chia-Che Chang · Yi-Chen Lo · Yu-Chee Tseng · Jiun-Long Huang · Yu-Lun Liu
|
ExHall D Poster #20 | |
UniK3D: Universal Camera Monocular 3D Estimation
Poster Session 1
Luigi Piccinelli · Christos Sakaridis · Mattia Segu · Yung-Hsu Yang · Siyuan Li · Wim Abbeloos · Luc Van Gool
|
ExHall D Poster #80 | |
Illumination Spectrum Estimation for Multispectral Images via Surface Reflectance Modeling and Spatial-Spectral Feature Generation
Poster Session 1
Hyejin Oh · Woo-Shik Kim · Sangyoon Lee · YungKyung Park · Jewon Kang
|
ExHall D Poster #192 | |
Solving Instance Detection from an Open-World Perspective
Poster Session 2
Qianqian Shen · Yunhan Zhao · Nahyun Kwon · Jeeeun Kim · Yanan Li · Shu Kong
|
ExHall D Poster #431 | |
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment
Yunhong Lu · Qichao Wang · Hengyuan Cao · Xierui Wang · Xiaoyin Xu · Min Zhang
|
ExHall D Poster #235 | |
STEPS: Sequential Probability Tensor Estimation for Text-to-Image Hard Prompt Search
Poster Session 6
Yuning Qiu · Andong Wang · Chao Li · Haonan Huang · Guoxu Zhou · Qibin Zhao
|
ExHall D Poster #236 | |
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
Poster Session 2
Yunlong Tang · JunJia Guo · Hang Hua · Susan Liang · Mingqian Feng · Xinyang Li · Rui Mao · Chao Huang · Jing Bi · Zeliang Zhang · Pooyan Fazli · Chenliang Xu
|
ExHall D Poster #297 | |
SparseAlign: A Fully Sparse Framework for Cooperative Object Detection
Poster Session 5
Yunshuang Yuan · Yan Xia · Daniel Cremers · Monika Sester
|
ExHall D Poster #119 | |
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach
Poster Session 1
Jing Bi · Lianggong Bruce Wen · Zhang Liu · JunJia Guo · Yunlong Tang · Bingjie Wang · Chenliang Xu
|
ExHall D Poster #378 | |
Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes
Poster Session 1
Yiming Dou · Wonseok Oh · Yuqing Luo · Antonio Loquercio · Andrew Owens
|
ExHall D Poster #151 | |
Magma: A Foundation Model for Multimodal AI Agents
Poster Session 3
Jianwei Yang · Reuben Tan · Qianhui Wu · Ruijie Zheng · Baolin Peng · Yongyuan Liang · Yu Gu · Mu Cai · Seonghyeon Ye · Joel Jang · Yuquan Deng · Jianfeng Gao
|
ExHall D Poster #340 | |
Zero-Shot 4D Lidar Panoptic Segmentation
Poster Session 5
Yushan Zhang · Aljoša Ošep · Laura Leal-Taixe · Tim Meinhardt
|
ExHall D Poster #332 | |
Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation
Poster Session 3
Shivam Duggal · Yushi Hu · Oscar Michel · Aniruddha Kembhavi · William Freeman · Noah A. Smith · Ranjay Krishna · Antonio Torralba · Ali Farhadi · Wei-Chiu Ma
|
ExHall D Poster #255 | |
LidarGait++: Learning Local Features and Size Awareness from LiDAR Point Clouds for 3D Gait Recognition
Poster Session 2
Chuanfu Shen · Rui Wang · Lixin Duan · Shiqi Yu
|
ExHall D Poster #120 | |
Classifier-Free Guidance Inside the Attraction Basin May Cause Memorization
Poster Session 3
Anubhav Jain · Yuya Kobayashi · Takashi Shibuya · Yuhta Takida · Nasir Memon · Julian Togelius · Yuki Mitsufuji
|
ExHall D Poster #212 | |
From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models
Poster Session 1
German Barquero · Nadine Bertsch · Manojkumar Marramreddy · Carlos Chacón · Filippo Arcadu · Ferran Rigual · Nicky Sijia He · Cristina Palmero · Sergio Escalera · Yuting Ye · Robin Kips
|
ExHall D Poster #156 | |
Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
Poster Session 1
Yutong Wang · Jiajie Teng · Jiajiong Cao · Yuming Li · Chenguang Ma · Hongteng Xu · Dixin Luo
|
ExHall D Poster #189 | |
Probabilistic Prompt Distribution Learning for Animal Pose Estimation
Poster Session 6
Jiyong Rao · Brian Nlong Zhao · Yu Wang
|
ExHall D Poster #314 | |
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
Tianyu Yu · Haoye Zhang · Qiming Li · Qixin Xu · Yuan Yao · Da Chen · Xiaoman Lu · Ganqu Cui · Yunkai Dang · Taiwen He · Xiaocheng Feng · Jun Song · Bo Zheng · Zhiyuan Liu · Tat-seng Chua · Maosong Sun
|
ExHall D Poster #399 | |
ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics
Poster Session 6
Junchao Zhu · Ruining Deng · Tianyuan Yao · Juming Xiong · Chongyu Qu · Junlin Guo · Siqi Lu · Mengmeng Yin · Yu Wang · Shilin Zhao · Haichun Yang · Yuankai Huo
|
ExHall D Poster #448 | |
Revisiting Source-Free Domain Adaptation: Insights into Representativeness, Generalization, and Variety
Poster Session 5
Ronghang Zhu · Mengxuan Hu · Weiming Zhuang · Lingjuan Lyu · Xiang Yu · Sheng Li
|
ExHall D Poster #445 | |
MoFlow: One-Step Flow Matching for Human Trajectory Forecasting via Implicit Maximum Likelihood Estimation based Distillation
Poster Session 4
Yuxiang Fu · Qi Yan · Ke Li · Lele Wang · Renjie Liao
|
ExHall D Poster #140 | |
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection
Poster Session 2
Boyong He · Yuxiang Ji · Qianwen Ye · Zhuoyue Tan · Liaoni Wu
|
ExHall D Poster #433 | |
EEE-Bench: A Comprehensive Multimodal Electrical And Electronics Engineering Benchmark
Poster Session 3
Ming Li · Jike Zhong · Tianle Chen · Yuxiang Lai · Konstantinos Psounis
|
ExHall D Poster #256 | |
ACE: Anti-Editing Concept Erasure in Text-to-Image Models
Poster Session 5
Zihao Wang · Yuxiang Wei · Fan Li · Renjing Pei · Hang Xu · Wangmeng Zuo
|
ExHall D Poster #234 | |
Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems
Song Xia · Yi Yu · Wenhan Yang · MEIWEN DING · Zhuo Chen · Ling-Yu Duan · Alex C. Kot · Xudong Jiang
|
ExHall D Poster #323 | |
UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References
Poster Session 1
Ming-Feng Li · Xin Yang · Fu-En Wang · Hritam Basak · Yuyin Sun · Shreekant Gayaka · Min Sun · Cheng-Hao Kuo
|
ExHall D Poster #94 | |
Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances
Poster Session 4
Yi Yu · Botao Ren · Peiyuan Zhang · Mingxin Liu · Junwei Luo · Shaofeng Zhang · Feipeng Da · Junchi Yan · Xue Yang
|
ExHall D Poster #332 | |
g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks
Poster Session 3
Zihan Wang · Gim Hee Lee
|
ExHall D Poster #339 | |
DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation
Poster Session 3
Chun-Hung Wu · Shih-Hong Chen · Chih Yao Hu · Hsin-Yu Wu · Kai-Hsin Chen · Yu-You Chen · Chih-Hai Su · Chih-Kuo Lee · Yu-Lun Liu
|
ExHall D Poster #482 | |
Inference-Scale Complexity in ANN-SNN Conversion for High-Performance and Low-Power Applications
Poster Session 5
Tong Bu · Maohua Li · Zhaofei Yu
|
ExHall D Poster #321 | |
Self-Supervised Learning for Color Spike Camera Reconstruction
Poster Session 2
Yanchen Dong · Ruiqin Xiong · Xiaopeng Fan · Zhaofei Yu · Yonghong Tian · Tiejun Huang
|
ExHall D Poster #76 | |
USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting
Kang Chen · Jiyuan Zhang · Zecheng Hao · Yajing Zheng · Tiejun Huang · Zhaofei Yu
|
ExHall D Poster #74 | |
vesselFM: A Foundation Model for Universal 3D Blood Vessel Segmentation
Poster Session 4
Bastian Wittmann · Yannick Wattenberg · Tamaz Amiranashvili · Suprosanna Shit · Bjoern Menze
|
ExHall D Poster #482 | |
Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians
Poster Session 6
Changfeng Ma · Ran Bi · Jie Guo · Chongjun Wang · Yanwen Guo
|
ExHall D Poster #108 | |
SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction
Poster Session 2
Xinran Yang · Donghao Ji · Yuanqi Li · Jie Guo · Yanwen Guo · Junyuan Xie
|
ExHall D Poster #33 | |
EdgeMovingNet: Edge-preserving Point Cloud Reconstruction via Joint Geometry Features
Poster Session 5
Xinran Yang · Donghao Ji · Yuanqi Li · Junyuan Xie · Jie Guo · Yanwen Guo
|
ExHall D Poster #105 | |
CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification
Poster Session 1
wenlong yu · Qilong Wang · Chuang Liu · Dong Li · Qinghua Hu
|
ExHall D Poster #403 | |
VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding
Poster Session 1
Kangsan Kim · Geon Park · Youngwan Lee · Woongyeong Yeo · Sung Ju Hwang
|
ExHall D Poster #299 | |
On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events
Poster Session 4
Jesse Hagenaars · Yilun Wu · Federico Paredes Valles · Stein Stroobants · Guido De Croon
|
ExHall D Poster #124 | |
Active Data Curation Effectively Distills Large-Scale Multimodal Models
Poster Session 3
Vishaal Udandarao · Nikhil Parthasarathy · Muhammad Ferjad Naeem · Talfan Evans · Samuel Albanie · Federico Tombari · Yongqin Xian · Alessio Tonioni · Olivier J Henaff
|
ExHall D Poster #361 | |
Model Poisoning Attacks to Federated Learning via Multi-Round Consistency
Poster Session 3
Yueqi Xie · Minghong Fang · Neil Zhenqiang Gong
|
ExHall D Poster #460 | |
Attraction Diminishing and Distributing for Few-Shot Class-Incremental Learning
Poster Session 5
Li-Jun Zhao · Zhen-Duo Chen · Yongxin Wang · Xin Luo · Xin-Shun Xu
|
ExHall D Poster #441 | |
EdgeTAM: On-Device Track Anything Model
Poster Session 3
Chong Zhou · Chenchen Zhu · Yunyang Xiong · Saksham Suri · Fanyi Xiao · Lemeng Wu · Raghuraman Krishnamoorthi · Bo Dai · Chen Change Loy · Vikas Chandra · Bilge Soran
|
ExHall D Poster #304 | |
Distilled Prompt Learning for Incomplete Multimodal Survival Prediction
Poster Session 1
Yingxue Xu · Fengtao ZHOU · Chenyu Zhao · Yihui Wang · Can Yang · Hao Chen
|
ExHall D Poster #472 | |
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
Poster Session 5
Yifan Liu · Keyu Fan · Weihao Yu · Chenxin Li · Hao Lu · Yixuan Yuan
|
ExHall D Poster #49 | |
BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting
Poster Session 4
Yiren Lu · Yunlai Zhou · Disheng Liu · tuo liang · Yu Yin
|
ExHall D Poster #66 | |
Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model
Poster Session 2
Changchang Sun · Gaowen Liu · Charles Fleming · Yan Yan
|
ExHall D Poster #282 | |
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery
Poster Session 2
Sara Al-Emadi · Yin Yang · Ferda Ofli
|
ExHall D Poster #280 | |
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
Poster Session 1
Yuanbo Yang · Jiahao Shao · Xinyang Li · Yujun Shen · Andreas Geiger · Yiyi Liao
|
ExHall D Poster #258 | |
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Poster Session 5
Jiahao Shao · Yuanbo Yang · Hongyu Zhou · Youmin Zhang · Yujun Shen · Vitor Guizilini · Yue Wang · Matteo Poggi · Yiyi Liao
|
ExHall D Poster #170 | |
Enhanced Visual-Semantic Interaction with Tailored Prompts for Pedestrian Attribute Recognition
Junyi Wu · Yan Huang · Min Gao · Yuzhen Niu · Yuzhong Chen · Qiang Wu
|
ExHall D Poster #400 | |
URWKV: Unified RWKV Model with Multi-state Perspective for Low-light Image Restoration
Poster Session 5
Rui Xu · Yuzhen Niu · Yuezhou Li · Huangbiao Xu · Wenxi Liu · Yuzhong Chen
|
ExHall D Poster #21 | |
Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model
Poster Session 6
Yuting Zhang · Hao Lu · Qingyong Hu · Yin Wang · Kaishen Yuan · Xin Liu · Kaishun Wu
|
ExHall D Poster #295 | |
Seeing A 3D World in A Grain of Sand
Poster Session 3
Yufan Zhang · Yu Ji · Yu Guo · Jinwei Ye
|
ExHall D Poster #51 | |
Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language Model
Poster Session 1
Ziyuan Yang · Yingyu Chen · Zhiwen Wang · Hongming Shan · Yang Chen · Yi Zhang
|
ExHall D Poster #477 | |
DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection
Li Li · Huixian Gong · Hao Dong · Tiankai Yang · Zhengzhong Tu · Yue Zhao
|
ExHall D Poster #459 | |
EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events
Poster Session 4
Shuoyan Wei · Feng Li · Shengeng Tang · Yao Zhao · Huihui Bai
|
ExHall D Poster #186 | |
Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline
Poster Session 2
Yuzhi Huang · Chenxin Li · Haitao Zhang · Zixu Lin · yunlong lin · Hengyu Liu · Wuyang Li · Xinyu Liu · Jiechao Gao · Yue Huang · Xinghao Ding · Yixuan Yuan
|
ExHall D Poster #317 | |
STEP: Enhancing Video-LLMs’ Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training
Poster Session 1
Haiyi Qiu · Minghe Gao · Long Qian · Kaihang Pan · Qifan Yu · Juncheng Li · Wenjie Wang · Siliang Tang · Yueting Zhuang · Tat-seng Chua
|
ExHall D Poster #298 | |
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation
Poster Session 5
Aishik Konwer · Zhijian Yang · Erhan Bas · Cao Xiao · Prateek Prasanna · Parminder Bhatia · Taha Kass-Hout
|
ExHall D Poster #456 | |
Distraction is All You Need for Multimodal Large Language Model Jailbreaking
Zuopeng Yang · Jiluan Fan · Anli Yan · Erdun Gao · Xin Lin · Tao Li · Kanghua Mo · Changyu Dong
|
ExHall D Poster #390 | |
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
Poster Session 1
Qianlong Xiang · Miao Zhang · Yuzhang Shang · Jianlong Wu · Yan Yan · Liqiang Nie
|
ExHall D Poster #267 | |
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Poster Session 3
Kai Wang · Mingjia Shi · YuKun Zhou · Zekai Li · Xiaojiang Peng · Zhihang Yuan · Yuzhang Shang · Hanwang Zhang · Yang You
|
ExHall D Poster #218 | |
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
Poster Session 4
Shengqu Cai · Eric Ryan Chan · Yunzhi Zhang · Leonidas Guibas · Jiajun Wu · Gordon Wetzstein
|
ExHall D Poster #254 | |
EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
Poster Session 6
Haotian Wang · Yuzhe Weng · Yueyan Li · Zilu Guo · Jun Du · Shutong Niu · Jiefeng Ma · Shan He · Wu Xiaoyan · Qiming Hu · Bing Yin · Cong Liu · Qingfeng Liu
|
ExHall D Poster #1 | |
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
Poster Session 3
Jiahe Li · Jiawei Zhang · Xiao Bai · Jin Zheng · Jun Zhou · Lin Gu
|
ExHall D Poster #4 | |
Online Video Understanding: OVBench and VideoChat-Online
Poster Session 1
Zhenpeng Huang · Xinhao Li · Jiaqi Li · Jing Wang · Xiangyu Zeng · Cheng Liang · Tao Wu · Xi Chen · Liang Li · Limin Wang
|
ExHall D Poster #302 | |
Prototype-Based Image Prompting for Weakly Supervised Histopathological Image Segmentation
Poster Session 6
Qingchen Tang · Lei Fan · Maurice Pagnucco · Yang Song
|
ExHall D Poster #395 | |
Interpretable Image Classification via Non-parametric Part Prototype Learning
Poster Session 2
Zhijie Zhu · Lei Fan · Maurice Pagnucco · Yang Song
|
ExHall D Poster #418 | |
Soft Self-labeling and Potts Relaxations for Weakly-supervised Segmentation
Poster Session 4
Zhongwen Zhang · Yuri Boykov
|
ExHall D Poster #423 | |
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research
Poster Session 4
James Burgess · Jeffrey J Nirschl · Laura Bravo-Sánchez · Alejandro Lozano · Sanket Rajan Gupte · Jesus G. Galaz-Montoya · Yuhui Zhang · Yuchang Su · Disha Bhowmik · Zachary Coman · Sarina M. Hasan · Alexandra Johannesson · William D. Leineweber · Malvika G Nair · Ridhi Yarlagadda · Connor Zuraski · Wah Chiu · Sarah Cohen · Jan N. Hansen · Manuel D Leonetti · Chad Liu · Emma Lundberg · Serena Yeung
|
ExHall D Poster #357 | |
DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery
Poster Session 1
Jing Gao · Ce Zheng · Laszlo Jeni · Zackory Erickson
|
ExHall D Poster #154 | |
MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion
Poster Session 5
Zador Pataki · Paul-Edouard Sarlin · Johannes Schönberger · Marc Pollefeys
|
ExHall D Poster #80 | |
Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality
Poster Session 2
Liyan Chen · Gregory P. Meyer · Zaiwei Zhang · Eric M. Wolff · Paul Vernaza
|
ExHall D Poster #117 | |
GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction
Poster Session 5
Li Zhang · mingliang xu · Jianan Wang · Qiaojun Yu · Lixin Yang · Yonglu Li · Cewu Lu · Rujing Wang · Liu Liu
|
ExHall D Poster #150 | |
MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities
Poster Session 3
Federico Lincetto · Gianluca Agresti · Mattia Rossi · Pietro Zanuttigh
|
ExHall D Poster #29 | |
MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Action Anticipation
Poster Session 1
Olga Zatsarynna · Emad Bahrami · Yazan Abu Farha · Gianpiero Francesca · Jürgen Gall
|
ExHall D Poster #312 | |
SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction
Poster Session 3
Enrico Pallotta · Sina Mokhtarzadeh Azar · Shuai Li · Olga Zatsarynna · Jürgen Gall
|
ExHall D Poster #300 | |
Morpheus: Text-Driven 3D Gaussian Splat Shape and Color Stylization
Poster Session 2
Jamie Wynn · Zawar Qureshi · Jakub Powierza · Jamie Watson · Mohamed Sayed
|
ExHall D Poster #235 | |
CroCoDL: Cross-device Collaborative Dataset for Localization
Poster Session 6
Hermann Blum · Alessandro Mercurio · Joshua O'Reilly · Tim Engelbracht · Mihai Dusmanu · Marc Pollefeys · Zuria Bauer
|
ExHall D Poster #121 | |
HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion
Poster Session 2
Yifang Xu · BenXiang Zhai · Yunzhuo Sun · Ming Li · Yang Li · Sidan Du
|
ExHall D Poster #17 | |
UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models
Yuning Han · Bingyin Zhao · Rui Chu · Feng Luo · Biplab Sikdar · Yingjie Lao
|
ExHall D Poster #323 | |
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning
Poster Session 2
Bozhou Zhang · Nan Song · Xin Jin · Li Zhang
|
ExHall D Poster #142 | |
4Deform: Neural Surface Deformation for Robust Shape Interpolation
Poster Session 2
Lu Sang · Zehranaz Canfes · Dongliang Cao · Riccardo Marin · Florian Bernard · Daniel Cremers
|
ExHall D Poster #111 | |
Scaling Properties of Diffusion Models For Perceptual Tasks
Poster Session 3
Rahul Ravishankar · Zeeshan Patel · Jathushan Rajasegaran · Jitendra Malik
|
ExHall D Poster #219 | |
Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions
Poster Session 5
Chan Hur · Jeong-hun Hong · Dong-hun Lee · Dabin Kang · Semin Myeong · Sang-hyo Park · Hyeyoung Park
|
ExHall D Poster #292 | |
SceneCrafter: Controllable Multi-View Driving Scene Editing
Poster Session 2
Zehao Zhu · Yuliang Zou · Chiyu “Max” Jiang · Bo Sun · Vincent Casser · XIUKUN HUANG · Jiahao Wang · Zhenpei Yang · Ruiqi Gao · Leonidas Guibas · Mingxing Tan · Dragomir Anguelov
|
ExHall D Poster #138 | |
ZeroVO: Visual Odometry with Minimal Assumptions
Poster Session 4
Lei Lai · Zekai Yin · Eshed Ohn-Bar
|
ExHall D Poster #122 | |
GPAvatar: High-fidelity Head Avatars by Learning Efficient Gaussian Projections
Poster Session 1
Weiqi Feng · Dong Han · Zekang Zhou · Shunkai Li · Xiaoqiang Liu · Pengfei Wan · Di ZHANG · Miao Wang
|
ExHall D Poster #8 | |
Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation
Poster Session 3
Zelin Peng · Zhengqin Xu · Zhilin Zeng · Yu Huang · Yaoming Wang · Wei Shen
|
ExHall D Poster #417 | |
Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space
Poster Session 1
Zelin Peng · Zhengqin Xu · Zhilin Zeng · Changsong Wen · Yu Huang · Menglin Yang · feilong tang · Wei Shen
|
ExHall D Poster #421 | |
Star with Bilinear Mapping
Poster Session 5
Zelin Peng · Yu Huang · Zhengqin Xu · feilong tang · Ming Hu · Xiaokang Yang · Wei Shen
|
ExHall D Poster #406 | |
Domain Generalization in CLIP via Learning with Diverse Text Prompts
Poster Session 2
Changsong Wen · Zelin Peng · Yu Huang · Xiaokang Yang · Wei Shen
|
ExHall D Poster #399 | |
Retaining Knowledge and Enhancing Long-Text Representations in CLIP through Dual-Teacher Distillation
Poster Session 5
Yuheng Feng · Changsong Wen · Zelin Peng · Li jiaye · Siyu Zhu
|
ExHall D Poster #369 | |
Visual Consensus Prompting for Co-Salient Object Detection
Poster Session 2
Jie Wang · Nana Yu · Zihao Zhang · Yahong Han
|
ExHall D Poster #402 | |
Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge
Poster Session 5
Yaqi Zhao · Yuanyang Yin · Lin Li · Mingan Lin · Victor Shea-Jay Huang · Siwei Chen · Weipeng Chen · Baoqun Yin · Zenan Zhou · Wentao Zhang
|
ExHall D Poster #374 | |
Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking
Poster Session 4
You Wu · Xucheng Wang · Xiangyang Yang · Mengyuan Liu · Dan Zeng · Hengzhou Ye · Shuiwang Li
|
ExHall D Poster #123 | |
Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space
Poster Session 1
Yifan Zhou · Zeqi Xiao · Shuai Yang · Xingang Pan
|
ExHall D Poster #214 | |
MEET: Towards Memory-Efficient Temporal Sparse Deep Neural Networks
Poster Session 6
Zeqi Zhu · Ibrahim Batuhan Akkaya · Luc Waeijen · Egor Bondarev · Arash Pourtaherian · Orlando Moreira
|
ExHall D Poster #302 | |
Gaussian Splashing: Unified Particles for Versatile Motion Synthesis and Rendering
Poster Session 1
Yutao Feng · Xiang Feng · Yintong Shang · Ying Jiang · Chang Yu · Zeshun Zong · Tianjia Shao · Hongzhi Wu · Kun Zhou · Chenfanfu Jiang · Yin Yang
|
ExHall D Poster #33 | |
Population Normalization for Federated Learning
Poster Session 2
Zhuoyao Wang · Fan Yi · Peizhu Gong · Caitou He · Cheng Jin · Weizhong Zhang
|
ExHall D Poster #461 | |
Generating 3D-Consistent Videos from Unposed Internet Photos
Poster Session 6
Gene Chou · Kai Zhang · Sai Bi · Hao Tan · Zexiang Xu · Fujun Luan · Bharath Hariharan · Noah Snavely
|
ExHall D Poster #168 | |
Turbo3D: Ultra-fast Text-to-3D Generation
Poster Session 5
Hanzhe Hu · Tianwei Yin · Fujun Luan · Yiwei Hu · Hao Tan · Zexiang Xu · Sai Bi · Shubham Tulsiani · Kai Zhang
|
ExHall D Poster #252 | |
GenAssets: Generating in-the-wild 3D Assets in Latent Space
Poster Session 5
Ze Yang · Jingkang Wang · Haowei Zhang · Sivabalan Manivasagam · Yun Chen · Raquel Urtasun
|
ExHall D Poster #128 | |
CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset
Poster Session 1
Xiao Wang · Fuling Wang · Yuehang Li · Qingchuan Ma · Shiao Wang · Bo Jiang · Jin Tang
|
ExHall D Poster #474 | |
Context-Aware Multimodal Pretraining
Karsten Roth · Zeynep Akata · Dima Damen · Ivana Balazevic · Olivier J Henaff
|
ExHall D Poster #391 | |
FLAIR: VLM with Fine-grained Language-informed Image Representations
Poster Session 5
Rui Xiao · Sanghwan Kim · Iuliana Georgescu · Zeynep Akata · Stephan Alaniz
|
ExHall D Poster #368 | |
How to Merge Your Multimodal Models Over Time?
Poster Session 4
Sebastian Dziadzio · Vishaal Udandarao · Karsten Roth · Ameya Prabhu · Zeynep Akata · Samuel Albanie · Matthias Bethge
|
ExHall D Poster #445 | |
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
Poster Session 3
Sanghwan Kim · Rui Xiao · Iuliana Georgescu · Stephan Alaniz · Zeynep Akata
|
ExHall D Poster #387 | |
FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification
Poster Session 3
Zhengrui Guo · Conghao Xiong · Jiabo MA · Qichen Sun · Lishuang Feng · Jinzhuo Wang · Hao Chen
|
ExHall D Poster #473 | |
Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration
Poster Session 6
Jiani Ni · He Zhao · Jintong Gao · Dandan Guo · Hongyuan Zha
|
ExHall D Poster #437 | |
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM
Poster Session 4
Wang Jiarui · Huiyu Duan · Guangtao Zhai · Juntong Wang · Xiongkuo Min
|
ExHall D Poster #294 | |
Shadow Generation Using Diffusion Model with Geometry Prior
Poster Session 2
Haonan Zhao · Qingyang Liu · Xinhao Tao · Li Niu · Guangtao Zhai
|
ExHall D Poster #213 | |
FineVQ: Fine-Grained User Generated Content Video Quality Assessment
Huiyu Duan · Qiang Hu · Wang Jiarui · Liu Yang · Zitong Xu · Lu Liu · Xiongkuo Min · Chunlei Cai · Tianxiao Ye · Xiaoyun Zhang · Guangtao Zhai
|
ExHall D Poster #291 | |
MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification
Poster Session 4
Jianwei Zhao · XIN LI · Fan Yang · Qiang Zhai · Ao Luo · Yang Zhao · Hong Cheng · Huazhu Fu
|
ExHall D Poster #474 | |
Towards Training-free Anomaly Detection with Vision and Language Foundation Models
Poster Session 3
Jinjin Zhang · Guodong Wang · yizhou jin · Di Huang
|
ExHall D Poster #436 | |
Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation
Poster Session 2
Ziheng Zhang · Jianyang Gu · Arpita Chowdhury · Zheda Mai · David Carlyn · Tanya Berger-Wolf · Yu Su · Wei-Lun Chao
|
ExHall D Poster #404 | |
Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis
Poster Session 1
Arpita Chowdhury · Dipanjyoti Paul · Zheda Mai · Jianyang Gu · Ziheng Zhang · Kazi Sajeed Mehrab · Elizabeth Campolongo · Daniel Rubenstein · Charles Stewart · Anuj Karpatne · Tanya Berger-Wolf · Yu Su · Wei-Lun Chao
|
ExHall D Poster #404 | |
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
Poster Session 4
Jiaqi Liu · Jichao Zhang · Paolo Rota · Nicu Sebe
|
ExHall D Poster #15 | |
Diff-Palm: Realistic Palmprint Generation with Polynomial Creases and Intra-Class Variation Controllable Diffusion Models
Poster Session 6
Jianlong Jin · Chenglong Zhao · Ruixin Zhang · Sheng Shang · Jianqing Xu · Jingyun Zhang · ShaoMing Wang · Yang Zhao · Shouhong Ding · Wei Jia · Yunsheng Wu
|
ExHall D Poster #17 | |
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
Poster Session 5
Shangzhan Zhang · Jianyuan Wang · Yinghao Xu · Nan Xue · Christian Rupprecht · Xiaowei Zhou · Yujun Shen · Gordon Wetzstein
|
ExHall D Poster #84 | |
Nested Diffusion Models Using Hierarchical Latent Priors
Poster Session 1
Xiao Zhang · Ruoxi Jiang · Rebecca Willett · Michael Maire
|
ExHall D Poster #224 | |
High-Fidelity Lightweight Mesh Reconstruction from Point Clouds
Chen Zhang · Wentao Wang · Ximeng Li · Xinyao Liao · Wanjuan Su · Wenbing Tao
|
ExHall D Poster #105 | |
CADDreamer: CAD Object Generation from Single-view Images
Yuan Li · Cheng Lin · Yuan Liu · Xiaoxiao Long · Chenxu Zhang · Ningna Wang · Xin Li · Wenping Wang · Xiaohu Guo
|
ExHall D Poster #38 | |
Rethinking the Adversarial Robustness of Multi-Exit Neural Networks in an Attack-Defense Game
Poster Session 2
Keyizhi Xu · Chi Zhang · Zhan Chen · Zhongyuan Wang · Chunxia Xiao · Chao Liang
|
ExHall D Poster #466 | |
MFogHub: Bridging Multi-Regional and Multi-Satellite Data for Global Marine Fog Detection and Forecasting
Poster Session 3
Mengqiu XU · Kaixin Chen · Heng Guo · Yixiang Huang · Ming Wu · Zhenwei Shi · Chuang Zhang · Jun Guo
|
ExHall D Poster #190 | |
Localizing Events in Videos with Multimodal Queries
Poster Session 1
Gengyuan Zhang · Mang Ling Ada Fok · Jialu Ma · Yan Xia · Philip H.S. Torr · Daniel Cremers · Volker Tresp · Jindong Gu
|
ExHall D Poster #303 | |
FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models
Poster Session 6
Haokun Chen · Hang Li · Yao Zhang · Jinhe Bi · Gengyuan Zhang · Yueqi Zhang · Philip H.S. Torr · Jindong Gu · Denis Krompaß · Volker Tresp
|
ExHall D Poster #411 | |
From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting
Poster Session 6
Zhiwei Huang · Hailin Yu · Yichun Shentu · Jin Yuan · Guofeng Zhang
|
ExHall D Poster #87 | |
EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space
Jianrong Zhang · Hehe Fan · Yi Yang
|
ExHall D Poster #171 | |
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion
Poster Session 3
Xiyue Guo · Jiarui Hu · Junjie Hu · Hujun Bao · Guofeng Zhang
|
ExHall D Poster #124 | |
StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation
Poster Session 6
Shangjin Zhai · Zhichao Ye · Jialin Liu · Weijian Xie · Jiaqi Hu · Zhen Peng · Hua Xue · Danpeng Chen · Xiaomeng Wang · Lei Yang · Nan Wang · Haomin Liu · Guofeng Zhang
|
ExHall D Poster #65 | |
ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping
Youxin Pang · Ruizhi Shao · Jiajun Zhang · Hanzhang Tu · Yun Liu · Boyao Zhou · Hongwen Zhang · Yebin Liu
|
ExHall D Poster #150 | |
Interpreting Object-level Foundation Models via Visual Precision Search
Poster Session 6
Ruoyu Chen · Siyuan Liang · Jingzhi Li · Shiming Liu · Maosen Li · Zhen Huang · Hua Zhang · Xiaochun Cao
|
ExHall D Poster #372 | |
Query Efficient Black-Box Visual Prompting with Subspace Learning
Poster Session 1
Haozhen Zhang · Zhaogeng Liu · Hualin Zhang · Xingchen Li · Wanli Shi · Bin Gu · Yi Chang
|
ExHall D Poster #399 | |
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity
Huaxin Zhang · Xiaohao Xu · Xiang Wang · Jialong Zuo · Xiaonan Huang · Changxin Gao · Shanjun Zhang · Li Yu · Nong Sang
|
ExHall D Poster #305 | |
Few-Shot Recognition via Stage-Wise Retrieval-Augmented Finetuning
Poster Session 3
Tian Liu · Huixin Zhang · Shubham Parashar · Shu Kong
|
ExHall D Poster #425 | |
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images
Poster Session 3
Jie Mei · Chenyu Lin · Yu Qiu · Yaonan Wang · Hui Zhang · Ziyang Wang · Dong Dai
|
ExHall D Poster #479 | |
Object-Shot Enhanced Grounding Network for Egocentric Video
Poster Session 5
Yisen Feng · Haoyu Zhang · Meng Liu · Weili Guan · Liqiang Nie
|
ExHall D Poster #303 | |
InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception
Poster Session 3
Haijie Li · Yanmin Wu · Jiarui Meng · Qiankun Gao · Zhiyao Zhang · Ronggang Wang · Jian Zhang
|
ExHall D Poster #328 | |
Secret Lies in Color: Enhancing AI-Generated Images Detection with Color Distribution Analysis
Poster Session 3
Zexi Jia · Chuanwei Huang · Yeshuang Zhu · Hongyan Fei · Xiaoyue Duan · Yuan Zhiqiang · Ying Deng · Jiapei Zhang · Jinchao Zhang · Jie Zhou
|
ExHall D Poster #267 | |
SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments
Poster Session 5
Yue Cao · Yun Xing · Jie Zhang · Di Lin · Tianwei Zhang · Ivor Tsang · Yang Liu · Qing Guo
|
ExHall D Poster #383 | |
AVF-MAE++: Scaling Affective Video Facial Masked Autoencoders via Efficient Audio-Visual Self-Supervised Learning
Poster Session 2
Xuecheng Wu · Heli Sun · Yifan Wang · Jiayu Nie · Jie Zhang · Yabing Wang · Junxiao Xue · Liang He
|
ExHall D Poster #360 | |
Empowering LLMs to Understand and Generate Complex Vector Graphics
Poster Session 4
XiMing Xing · Juncheng Hu · Guotao Liang · Jing Zhang · Dong Xu · Qian Yu
|
ExHall D Poster #351 | |
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
Yuan Zhou · Qingshan Xu · Jiequan Cui · Junbao Zhou · Jing Zhang · Richang Hong · Hanwang Zhang
|
ExHall D Poster #413 | |
HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos
Jinglei Zhang · Jiankang Deng · Chao Ma · Rolandos Alexandros Potamias
|
ExHall D Poster #152 | |
Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning
Poster Session 4
Xiangtao Zhang · Sheng Li · Ao Li · Yipeng Liu · Fan Zhang · Ce Zhu · Le Zhang
|
ExHall D Poster #459 | |
FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation
Kefan Chen · Chaerin Min · Linguang Zhang · Shreyas Hampali · Cem Keskin · Srinath Sridhar
|
ExHall D Poster #158 | |
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos
Prithviraj Banerjee · Sindi Shkodrani · Pierre Moulon · Shreyas Hampali · Shangchen Han · Fan Zhang · Linguang Zhang · Jade Fountain · Edward Miller · Selen Basol · Richard Newcombe · Robert Wang · Jakob Engel · Tomas Hodan
|
ExHall D Poster #163 | |
SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons
Yuanyou Xu · Zongxin Yang · Yi Yang
|
ExHall D Poster #14 | |
Learning Dynamic Collaborative Network for Semi-supervised 3D Vessel Segmentation
Poster Session 2
Jiao Xu · Xin Chen · Lihe Zhang
|
ExHall D Poster #483 | |
Weakly Supervised Contrastive Adversarial Training for Learning Robust Features from Semi-supervised Data
Poster Session 5
Lilin Zhang · Chengpei Wu · Ning Yang
|
ExHall D Poster #448 | |
Neural Hierarchical Decomposition for Single Image Plant Modeling
Poster Session 1
Zhihao Liu · Zhanglin Cheng · Naoto Yokoya
|
ExHall D Poster #53 | |
Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning
Poster Session 5
Qianli Ma · Xuefei Ning · Dongrui Liu · Li Niu · Linfeng Zhang
|
ExHall D Poster #212 | |
Diffusion-based Event Generation for High-Quality Image Deblurring
Poster Session 1
Xinan Xie · Qing Zhang · Wei-Shi Zheng
|
ExHall D Poster #190 | |
RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images
Poster Session 2
Junjin Xiao · Qing Zhang · Yongwei Nie · Lei Zhu · Wei-Shi Zheng
|
ExHall D Poster #51 | |
Weakly Supervised Temporal Action Localization via Dual-Prior Collaborative Learning Guided by Multimodal Large Language Models
Poster Session 5
Quan Zhang · Jinwei Fang · Rui Yuan · Xi Tang · Yuxin Qi · Ke Zhang · Chun Yuan
|
ExHall D Poster #298 | |
Towards All-in-One Medical Image Re-Identification
Poster Session 6
Yuan Tian · Kaiyuan Ji · Rongzhao Zhang · Yankai Jiang · Chunyi Li · Xiaosong Wang · Guangtao Zhai
|
ExHall D Poster #443 | |
WISNet: Pseudo Label Generation on Unbalanced and Patch Annotated Waste Images
Poster Session 3
Shifan Zhang · Hongzi Zhu · Yinan He · Minyi Guo · Ziyang Lou · Shan Chang
|
ExHall D Poster #424 | |
Multi-modal Vision Pre-training for Medical Image Analysis
Poster Session 1
Shaohao Rui · Lingzhi Chen · Zhenyu Tang · Lilong Wang · Mianxin Liu · Shaoting Zhang · Xiaosong Wang
|
ExHall D Poster #478 | |
Rethinking Correspondence-based Category-Level Object Pose Estimation
Poster Session 1
Huan Ren · Wenfei Yang · Shifeng Zhang · Tianzhu Zhang
|
ExHall D Poster #93 | |
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset
Poster Session 6
Xiao Wang · Yu Jin · Wentao Wu · Wei Zhang · Lin Zhu · Bo Jiang · Yonghong Tian
|
ExHall D Poster #303 | |
Dinomaly: The Less Is More Philosophy in Multi-Class Unsupervised Anomaly Detection
Poster Session 4
Jia Guo · Shuai Lu · Weihang Zhang · Fang Chen · Hongen Liao · Huiqi Li
|
ExHall D Poster #438 | |
DreamText: High Fidelity Scene Text Synthesis
Poster Session 6
Yibin Wang · Weizhong Zhang · honghui xu · Cheng Jin
|
ExHall D Poster #228 | |
MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction
Poster Session 5
Wenyuan Zhang · Yixiao Yang · Han Huang · Liang Han · Kanle Shi · Yu-Shen Liu · Zhizhong Han
|
ExHall D Poster #56 | |
UMFN: Unified Multi-Domain Face Normalization for Joint Cross-domain Prototype Learning and Heterogeneous Face Recognition
Poster Session 6
Meng Pang · Wenjun Zhang · Nanrun Zhou · Shengbo Chen · Hong Rao
|
ExHall D Poster #301 | |
SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors
Poster Session 2
Yufan Wu · Xuanhong Chen · Wen Li · Shunran Jia · Hualiang Wei · Kairui Feng · Jialiang CHEN · Yuhan Li · Ang He · Weimin Zhang · Bingbing Ni · Wenjun Zhang
|
ExHall D Poster #12 | |
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark
Poster Session 5
Ronghao Dang · Yuqian Yuan · Wenqi Zhang · Yifei Xin · Boqiang Zhang · Long Li · Liuyi Wang · qinyang zeng · Xin Li · Lidong Bing
|
ExHall D Poster #341 | |
Graph-Embedded Structure-Aware Perceptual Hashing for Neural Network Protection and Piracy Detection
Poster Session 4
Ruiheng Liu · Haozhe Chen · Boyao Zhao · Kejiang Chen · Weiming Zhang
|
ExHall D Poster #416 | |
Simulator HC: Regression-based Online Simulation of Starting Problem-Solution Pairs for Homotopy Continuation in Geometric Vision
Xinyue Zhang · Zijia Dai · Wanting Xu · Laurent Kneip
|
ExHall D Poster #91 | |
SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer
Poster Session 6
Hongda Liu · Longguang Wang · Ye Zhang · Ziru YU · Yulan Guo
|
ExHall D Poster #220 | |
Progressive Correspondence Regenerator for Robust 3D Registration
Poster Session 1
Guiyu Zhao · Sheng Ao · Ye Zhang · Kai Xu · Yulan Guo
|
ExHall D Poster #97 | |
CoA: Towards Real Image Dehazing via Compression-and-Adaptation
Poster Session 3
Long Ma · Yuxin Feng · Yan Zhang · Jinyuan Liu · Weimin Wang · Guang-Yong Chen · Chengpei Xu · Zhuo Su
|
ExHall D Poster #52 | |
Continuous 3D Perception Model with Persistent State
Poster Session 3
Qianqian Wang · Yifei Zhang · Aleksander Holynski · Alexei A. Efros · Angjoo Kanazawa
|
ExHall D Poster #77 | |
DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
Poster Session 4
Leqi Shen · Guoqiang Gong · Tianxiang Hao · Tao He · Yifeng Zhang · Pengzhang Liu · Sicheng Zhao · Jungong Han · Guiguang Ding
|
ExHall D Poster #372 | |
See Further When Clear: Curriculum Consistency Model
Poster Session 4
Yunpeng Liu · Boxiao Liu · Yi Zhang · Xingzhong Hou · Guanglu Song · Yu Liu · Haihang You
|
ExHall D Poster #219 | |
Cross-Rejective Open-Set SAR Image Registration
Poster Session 5
Shasha Mao · Shiming Lu · Zhaolong Du · Licheng Jiao · Shuiping Gou · Luntian Mou · Xuequan Lu · Lin Xiong · Yimeng Zhang
|
ExHall D Poster #187 | |
Unsupervised Continual Domain Shift Learning with Multi-Prototype Modeling
Haopeng Sun · Yingwei Zhang · Lumin Xu · Sheng Jin · Ping Luo · Chen Qian · Wentao Liu · Yiqiang Chen
|
ExHall D Poster #453 | |
Enhanced then Progressive Fusion with View Graph for Multi-View Clustering
Poster Session 3
Zhibin Dong · Meng Liu · Siwei Wang · KE LIANG · Yi Zhang · Suyuan Liu · Jiaqi Jin · Xinwang Liu · En Zhu
|
ExHall D Poster #466 | |
Large-scale Multi-view Tensor Clustering with Implicit Linear Kernels
Poster Session 4
Jiyuan Liu · Xinwang Liu · chuankun Li · Xinhang Wan · Hao Tan · Yi Zhang · Weixuan Liang · Qian Qu · Yu Feng · Renxiang Guan · KE LIANG
|
ExHall D Poster #468 | |
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Wenbo Hu · Xiangjun Gao · Xiaoyu Li · Sijie Zhao · Xiaodong Cun · Yong Zhang · Long Quan · Ying Shan
|
ExHall D Poster #171 | |
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction
Poster Session 6
Yuanhui Huang · Amonnut Thammatadatrakoon · Wenzhao Zheng · Yunpeng Zhang · Dalong Du · Jiwen Lu
|
ExHall D Poster #126 | |
Improving Accuracy and Calibration via Differentiated Deep Mutual Learning
Poster Session 5
Han Liu · Peng Cui · Bingning Wang · Weipeng Chen · Yupeng Zhang · Jun Zhu · Xiaolin Hu
|
ExHall D Poster #459 | |
Infinity∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Poster Session 4
Jian Han · Jinlai Liu · Yi Jiang · Bin Yan · Yuqi Zhang · Zehuan Yuan · BINGYUE PENG · Xiaobing Liu
|
ExHall D Poster #248 | |
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Poster Session 1
Zhenghao Zhang · Junchao Liao · Menghao Li · Zuozhuo Dai · Bingxue Qiu · Siyu Zhu · Long Qin · Weizhi Wang
|
ExHall D Poster #178 | |
ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos
Poster Session 5
Zetong Zhang · Manuel Kaufmann · Lixin Xue · Jie Song · Martin R. Oswald
|
ExHall D Poster #74 | |
Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction
Poster Session 2
Yuanbo Wang · Zhaoxuan Zhang · Jiajin Qiu · Dilong Sun · Zhengyu Meng · Xiaopeng Wei · Xin Yang
|
ExHall D Poster #20 | |
NightAdapter: Learning a Frequency Adapter for Generalizable Night-time Scene Segmentation
Poster Session 5
Qi Bi · Jingjun Yi · Huimin Huang · Hao Zheng · Haolan Zhan · Yawen Huang · Yuexiang Li · Xian Wu · Yefeng Zheng
|
ExHall D Poster #270 | |
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
Poster Session 3
Zhanhao Liang · Yuhui Yuan · Shuyang Gu · Bohan CHEN · Tiankai Hang · Mingxi Cheng · Ji Li · Liang Zheng
|
ExHall D Poster #243 | |
ActiveGAMER: Active GAussian Mapping through Efficient Rendering
Poster Session 4
Liyan Chen · Huangying Zhan · Kevin Chen · Xiangyu Xu · Qingan Yan · Changjiang Cai · Yi Xu
|
ExHall D Poster #62 | |
LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation
Poster Session 1
Chenxu Zhou · Lvchang Fu · Sida Peng · Yunzhi Yan · Zhanhua Zhang · chen yong · Jiazhi Xia · Xiaowei Zhou
|
ExHall D Poster #128 | |
Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting
Wei Lin · Chenyang ZHAO · Antoni B. Chan
|
ExHall D Poster #307 | |
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
Poster Session 4
Yuqian Yuan · Hang Zhang · Wentong Li · Zesen Cheng · Boqiang Zhang · Long Li · Xin Li · Deli Zhao · Wenqiao Zhang · Yueting Zhuang · Jianke Zhu · Lidong Bing
|
ExHall D Poster #303 | |
From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification
Poster Session 2
Yan Jiang · Hao Yu · Xu Cheng · Haoyu Chen · Zhaodong Sun · Guoying Zhao
|
ExHall D Poster #330 | |
Supervising Sound Localization by In-the-wild Egomotion
Anna Min · Ziyang Chen · Hang Zhao · Andrew Owens
|
ExHall D Poster #279 | |
PhysGen3D: Crafting a Miniature Interactive World from a Single Image
Poster Session 2
Boyuan Chen · Hanxiao Jiang · Shaowei Liu · Saurabh Gupta · Yunzhu Li · Hao Zhao · Shenlong Wang
|
ExHall D Poster #71 | |
StickMotion: Generating 3D Human Motions by Drawing a Stickman
Poster Session 3
Tao Wang · Zhihua Wu · Qiaozhi He · Jiaming Chu · Ling Qian · Yu Cheng · Junliang Xing · Jian Zhao · Lei Jin
|
ExHall D Poster #164 | |
D^3CTTA: Domain-Dependent Decorrelation for Continual Test-Time Adaption of 3D LiDAR Segmentation
Poster Session 3
Jichun Zhao · Haiyong Jiang · Haoxuan Song · Jun Xiao · Dong Gong
|
ExHall D Poster #118 | |
Harnessing Global-Local Collaborative Adversarial Perturbation for Anti-Customization
Poster Session 3
long xu · Jiakai Wang · Haojie Hao · Haotong Qin · Jiejie Zhao · Xianglong Liu
|
ExHall D Poster #264 | |
Less is More: Efficient Image Vectorization with Adaptive Parameterization
Poster Session 4
Kaibo Zhao · Liang Bao · Yufei Li · Xu Su · Ke Zhang · Xiaotian Qiao
|
ExHall D Poster #225 | |
HeMoRa: Unsupervised Heuristic Consensus Sampling for Robust Point Cloud Registration
Poster Session 1
Shaocheng Yan · Yiming Wang · Kaiyan Zhao · Pengcheng Shi · Zhenjun Zhao · Yongjun Zhang · Jiayuan Li
|
ExHall D Poster #111 | |
FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering
Poster Session 1
Jingqiu Zhou · Lue Fan · Linjiang Huang · Zhaoxiang Zhang · Xiaoyu Shi · Si Liu · Hongsheng Li
|
ExHall D Poster #129 | |
FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes
Poster Session 3
Lue Fan · Hao ZHANG · Qitai Wang · Hongsheng Li · Zhaoxiang Zhang
|
ExHall D Poster #131 | |
MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing
Poster Session 1
Shuo Wang · Wanting Li · Yongcai Wang · Zhaoxin Fan · Zhe Huang · xudong cai · Jian Zhao · Deying Li
|
ExHall D Poster #101 | |
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations
Poster Session 5
Ziqiao Peng · Yanbo Fan · Haoyu Wu · Xuan Wang · Hongyan Liu · Jun He · Zhaoxin Fan
|
ExHall D Poster #1 | |
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset
Zhao Dong · Ka chen · Zhaoyang Lv · Hong-Xing Yu · Yunzhi Zhang · Cheng Zhang · Yufeng Zhu · Stephen Tian · Zhengqin Li · Geordie Moffatt · Sean Christofferson · James Fort · Xiaqing Pan · Mingfei Yan · Jiajun Wu · Carl Ren · Richard Newcombe
|
ExHall D Poster #55 | |
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices
Poster Session 1
Xudong LU · Yinghao Chen · chencheng Chen · Hui Tan · Boheng Chen · yina xie · Rui Hu · Guanxin tan · Renshou Wu · Yan Hu · Yi Zeng · Lei Wu · Liuyang Bian · Zhaoxiong Wang · Long Liu · Yanzhou Yang · Han Xiao · Aojun Zhou · Yafei Wen · Xiaoxin Chen · Shuai Ren · Hongsheng Li
|
ExHall D Poster #379 | |
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
Jianyi Wang · Zhijie Lin · Meng Wei · Yang Zhao · Ceyuan Yang · Chen Change Loy · Lu Jiang
|
ExHall D Poster #187 | |
GS-DiT: Advancing Video Generation with Dynamic 3D Gaussian Fields through Efficient Dense 3D Point Tracking
Poster Session 5
Weikang Bian · Zhaoyang Huang · Xiaoyu Shi · Yijin Li · Fu-Yun Wang · Hongsheng Li
|
ExHall D Poster #63 | |
LIRM: Large Inverse Rendering Model for Progressive Reconstruction of Shape, Materials and View-dependent Radiance Fields
Poster Session 1
Zhengqin Li · Dilin Wang · Ka chen · Zhaoyang Lv · Thu Nguyen-Phuoc · Milim Lee · Jia-Bin Huang · Lei Xiao · Yufeng Zhu · Carl Marshall · Carl Ren · Richard Newcombe · Zhao Dong
|
ExHall D Poster #32 | |
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
Poster Session 1
Lingen Li · Zhaoyang Zhang · Yaowei Li · Jiale Xu · Wenbo Hu · Xiaoyu Li · Weihao Cheng · Jinwei Gu · Tianfan Xue · Ying Shan
|
ExHall D Poster #57 | |
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
Poster Session 2
Minghong Cai · Xiaodong Cun · Xiaoyu Li · Wenze Liu · Zhaoyang Zhang · Yong Zhang · Ying Shan · Xiangyu Yue
|
ExHall D Poster #229 | |
Identifying and Mitigating Position Bias of Multi-image Vision-Language Models
Poster Session 3
Xinyu Tian · Shu Zou · Zhaoyuan Yang · Jing Zhang
|
ExHall D Poster #376 | |
DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding
Geng Li · Jinglin Xu · Yunzhen Zhao · Yuxin Peng
|
ExHall D Poster #356 | |
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Poster Session 4
Hang Zhou · Xinxin Zuo · Rui Ma · Li Cheng
|
ExHall D Poster #333 | |
DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
Poster Session 5
Yuzhong Zhao · Feng Liu · Yue Liu · Mingxiang Liao · Chen GONG · Qixiang Ye · Fang Wan
|
ExHall D Poster #355 | |
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Feng Liu · Shiwei Zhang · Xiaofeng Wang · Yujie Wei · Haonan Qiu · Yuzhong Zhao · Yingya Zhang · Qixiang Ye · Fang Wan
|
ExHall D Poster #190 | |
Scaling Mesh Generation via Compressive Tokenization
Poster Session 3
Haohan Weng · Zibo Zhao · Biwen Lei · Xianghui Yang · Jian Liu · Zeqiang Lai · Zhuo Chen · Liu Yuhong · Jie Jiang · Chunchao Guo · Tong Zhang · Shenghua Gao · C. L. Philip Chen
|
ExHall D Poster #42 | |
Non-Natural Image Understanding with Advancing Frequency-based Vision Encoders
Poster Session 6
w l · Qingsong Wang · Yueying Feng · Shulei Wang · Tao Jin · Zhou Zhao · Fei Wu · Chang Yao · Jingyuan Chen
|
ExHall D Poster #346 | |
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
Poster Session 5
Haifeng Huang · Xinyi Chen · Yilun Chen · Hao Li · Xiaoshen Han · zehan wang · Tai Wang · Jiangmiao Pang · Zhou Zhao
|
ExHall D Poster #141 | |
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
Poster Session 4
Shulei Wang · w l · Hai Huang · Hanting Wang · Sihang Cai · WenKang Han · Tao Jin · Jingyuan Chen · Jiacheng Sun · Jieming Zhu · Zhou Zhao
|
ExHall D Poster #256 | |
Efficient Motion-Aware Video MLLM
Zijia Zhao · Yuqi Huo · Tongtian Yue · Longteng Guo · Haoyu Lu · Bingning Wang · Weipeng Chen · Jing Liu
|
ExHall D Poster #300 | |
PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram
Poster Session 6
Sifan Zhou · Zhihang Yuan · Dawei Yang · Ziyu Zhao · Jian Qian · Xing Hu
|
ExHall D Poster #113 | |
OFER: Occluded Face Expression Reconstruction
Poster Session 6
Pratheba Selvaraju · Victoria Abrevaya · Timo Bolkart · Rick Akkerman · Tianyu Ding · Faezeh Amjadi · Ilya Zharkov
|
ExHall D Poster #80 | |
Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression
Poster Session 3
Xiaoyi Qu · David Aponte · Colby Banbury · Daniel Robinson · Tianyu Ding · Kazuhito Koishida · Ilya Zharkov · Tianyi Chen
|
ExHall D Poster #439 | |
Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning
Poster Session 3
Kunyu Wang · Xueyang Fu · Xin Lu · Chengjie Ge · Chengzhi Cao · Wei Zhai · Zheng-Jun Zha
|
ExHall D Poster #419 | |
QMambaBSR: Burst Image Super-Resolution with Query State Space Model
Poster Session 5
Xin Di · Long Peng · Peizhe Xia · Wenbo Li · Renjing Pei · Yang Wang · Yang Cao · Zheng-Jun Zha
|
ExHall D Poster #192 | |
A Lightweight UDF Learning Framework for 3D Reconstruction Based on Local Shape Functions
Poster Session 1
Jiangbei Hu · Yanggeng Li · Fei Hou · Junhui Hou · Zhebin Zhang · Shengfa Wang · Na Lei · Ying He
|
ExHall D Poster #105 | |
Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input
Poster Session 5
Jian Wang · Rishabh Dabral · Diogo Luvizon · Zhe Cao · Lingjie Liu · Thabo Beeler · Christian Theobalt
|
ExHall D Poster #153 | |
Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing
Poster Session 1
Zhedong Zhang · Liang Li · Chenggang Yan · Chunshan Liu · Anton van den Hengel · Yuankai Qi
|
ExHall D Poster #1 | |
Decoder Gradient Shield: Provable and High-Fidelity Prevention of Gradient-Based Box-Free Watermark Removal
Poster Session 3
Haonan An · Guang Hua · Zhengru Fang · Guowen Xu · Susanto Rahardja · Yuguang Fang
|
ExHall D Poster #265 | |
Synchronized Video-to-Audio Generation via Mel Quantization-Continuum Decomposition
Poster Session 1
Juncheng Wang · Chao Xu · Cheng Yu · Lei Shang · Zhe Hu · Shujun Wang · Liefeng Bo
|
ExHall D Poster #282 | |
K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences
Poster Session 2
Zhikai Li · Xuewen Liu · Dongrong Joe Fu · Jianquan Li · Qingyi Gu · Kurt Keutzer · Zhen Dong
|
ExHall D Poster #359 | |
NLPrompt: Noise-Label Prompt Learning for Vision-Language Models
Bikang Pan · Qun Li · Xiaoying Tang · Wei Huang · Zhen Fang · Feng Liu · Jingya Wang · Jingyi Yu · Ye Shi
|
ExHall D Poster #397 | |
Attention Distillation: A Unified Approach to Visual Characteristics Transfer
Poster Session 4
Yang Zhou · Xu Gao · Zichong Chen · Hui Huang
|
ExHall D Poster #236 | |
SMTPD: A New Benchmark for Temporal Prediction of Social Media Popularity
Poster Session 4
Yijie Xu · Bolun Zheng · Wei Zhu · Hangjia Pan · Yuchen Yao · Ning Xu · An-An Liu · Quan Zhang · Chenggang Yan
|
ExHall D Poster #292 | |
Visual Prompting for One-shot Controllable Video Editing without Inversion
Poster Session 2
Zhengbo Zhang · Yuxi Zhou · DUO PENG · Joo Lim · Zhigang Tu · De Soh Soh · Lin Geng Foo
|
ExHall D Poster #231 | |
X-Dyna: Expressive Dynamic Human Image Animation
Di Chang · Hongyi Xu · You Xie · Yipeng Gao · Zhengfei Kuang · Shengqu Cai · Chenxu Zhang · Guoxian Song · Chao Wang · Yichun Shi · Zeyuan Chen · Shijie Zhou · Linjie Luo · Gordon Wetzstein · Mohammad Soleymani
|
ExHall D Poster #5 | |
Face Forgery Video Detection via Temporal Forgery Cue Unraveling
Poster Session 2
Zonghui Guo · YingJie Liu · Jie Zhang · Haiyong Zheng · Shiguang Shan
|
ExHall D Poster #194 | |
Blood Flow Speed Estimation with Optical Coherence Tomography Angiography Images
Poster Session 2
Wensheng Cheng · Zhenghong Li · Jiaxiang Ren · Hyomin Jeong · Congwu Du · Yingtian Pan · Haibin Ling
|
ExHall D Poster #485 | |
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model
Poster Session 6
Zhenglin Huang · Jinwei Hu · Yiwei He · Xiangtai Li · Xiaowei Huang · Bei Peng · Xingyu Zhao · Baoyuan Wu · Guangliang Cheng
|
ExHall D Poster #254 | |
M3amba: Memory Mamba is All You Need for Whole Slide Image Classification
Poster Session 3
Tingting Zheng · Kui Jiang · Yi Xiao · Sicheng Zhao · Hongxun Yao
|
ExHall D Poster #474 | |
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
Poster Session 2
Yexin Liu · Zhengyang Liang · Yueze Wang · Xianfeng Wu · feilong tang · Muyang He · Jian Li · Zheng Liu · Harry Yang · Ser-Nam Lim · Bo Zhao
|
ExHall D Poster #355 | |
Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection
Poster Session 6
Zhen Qu · Xian Tao · Xinyi Gong · ShiChen Qu · Qiyu Chen · Zhengtao Zhang · Xingang Wang · Guiguang Ding
|
ExHall D Poster #407 | |
NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval
Poster Session 2
Zengrong Lin · Zheng Wang · Tianwen Qian · Pan Mu · Sixian Chan · Cong Bai
|
ExHall D Poster #371 | |
Robust-MVTON: Learning Cross-Pose Feature Alignment and Fusion for Robust Multi-View Virtual Try-On
Poster Session 4
Nannan Zhang · Yijiang Li · Dong Du · Zheng Chong · Zhengwentai Sun · Jianhao Zeng · Yusheng Dai · Zhenyu Xie · Hairui Zhu · Xiaoguang Han
|
ExHall D Poster #16 | |
Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater
Poster Session 1
Xueyu Liu · Rui Wang · Yexin Lai · Guangze Shi · Feixue Shao · Fang Hao · Jianan Zhang · Jia Shen · Yongfei Wu · Wen Zheng
|
ExHall D Poster #400 | |
EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling
Poster Session 1
Songpengcheng Xia · Yu Zhang · Zhuo Su · Xiaozheng Zheng · Zheng Lv · Guidong Wang · Yongjie Zhang · Qi Wu · Lei Chu · Ling Pei
|
ExHall D Poster #155 | |
Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment
Poster Session 1
Xudong Li · Wenjie Nie · Yan Zhang · Runze Hu · Ke Li · Xiawu Zheng · Liujuan Cao
|
ExHall D Poster #205 | |
ScaleLSD: Scalable Deep Line Segment Detection Streamlined
Poster Session 2
Zeran Ke · Bin Tan · Xianwei Zheng · Yujun Shen · Tianfu Wu · Nan Xue
|
ExHall D Poster #89 | |
BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with Transformer
Yuzhou Liu · Lingjie Zhu · Hanqiao Ye · Shangfeng Huang · Xiang Gao · Xianwei Zheng · Shuhan Shen
|
ExHall D Poster #111 | |
DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Poster Session 3
Guosheng Zhao · Chaojun Ni · Xiaofeng Wang · Zheng Zhu · Xueyang Zhang · Yida Wang · Guan Huang · xinze chen · Boyuan Wang · Youyi Zhang · Wenjun Mei · Xingang Wang
|
ExHall D Poster #132 | |
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
Poster Session 3
Boyuan Wang · Xiaofeng Wang · Chaojun Ni · Guosheng Zhao · Zhiqin Yang · Zheng Zhu · Muyang Zhang · YuKun Zhou · xinze chen · Guan Huang · lihong liu · Xingang Wang
|
ExHall D Poster #166 | |
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
Poster Session 2
Runtao Liu · Haoyu Wu · Zheng Ziqiang · Chen Wei · Yingqing He · Renjie Pi · Qifeng Chen
|
ExHall D Poster #252 | |
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
Poster Session 6
Tiehan Fan · Kepan Nan · Rui Xie · Penghao Zhou · Zhenheng Yang · Chaoyou Fu · Xiang Li · Jian Yang · Ying Tai
|
ExHall D Poster #270 | |
Parallelized Autoregressive Visual Generation
Yuqing Wang · Shuhuai Ren · Zhijie Lin · Yujin Han · Haoyuan Guo · Zhenheng Yang · Difan Zou · Jiashi Feng · Xihui Liu
|
ExHall D Poster #220 | |
VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving
Poster Session 4
Haiming Zhang · Wending Zhou · Shenzhen The Chinese University of Hongkong · Hong Kong University of Science and Technology · Huawei Technologies Ltd. · Huawei Technologies Ltd. · Huawei Technologies Ltd. · Huawei Technologies Ltd. · Huawei Technologies Ltd. · Shenzhen The Chinese University of Hong Kong
|
ExHall D Poster #129 | |
K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs
Poster Session 3
Ziheng Ouyang · Zhen Li · Qibin Hou
|
ExHall D Poster #228 | |
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models
Poster Session 1
Siyuan Bian · Chenghao Xu · Yuliang Xiu · Artur Grigorev · Zhen Liu · Cewu Lu · Michael J. Black · Yao Feng
|
ExHall D Poster #264 | |
HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Poster Session 1
Fan Yang · Ru Zhen · Jianing Wang · Yanhao Zhang · Haoxiang Chen · Haonan Lu · Sicheng Zhao · Guiguang Ding
|
ExHall D Poster #351 | |
Invisible Backdoor Attack against Self-supervised Learning
Poster Session 5
Hanrong Zhang · Zhenting Wang · Boheng Li · Fulin Lin · Tingxu Han · Mingyu Jin · Chenlu Zhan · Mengnan Du · Hongwei Wang · Shiqing Ma
|
ExHall D Poster #455 | |
Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization
Poster Session 2
zefeng zhang · Hengzhu Tang · Jiawei Sheng · Zhenyu Zhang · YiMing Ren · Zhenyang Li · Dawei Yin · Duohe Ma · Tingwen Liu
|
ExHall D Poster #386 | |
DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels
Poster Session 3
Erjian Guo · Zhen Zhao · Zicheng Wang · Tong Chen · YUNYI LIU · Luping Zhou
|
ExHall D Poster #352 | |
MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction
Poster Session 1
Gangjian Zhang · Nanjie Yao · Shunsi Zhang · hanfeng Zhao · Guoliang Pang · Jian Shu · Hao Wang
|
ExHall D Poster #16 | |
CGMatch: A Different Perspective of Semi-supervised Learning
Poster Session 3
Bo Cheng · Jueqing Lu · Yuan Tian · Haifeng Zhao · Yi Chang · Lan Du
|
ExHall D Poster #453 | |
Balanced Rate-Distortion Optimization in Learned Image Compression
Yichi Zhang · Zhihao Duan · Yuning Huang · Fengqing Zhu
|
ExHall D Poster #213 | |
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought
Poster Session 3
Yunze Man · De-An Huang · Guilin Liu · Shiwei Sheng · Shilong Liu · Liangyan Gui · Jan Kautz · Yu-Xiong Wang · Zhiding Yu
|
ExHall D Poster #346 | |
HD-EPIC: A Highly-Detailed Egocentric Video Dataset
Poster Session 5
Toby Perrett · Ahmad Darkhalil · Saptarshi Sinha · Omar Emara · Sam Pollard · Kranti Kumar Parida · Kaiting Liu · Prajwal Gatti · Siddhant Bansal · Kevin Flanagan · Jacob Chalk · Zhifan Zhu · Rhodri Guerrier · Fahd Abdelazim · Bin Zhu · Davide Moltisanti · Michael Wray · Hazel Doughty · Dima Damen
|
ExHall D Poster #276 | |
Human-centered Interactive Learning via MLLMs for Text-to-Image Person Re-identification
Poster Session 3
Yang Qin · Chao Chen · Zhihang Fu · Dezhong Peng · Xi Peng · Peng Hu
|
ExHall D Poster #357 | |
SpecTRe-GS: Modeling Highly Specular Surfaces with Reflected Nearby Objects by Tracing Rays in 3D Gaussian Splatting
Jiajun Tang · Fan Fei · Zhihao Li · Xiao Tang · Shiyong Liu · Youyu Chen · Binxiao Huang · Dave Zhenyu Chen · Xiaofei Wu · Boxin Shi
|
ExHall D Poster #27 | |
DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds
Youyu Chen · Junjun Jiang · Kui Jiang · Xiao Tang · Zhihao Li · Xianming Liu · Yinyu Nie
|
ExHall D Poster #47 | |
IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement
Poster Session 6
Zhihao Shi · Dong Huo · Yuhongze Zhou · Yan Min · Juwei Lu · Xinxin Zuo
|
ExHall D Poster #51 | |
Empowering Large Language Models with 3D Situation Awareness
Poster Session 4
Zhihao Yuan · Yibo Peng · Jinke Ren · Yinghong Liao · Yatong Han · Chun-Mei Feng · Hengshuang Zhao · Guanbin Li · Shuguang Cui · Zhen Li
|
ExHall D Poster #346 | |
LEDiff: Latent Exposure Diffusion for HDR Generation
Poster Session 1
Chao Wang · Zhihao Xia · Thomas Leimkuehler · Karol Myszkowski · Xuaner Zhang
|
ExHall D Poster #27 | |
Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable
Poster Session 1
Xin Jin · Simon Niklaus · Zhoutong Zhang · Zhihao Xia · Chun-Le Guo · Yuting Yang · Jiawen Chen · Chongyi Li
|
ExHall D Poster #180 | |
VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification
Poster Session 1
Xianwei Zhuang · Zhihong Zhu · Yuxin Xie · Liming Liang · Yuexian Zou
|
ExHall D Poster #384 | |
A Unified Approach to Interpreting Self-supervised Pre-training Methods for 3D Point Clouds via Interactions
Qiang Li · Jian Ruan · Fanghao Wu · Yuchi Chen · Zhihua Wei · Wen Shen
|
ExHall D Poster #111 | |
RaSS: Improving Denoising Diffusion Samplers with Reinforced Active Sampling Scheduler
Poster Session 3
Xin Ding · Lei Yu · Xin Li · Zhijun Tu · Hanting Chen · Jie Hu · Zhibo Chen
|
ExHall D Poster #217 | |
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
Poster Session 4
Enguang Wang · Zhimao Peng · Zhengyuan Xie · Fei Yang · Xialei Liu · Ming-Ming Cheng
|
ExHall D Poster #428 | |
Font-Agent: Enhancing Font Understanding with Large Language Models
Poster Session 4
Yingxin Lai · Cuijie Xu · Haitian Shi · Guoqing Yang · Xiaoning Li · Zhiming Luo · Shaozi Li
|
ExHall D Poster #368 | |
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
Poster Session 3
Kaiyu Li · Ruixun Liu · Xiangyong Cao · Xueru Bai · Feng Zhou · Deyu Meng · Wang Zhi
|
ExHall D Poster #319 | |
Dynamic Updates for Language Adaptation in Visual-Language Tracking
Poster Session 4
Xiaohai Li · Bineng Zhong · Qihua Liang · Zhiyi Mo · Jian Nong · Shuxiang Song
|
ExHall D Poster #321 | |
TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting
Poster Session 3
Jianchuan Chen · Jingchuan Hu · Gaige Wang · Zhonghua Jiang · Tiansong Zhou · Zhiwen Chen · Chengfei Lv
|
ExHall D Poster #7 | |
Steepest Descent Density Control for Compact 3D Gaussian Splatting
Poster Session 6
Peihao Wang · Yuehao Wang · Dilin Wang · Sreyas Mohan · Zhiwen Fan · Lemeng Wu · Ruisi Cai · Yu-Ying Yeh · Zhangyang Wang · Qiang Liu · Rakesh Ranjan
|
ExHall D Poster #48 | |
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Poster Session 3
Shijie Zhou · Hui Ren · Yijia Weng · Shuwang Zhang · Zhen Wang · Dejia Xu · Zhiwen Fan · Suya You · Zhangyang Wang · Leonidas Guibas · Achuta Kadambi
|
ExHall D Poster #338 | |
OpenSDI: Spotting Diffusion-Generated Images in the Open World
Poster Session 1
Yabin Wang · Zhiwu Huang · Xiaopeng Hong
|
ExHall D Poster #393 | |
Improving Visual and Downstream Performance of Low-Light Enhancer with Vision Foundation Models Collaboration
Poster Session 4
yuxuan Gu · Huaian Chen · Yi Jin · Haoxuan Wang · Pengyang Ling · ZHIXIANG WEI · Enhong Chen
|
ExHall D Poster #20 | |
Teaching Large Language Models to Regress Accurate Image Quality Scores Using Score Distribution
Poster Session 3
Zhiyuan You · Xin Cai · Jinjin Gu · Tianfan Xue · Chao Dong
|
ExHall D Poster #366 | |
UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion
Zixuan Chen · Yujin Wang · Xin Cai · Zhiyuan You · Zhe-Ming Lu · Fan Zhang · Shi Guo · Tianfan Xue
|
ExHall D Poster #25 | |
Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation
Poster Session 4
Kendong Liu · Zhiyu Zhu · Hui LIU · Junhui Hou
|
ExHall D Poster #212 | |
BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence
Poster Session 2
Xuewu Lin · Tianwei Lin · Alan Huang · HONGYU XIE · Zhizhong Su
|
ExHall D Poster #348 | |
ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
Poster Session 5
Qihang Peng · Henry Zheng · Gao Huang
|
ExHall D Poster #340 | |
WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
Poster Session 5
Zhipeng Huang · Shaobin Zhuang · Canmiao Fu · Binxin Yang · Ying Zhang · Chong Sun · Chen Li · Yali Wang · Zhizheng Zhang · Zheng-Jun Zha
|
ExHall D Poster #253 | |
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
Poster Session 3
Haoyi Jiang · Liu Liu · Tianheng Cheng · Xinjie wang · Tianwei Lin · Zhizhong Su · Wenyu Liu · Xinggang Wang
|
ExHall D Poster #127 | |
Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
Poster Session 5
Songsong Yu · Yuxin Chen · Zhongang Qi · Zeke Xie · Yifan Wang · Lijun Wang · Ying Shan · Huchuan Lu
|
ExHall D Poster #76 | |
Dense-To-Sparse Video Diffusion For High-fidelity Multi-View Images Synthesis
Poster Session 4
Fan Yang · Jianfeng Zhang · Jun Hao Liew · Chaoyue Song · Zhongcong Xu · Xiu Li · Jiashi Feng · Guosheng Lin
|
ExHall D Poster #59 | |
Rethinking Personalized Aesthetics Assessment: Employing Physique Aesthetics Assessment as An Exemplification
Haobin Zhong · Shuai He · Anlong Ming · Huadong Ma
|
ExHall D Poster #265 | |
Universal Actions for Enhanced Embodied Foundation Models
Poster Session 5
Jinliang Zheng · Jianxiong Li · Dongxiu Liu · Yinan Zheng · Zhihao Wang · Zhonghong Ou · Yu Liu · Jingjing Liu · Ya-Qin Zhang · Xianyuan Zhan
|
ExHall D Poster #138 | |
4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video
Poster Session 1
Qiang Hu · Zihan Zheng · Houqiang Zhong · Sihua Fu · Li Song · Xiaoyun Zhang · Guangtao Zhai · Yanfeng Wang
|
ExHall D Poster #66 | |
FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation
Poster Session 1
Tianyun Zhong · Chao Liang · Jianwen Jiang · Gaojie Lin · Jiaqi Yang · Zhou Zhao
|
ExHall D Poster #281 | |
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
Poster Session 6
Zhongwei Ren · Yunchao Wei · Xun Guo · Yao Zhao · Bingyi Kang · Jiashi Feng · Xiaojie Jin
|
ExHall D Poster #276 | |
Towards Stable and Storage-efficient Dataset Distillation: Matching Convexified Trajectory
Poster Session 5
Wenliang Zhong · Haoyu Tang · Qinghai Zheng · Mingzhu Xu · Yupeng Hu · Weili Guan
|
ExHall D Poster #434 | |
StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts
Poster Session 6
Zhaoxing Gan · Mengtian Li · Ruhua Chen · Zhongxia JI · Sichen Guo · Huanling Hu · Guangnan Ye · Zuo Hu
|
ExHall D Poster #242 | |
STAA-SNN: Spatial-Temporal Attention Aggregator for Spiking Neural Networks
Poster Session 3
Tianqing Zhang · Kairong Yu · Xian Zhong · Hongwei Wang · Qi Xu · Qiang Zhang
|
ExHall D Poster #316 | |
Anomize: Better Open Vocabulary Video Anomaly Detection
Poster Session 6
Fei Li · Wenxuan Liu · Jingjing Chen · Ruixu Zhang · Yuran Wang · Xian Zhong · Zheng Wang
|
ExHall D Poster #292 | |
HyperFree: A Channel-adaptive and Tuning-free Foundation Model for Hyperspectral Remote Sensing Imagery
Poster Session 5
Jingtao Li · Yingyi Liu · XINYU WANG · Yunning Peng · Chen Sun · Shaoyu Wang · Zhendong Sun · Tian Ke · Xiao Jiang · Tangwei Lu · Anran Zhao · Yanfei Zhong
|
ExHall D Poster #189 | |
GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors
Poster Session 1
An Li · Zhe Zhu · Mingqiang Wei
|
ExHall D Poster #106 | |
Task-Agnostic Guided Feature Expansion for Class-Incremental Learning
Poster Session 2
Bowen Zheng · Da-Wei Zhou · Han-Jia Ye · De-Chuan Zhan
|
ExHall D Poster #450 | |
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
Poster Session 2
Enshen Zhou · Qi Su · Cheng Chi · Zhizheng Zhang · Zhongyuan Wang · Tiejun Huang · Lu Sheng · He Wang
|
ExHall D Poster #148 | |
Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation
Poster Session 4
Pu Cao · Feng Zhou · Lu Yang · Tianrui Huang · Qing Song
|
ExHall D Poster #244 | |
Wav2Sem: Plug-and-Play Audio Semantic Decoupling for 3D Speech-Driven Facial Animation
Poster Session 1
Hao Li · Ju Dai · Xin Zhao · Feng Zhou · Junjun Pan · Lei Li
|
ExHall D Poster #2 | |
Navigation World Models
Poster Session 4
Amir Bar · Gaoyue Zhou · Danny Tran · Trevor Darrell · Yann LeCun
|
ExHall D Poster #396 | |
LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors
Poster Session 5
Han Zhou · Wei Dong · Jun Chen
|
ExHall D Poster #50 | |
CASAGPT: Cuboid Arrangement and Scene Assembly for Interior Design
Weitao Feng · Hang Zhou · Jing Liao · Li Cheng · Wenbo Zhou
|
ExHall D Poster #289 | |
Forensics Adapter: Adapting CLIP for Generalizable Face Forgery Detection
Poster Session 4
Xinjie Cui · Yuezun Li · Ao Luo · Jiaran Zhou · Junyu Dong
|
ExHall D Poster #325 | |
Consistency-aware Self-Training for Iterative-based Stereo Matching
Poster Session 4
Jingyi Zhou · Peng Ye · Haoyu Zhang · Jiakang Yuan · Rao Qiang · Liu YangChenXu · Wu Cailin · Feng Xu · Tao Chen
|
ExHall D Poster #77 | |
Unlearning through Knowledge Overwriting: Reversible Federated Unlearning via Selective Sparse Adapter
Poster Session 6
Zhengyi Zhong · Weidong Bao · Ji Wang · Shuai Zhang · Jingxuan Zhou · Lingjuan Lyu · Wei Yang Bryan Lim
|
ExHall D Poster #432 | |
MLVU: Benchmarking Multi-task Long Video Understanding
Poster Session 3
Junjie Zhou · Yan Shu · Bo Zhao · Boya Wu · Zhengyang Liang · Shitao Xiao · Minghao Qin · Xi Yang · yongping xiong · Bo Zhang · Tiejun Huang · Zheng Liu
|
ExHall D Poster #291 | |
OmniGen: Unified Image Generation
Poster Session 3
Shitao Xiao · Yueze Wang · Junjie Zhou · Huaying Yuan · Xingrun Xing · Ruiran Yan · Chaofan Li · Shuting Wang · Tiejun Huang · Zheng Liu
|
ExHall D Poster #252 | |
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
Poster Session 6
Yan Shu · Zheng Liu · Peitian Zhang · Minghao Qin · Junjie Zhou · Zhengyang Liang · Tiejun Huang · Bo Zhao
|
ExHall D Poster #339 | |
NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction
Wenyuan Zhang · Emily Yue-ting Jia · Junsheng Zhou · Baorui Ma · Kanle Shi · Yu-Shen Liu · Zhizhong Han
|
ExHall D Poster #63 | |
Maintaining Consistent Inter-Class Topology in Continual Test-Time Adaptation
Poster Session 3
Chenggong Ni · Fan Lyu · Jiayao Tan · Fuyuan Hu · Rui Yao · Tao Zhou
|
ExHall D Poster #447 | |
GraphMimic: Graph-to-Graphs Generative Modeling from Videos for Policy Learning
Poster Session 1
Guangyan Chen · Te Cui · Meiling Wang · Yang Chengcai · Mengxiao Hu · Haoyang Lu · Yao Mu · Zicai Peng · Tianxing Zhou · XINRAN JIANG · Yi Yang · Yufeng Yue
|
ExHall D Poster #148 | |
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Poster Session 5
Jiuhai Chen · Jianwei Yang · Haiping Wu · Dianqi Li · Jianfeng Gao · Tianyi Zhou · Bin Xiao
|
ExHall D Poster #372 | |
HybridMQA: Exploring Geometry-Texture Interactions for Colored Mesh Quality Assessment
Poster Session 5
Armin Shafiee Sarvestani · Sheyang Tang · Zhou Wang
|
ExHall D Poster #35 | |
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
Poster Session 6
Tianyi Yan · Dongming Wu · Wencheng Han · Junpeng Jiang · xia zhou · Kun Zhan · Cheng-Zhong Xu · Jianbing Shen
|
ExHall D Poster #131 | |
DEIM: DETR with Improved Matching for Fast Convergence
Poster Session 3
Shihua Huang · Zhichao Lu · Xiaodong Cun · Yongjun YU · Xiao Zhou · Xi Shen
|
ExHall D Poster #432 | |
GBC-Splat: Generalizable Gaussian-Based Clothed Human Digitalization under Sparse RGB Cameras
Poster Session 6
Hanzhang Tu · Zhanfeng Liao · Boyao Zhou · Shunyuan Zheng · Xilong Zhou · Liuxin ZHANG · QianYing Wang · Yebin Liu
|
ExHall D Poster #18 | |
Implicit Correspondence Learning for Image-to-Point Cloud Registration
Xinjun Li · Wenfei Yang · Jiacheng Deng · Zhixin Cheng · Xu Zhou · Tianzhu Zhang
|
ExHall D Poster #106 | |
Generative Map Priors for Collaborative BEV Semantic Segmentation
Poster Session 3
Jiahui Fu · Yue Gong · Luting Wang · Shifeng Zhang · Xu Zhou · Si Liu
|
ExHall D Poster #123 | |
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
Poster Session 2
Shaofei Huang · Rui Ling · Tianrui Hui · Hongyu Li · Xu Zhou · Shifeng Zhang · Si Liu · Richang Hong · Meng Wang
|
ExHall D Poster #285 | |
Visual Lexicon: Rich Image Features in Language Space
Poster Session 4
XuDong Wang · Xingyi Zhou · Alireza Fathi · Trevor Darrell · Cordelia Schmid
|
ExHall D Poster #375 | |
Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks
Yu Zhou · Dian Zheng · Qijie Mo · Ren-Jie Lu · Kun-Yu Lin · Wei-Shi Zheng
|
ExHall D Poster #433 | |
nnWNet: Rethinking the Use of Transformers in Biomedical Image Segmentation and Calling for a Unified Evaluation Benchmark
Poster Session 4
Yanfeng Zhou · Lingrui Li · Le Lu · Minfeng Xu
|
ExHall D Poster #480 | |
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
Poster Session 3
Bo Tong · Bokai Lai · Yiyi Zhou · Luo · Yunhang Shen · Ke Li · Xiaoshuai Sun · Rongrong Ji
|
ExHall D Poster #375 | |
Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval
Poster Session 3
Yushuai Sun · Zikun Zhou · Dongmei Jiang · Yaowei Wang · Jun Yu · Guangming Lu · Wenjie Pei
|
ExHall D Poster #441 | |
MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Xinqi Liu · Li Zhou · Zikun Zhou · Jianqiu Chen · Zhenyu He
|
ExHall D Poster #321 | |
HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Poster Session 1
Zitang Zhou · Ke Mei · Yu Lu · Tianyi Wang · Fengyun Rao
|
ExHall D Poster #286 | |
Towards General Visual-Linguistic Face Forgery Detection
Poster Session 4
Ke Sun · Shen Chen · Taiping Yao · Ziyin Zhou · Jiayi Ji · Xiaoshuai Sun · Chia-Wen Lin · Rongrong Ji
|
ExHall D Poster #359 | |
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
Poster Session 2
Junfeng Ni · Yu Liu · Ruijie Lu · ZiRui Zhou · Song-Chun Zhu · Yixin Chen · Siyuan Huang
|
ExHall D Poster #55 | |
DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh
Poster Session 1
Jingyu Zhuang · Di Kang · Linchao Bao · Liang Lin · Guanbin Li
|
ExHall D Poster #12 | |
Enhancing Diversity for Data-free Quantization
Poster Session 5
Kai Zhao · zhihao zhuang · Miao Zhang · Chenjuan Guo · Yang Shu · Bin Yang
|
ExHall D Poster #425 | |
Enduring, Efficient and Robust Trajectory Prediction Attack in Autonomous Driving via Optimization-Driven Multi-Frame Perturbation Framework
Yi Yu · Weizhen Han · Libing Wu · Bingyi Liu · Enshu Wang · Zhuangzhuang Zhang
|
ExHall D Poster #135 | |
Pose-Guided Temporal Enhancement for Robust Low-Resolution Hand Reconstruction
Poster Session 5
Kaixin Fan · Pengfei Ren · Jingyu Wang · Haifeng Sun · Qi Qi · Zirui Zhuang · Jianxin Liao
|
ExHall D Poster #149 | |
PolarNeXt: Rethink Instance Segmentation with Polar Representation
Poster Session 4
Jiacheng Sun · Xinghong Zhou · Yiqiang Wu · Bin Zhu · Jiaxuan Lu · Yu Qin · Xiaomao Li
|
ExHall D Poster #335 | |
CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
Poster Session 2
Yuxuan Sun · Yixuan Si · Chenglu Zhu · Xuan Gong · Kai Zhang · Pingyi Chen · Ye Zhang · Zhongyi Shui · Tao Lin · Lin Yang
|
ExHall D Poster #475 | |
VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis
Poster Session 3
Zhifeng Wang · Renjiao Yi · Xin Wen · Chenyang Zhu · Kai Xu
|
ExHall D Poster #483 | |
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
Poster Session 1
Yijie Tang · Jiazhao Zhang · Yuqing Lan · Yulan Guo · Dezun Dong · Chenyang Zhu · Kai Xu
|
ExHall D Poster #334 | |
Test-Time Backdoor Detection for Object Detection Models
Poster Session 5
Hangtao Zhang · Yichen Wang · Shihui Yan · Chenyu Zhu · Ziqi Zhou · Linshan Hou · Shengshan Hu · Minghui Li · Yanjun Zhang · Leo Yu Zhang
|
ExHall D Poster #320 | |
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding
Poster Session 6
Hao Li · Changyao TIAN · Jie Shao · Xizhou Zhu · Zhaokai Wang · Jinguo Zhu · Wenhan Dou · Xiaogang Wang · Hongsheng Li · Lewei Lu · Jifeng Dai
|
ExHall D Poster #347 | |
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection
Poster Session 2
Yifan Chang · Junjie Huang · Xiaofeng Wang · Yun Ye · Zhujin LIANG · Yi Shan · Dalong Du · Xingang Wang
|
ExHall D Poster #137 | |
Continual SFT Matches Multimodal RLHF with Negative Supervision
Poster Session 3
Ke Zhu · Yu Wang · Yanpeng Sun · Qiang Chen · Jiang-Jiang Liu · gang zhang · Jingdong Wang
|
ExHall D Poster #380 | |
Quantization without Tears
Poster Session 1
Minghao Fu · Hao Yu · Jie Shao · Junjie Zhou · Ke Zhu · Jianxin Wu
|
ExHall D Poster #412 | |
OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary
Poster Session 6
Yifeng Yang · Lin Zhu · Zewen Sun · Hengyu Liu · Qinying Gu · Nanyang Ye
|
ExHall D Poster #429 | |
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline
Poster Session 4
Junlong Cheng · Bin Fu · Jin Ye · Guoan Wang · Tianbin Li · Haoyu Wang · Ruoyu Li · He Yao · Chen Junren · Jingwen Li · Yanzhou Su · Min Zhu · Junjun He
|
ExHall D Poster #479 | |
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
Poster Session 1
Muzhi Zhu · Yuzhuo Tian · Hao Chen · Chunluan Zhou · Qingpei Guo · Yang Liu · Ming Yang · Chunhua Shen
|
ExHall D Poster #335 | |
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
Poster Session 6
Songhao Han · Wei Huang · Hairong Shi · Le Zhuo · Xiu Su · Shifeng Zhang · Xu Zhou · Xiaojuan Qi · Yue Liao · Si Liu
|
ExHall D Poster #266 | |
LongDiff: Training-Free Long Video Generation in One Go
Poster Session 4
Zhuoling Li · Hossein Rahmani · Qiuhong Ke · Jun Liu
|
ExHall D Poster #189 | |
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
Poster Session 3
Zhuoman Liu · Weicai Ye · Yan Luximon · Pengfei Wan · Di ZHANG
|
ExHall D Poster #35 | |
Monocular and Generalizable Gaussian Talking Head Animation
Poster Session 2
Shengjie Gong · Haojie Li · Jiapeng Tang · Dongming Hu · Shuangping Huang · Hao Chen · Tianshui Chen · Zhuoman Liu
|
ExHall D Poster #7 | |
Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation
Poster Session 3
Zhuoran ZHAO · Linlin Yang · Pengzhan Sun · Pan Hui · Angela Yao
|
ExHall D Poster #154 | |
Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation
Poster Session 2
Kang Liu · Zhuoqi Ma · Xiaolu Kang · Yunan Li · Kun XIE · Zhicheng Jiao · Qiguang Miao
|
ExHall D Poster #474 | |
Neural Video Compression with Context Modulation
Poster Session 3
Chuanbo Tang · Zhuoyuan Li · Yifan Bian · Li Li · Dong Liu
|
ExHall D Poster #181 | |
DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction
Junjie Zhou · Shouju Wang · Yuxia Tang · Qi Zhu · Daoqiang Zhang · WEI SHAO
|
ExHall D Poster #453 | |
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting
Poster Session 1
Runsong Zhu · Shi Qiu · ZHENGZHE LIU · Ka-Hei Hui · Qianyi Wu · Pheng-Ann Heng · Chi-Wing Fu
|
ExHall D Poster #332 | |
HRAvatar: High-Quality and Relightable Gaussian Head Avatar
Poster Session 6
Dongbin Zhang · Yunfei Liu · Lijian Lin · Ye Zhu · Kangjie Chen · Minghan Qin · Yu Li · Haoqian Wang
|
ExHall D Poster #8 | |
AffordDP: Generalizable Diffusion Policy with Transferable Affordance
Poster Session 2
Shijie Wu · Yihang Zhu · Yunao Huang · Kaizhen Zhu · Jiayuan Gu · Jingyi Yu · Ye Shi · Jingya Wang
|
ExHall D Poster #153 | |
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Poster Session 4
Jianwen Jiang · Gaojie Lin · Zhengkun Rong · Chao Liang · Yongming Zhu · Jiaqi Yang · Tianyun Zhong
|
ExHall D Poster #6 | |
MITracker: Multi-View Integration for Visual Object Tracking
Mengjie Xu · Yitao Zhu · Haotian Jiang · Jiaming Li · Zhenrong Shen · Sheng Wang · Haolin Huang · Xinyu Wang · Han Zhang · Qing Yang · Qian Wang
|
ExHall D Poster #98 | |
Forming Auxiliary High-confident Instance-level Loss to Promote Learning from Label Proportions
Poster Session 4
Tianhao Ma · Han Chen · Juncheng Hu · Yungang Zhu · Ximing Li
|
ExHall D Poster #455 | |
Learning Class Prototypes for Unified Sparse-Supervised 3D Object Detection
Yun Zhu · Le Hui · Hang Yang · Jianjun Qian · Jin Xie · Jian Yang
|
ExHall D Poster #432 | |
WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion
Poster Session 4
Yang Wu · Yun Zhu · Kaihua Zhang · Jianjun Qian · Jin Xie · Jian Yang
|
ExHall D Poster #115 | |
OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation
Xiao Cui · Yulei Qin · Wengang Zhou · Hongsheng Li · Houqiang Li
|
ExHall D Poster #440 | |
I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models
Poster Session 3
Dongnan Gui · Xun Guo · Wengang Zhou · Yan Lu
|
ExHall D Poster #186 | |
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Poster Session 3
Zhiyang Guo · Jinxu Xiang · Kai Ma · Wengang Zhou · Houqiang Li · Ran Zhang
|
ExHall D Poster #12 | |
SmartEraser: Remove Anything from Images using Masked-Region Guidance
Poster Session 5
Longtao Jiang · Zhendong Wang · Jianmin Bao · Wengang Zhou · Dongdong Chen · Lei Shi · Dong Chen · Houqiang Li
|
ExHall D Poster #327 | |
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
Poster Session 5
Zhendong Wang · Jianmin Bao · Shuyang Gu · Dong Chen · Wengang Zhou · Houqiang Li
|
ExHall D Poster #239 | |
MotionPro: A Precise Motion Controller for Image-to-Video Generation
Poster Session 6
Zhongwei Zhang · Fuchen Long · Zhaofan Qiu · Yingwei Pan · Wu Liu · Ting Yao · Tao Mei
|
ExHall D Poster #170 | |
Making Old Film Great Again: Degradation-aware State Space Model for Old Film Restoration
Poster Session 6
Yudong Mao · Hao Luo · Zhiwei Zhong · Peilin CHEN · Zhijiang Zhang · Shiqi Wang
|
ExHall D Poster #178 | |
VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models
Lei Li · wei yuancheng · Zhihui Xie · Xuqing Yang · Yifan Song · Peiyi Wang · Chenxin An · Tianyu Liu · Sujian Li · Bill Yuchen Lin · Lingpeng Kong · Qi Liu
|
ExHall D Poster #347 | |
Cross-Modal 3D Representation with Multi-View Images and Point Clouds
Poster Session 1
Ziyang Zhou · Pinghui Wang · Zi Liang · Haitao Bai · Ruofei Zhang
|
ExHall D Poster #339 | |
Which Viewpoint Shows it Best? Language for Weakly Supervising View Selection in Multi-view Instructional Videos
Sagnik Majumder · Tushar Nagarajan · Ziad Al-Halah · Reina Pradhan · Kristen Grauman
|
ExHall D Poster #275 | |
SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language
Poster Session 6
zehan wang · Sashuai zhou · Shaoxuan He · Haifeng Huang · Lihe Yang · Ziang Zhang · Xize Cheng · Shengpeng Ji · Tao Jin · Hengshuang Zhao · Zhou Zhao
|
ExHall D Poster #336 | |
Diffusion Renderer: Neural Inverse and Forward Rendering with Video Diffusion Models
Poster Session 6
Ruofan Liang · Žan Gojčič · Huan Ling · Jacob Munkberg · Jon Hasselgren · Chih-Hao Lin · Jun Gao · Alexander Keller · Nandita Vijaykumar · Sanja Fidler · Zian Wang
|
ExHall D Poster #29 | |
Radio Frequency Ray Tracing with Neural Object Representation for Enhanced RF Modeling
Poster Session 5
Xingyu Chen · Zihao Feng · Kun Qian · Xinyu Zhang
|
ExHall D Poster #28 | |
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Poster Session 6
Hao Chen · Ze Wang · Xiang Li · Ximeng Sun · Fangyi Chen · Jiang Liu · Jindong Wang · Bhiksha Raj · Zicheng Liu · Emad Barsoum
|
ExHall D Poster #209 | |
FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
Poster Session 1
Yufan Ren · Zicong Jiang · Tong Zhang · Søren Forchhammer · Sabine Süsstrunk
|
ExHall D Poster #238 | |
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
Poster Session 3
Lei Wang · Senmao Li · Fei Yang · Jianye Wang · Ziheng Zhang · Yuhan Liu · Yaxing Wang · Jian Yang
|
ExHall D Poster #213 | |
LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds
Poster Session 1
Zihui Zhang · Weisheng Dai · Hongtao Wen · Bo Yang
|
ExHall D Poster #112 | |
Learning Flow Fields in Attention for Controllable Person Image Generation
Poster Session 1
Zijian Zhou · Shikun Liu · Xiao Han · Haozhe Liu · Kam Woh Ng · Tian Xie · Yuren Cong · Hang Li · Mengmeng Xu · Juan-Manuel Pérez-Rúa · Aditya Patel · Tao Xiang · Miaojing Shi · Sen He
|
ExHall D Poster #223 | |
MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving
Zhi-Yuan Zhang · Xiaofan Li · Zhihao Xu · Wenjie Peng · Zijian Zhou · Miaojing Shi · Shuangping Huang
|
ExHall D Poster #139 | |
Dual Diffusion for Unified Image Generation and Understanding
Poster Session 1
Zijie Li · Henry Li · Yichun Shi · Amir Barati Farimani · Yuval Kluger · Linjie Yang · Peng Wang
|
ExHall D Poster #251 | |
Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection
Jiahao Xu · Zikai Zhang · Rui Hu
|
ExHall D Poster #461 | |
ModeSeq: Taming Sparse Multimodal Motion Prediction with Sequential Mode Modeling
Poster Session 1
Zikang Zhou · Hengjian Zhou · Haibo Hu · Zihao WEN · Jianping Wang · Yung-Hui Li · Yu-Kai Huang
|
ExHall D Poster #135 | |
Iterative Predictor-Critic Code Decoding for Real-World Image Dehazing
Poster Session 3
Jiayi Fu · Siyu Liu · Zikun Liu · Chun-Le Guo · Hyunhee Park · Rui-Qi Wu · Guoqing Wang · Chongyi Li
|
ExHall D Poster #196 | |
Open Ad-hoc Categorization with Contextualized Feature Learning
Poster Session 3
Zilin Wang · Sangwoo Mo · Stella X. Yu · Sima Behpour · Liu Ren
|
ExHall D Poster #427 | |
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Poster Session 2
Lianghui Zhu · Zilong Huang · Bencheng Liao · Jun Hao Liew · Hanshu Yan · Jiashi Feng · Xinggang Wang
|
ExHall D Poster #219 | |
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Sili Chen · Hengkai Guo · Shengnan Zhu · Feihu Zhang · Zilong Huang · Jiashi Feng · Bingyi Kang
|
ExHall D Poster #169 | |
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation
Poster Session 5
Zilyu Ye · Zhiyang Chen · Tiancheng Li · Zemin Huang · Weijian Luo · Guo-Jun Qi
|
ExHall D Poster #225 | |
FG^2: Fine-Grained Cross-View Localization by Fine-Grained Feature Matching
Poster Session 2
Zimin Xia · Alex Alahi
|
ExHall D Poster #94 | |
CoMatcher: Multi-View Collaborative Feature Matching
Poster Session 5
Jintao Zhang · Zimin Xia · Mingyue Dong · Shuhan Shen · Linwei Yue · Xianwei Zheng
|
ExHall D Poster #88 | |
HotSpot: Signed Distance Function Optimization with an Asymptotically Sufficient Condition
Zimo Wang · Cheng Wang · Taiki Yoshino · Sirui Tao · Ziyang Fu · Tzu-Mao Li
|
ExHall D Poster #103 | |
MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation
Poster Session 1
Zhenyu Wu · Yuheng Zhou · Xiuwei Xu · Ziwei Wang · Haibin Yan
|
ExHall D Poster #144 | |
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
Poster Session 4
Hang Yin · Xiuwei Xu · Linqing Zhao · Ziwei Wang · Jie Zhou · Jiwen Lu
|
ExHall D Poster #311 | |
DeformCL: Learning Deformable Centerline Representation for Vessel Extraction in 3D Medical Image
Poster Session 6
Ziwei Zhao · Zhixing Zhang · Yuhang Liu · Zhao Zhang · Haojun Yu · Dong Wang · Liwei Wang
|
ExHall D Poster #454 | |
Task-driven Image Fusion with Learnable Fusion Loss
Haowen Bai · Jiangshe Zhang · Zixiang Zhao · Yichen Wu · Lilun Deng · Yukun Cui · Tao Feng · Shuang Xu
|
ExHall D Poster #200 | |
Synthetic Visual Genome
Poster Session 2
Jae Sung Park · Zixian Ma · Linjie Li · Chenhao Zheng · Cheng-Yu Hsieh · Ximing Lu · Khyathi Chandu · Quan Kong · Norimasa Kobori · Ali Farhadi · Yejin Choi · Ranjay Krishna
|
ExHall D Poster #354 | |
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model
Poster Session 1
Benlin Liu · Yuhao Dong · Yiqin Wang · Zixian Ma · Yansong Tang · Luming Tang · Yongming Rao · Wei-Chiu Ma · Ranjay Krishna
|
ExHall D Poster #344 | |
Reconciling Stochastic and Deterministic Strategies for Zero-shot Image Restoration using Diffusion Model in Dual
Poster Session 5
Chong Wang · Lanqing Guo · Zixuan Fu · SIYUAN YANG · Hao Cheng · Alex C. Kot · Bihan Wen
|
ExHall D Poster #205 | |
ConceptGuard: Continual Personalized Text-to-Image Generation with Forgetting and Confusion Mitigation
Poster Session 1
Zirun Guo · Tao Jin
|
ExHall D Poster #266 | |
LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
Poster Session 5
Zixuan Hu · Yongxian Wei · Li Shen · Chun Yuan · Dacheng Tao
|
ExHall D Poster #381 | |
Beyond Words: Augmenting Discriminative Richness via Diffusions in Unsupervised Prompt Learning
Poster Session 5
Hairui Ren · Fan Tang · He Zhao · Zixuan Wang · Dandan Guo · Yi Chang
|
ExHall D Poster #391 | |
Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis
Poster Session 3
Zixuan Wang · DUO PENG · Feng Chen · Yuwei Yang · Yinjie Lei
|
ExHall D Poster #237 | |
Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields
Poster Session 5
Runfeng Li · Mikhail Okunev · Zixuan Guo · Anh H Duong · Christian Richardt · Matthew O’Toole · James Tompkin
|
ExHall D Poster #85 | |
InsightEdit: Towards Better Instruction Following for Image Editing
Poster Session 1
Yingjing Xu · Jie Kong · Jiazhi Wang · Xiao Pan · Bo Lin · Qiang Liu
|
ExHall D Poster #242 | |
Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence
Poster Session 1
Haolin Liu · Xiaohang Zhan · Zizheng Yan · Zhongjin Luo · Yuxin Wen · Xiaoguang Han
|
ExHall D Poster #70 | |
Efficient Transfer Learning for Video-language Foundation Models
Poster Session 6
Haoxing Chen · Zizheng Huang · Yan Hong · YANSHUO WANG · Zhongcai Lyu · Zhuoer Xu · Jun Lan · Zhangxuan Gu
|
ExHall D Poster #285 | |
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
Poster Session 3
Chengyue Wu · Xiaokang Chen · Zhiyu Wu · Yiyang Ma · Xingchao Liu · Zizheng Pan · Wen Liu · Zhenda Xie · Xingkai Yu · Chong Ruan · Ping Luo
|
ExHall D Poster #221 | |
MINIMA: Modality Invariant Image Matching
Poster Session 5
Jiangwei Ren · Xingyu Jiang · Zizhuo Li · Dingkang Liang · Xin Zhou · Xiang Bai
|
ExHall D Poster #190 | |
A Simple yet Effective Layout Token in Large Language Models for Document Understanding
Poster Session 3
Zhaoqing Zhu · Chuwei Luo · Zirui Shao · Feiyu Gao · Hangdi Xing · Qi Zheng · Ji Zhang
|
ExHall D Poster #365 | |
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
Poster Session 2
Hongrui Jia · Chaoya Jiang · Haiyang Xu · Wei Ye · Mengfan Dong · Ming Yan · Ji Zhang · Fei Huang · Shikun Zhang
|
ExHall D Poster #380 | |
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
Poster Session 2
Yiyang Du · Xiaochen Wang · Chi Chen · Jiabo Ye · Yiru Wang · Peng Li · Ming Yan · Ji Zhang · Fei Huang · Zhifang Sui · Maosong Sun · Yang Liu
|
ExHall D Poster #385 | |
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
Poster Session 2
Yilun Zhao · Lujing Xie · Haowei Zhang · Guo Gan · Weiyuan Chen · Yitao Long · Tongyan Hu · Zhijian Xu · Chengye Wang · Chuhan Li · Ziyao Shangguan · Yixin Liu · Zhenwen Liang · Zhiyuan Hu · Chen Zhao · Arman Cohan
|
ExHall D Poster #296 | |
DreamOmni: Unified Image Generation and Editing
Poster Session 6
Bin Xia · Yuechen Zhang · Jingyao Li · Chengyao Wang · Yitong Wang · Xinglong Wu · Bei Yu · Jiaya Jia
|
ExHall D Poster #226 | |
SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models
Poster Session 2
Zilan Wang · Junfeng Guo · Jiacheng Zhu · Yiming Li · Heng Huang · Muhao Chen · Zhengzhong Tu
|
ExHall D Poster #271 | |
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Poster Session 3
Zhiwei Jia · Yuesong Nan · Huixi Zhao · Gengdai Liu
|
ExHall D Poster #216 | |
Vid2Avatar-Pro: Authentic Avatar from Videos in the Wild via Universal Prior
Poster Session 2
Chen Guo · Junxuan Li · Yash Kant · Yaser Sheikh · Shunsuke Saito · Chen Cao
|
ExHall D Poster #11 | |
LUCAS: Layered Universal Codec Avatars
Poster Session 5
Di Liu · Teng Deng · Giljoo Nam · Yu Rong · Stanislav Pidhorskyi · Junxuan Li · Jason Saragih · Dimitris N. Metaxas · Chen Cao
|
ExHall D Poster #8 | |
FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Poster Session 2
Jiawei Zhang · Zijian Wu · Zhiyang Liang · Yicheng Gong · Dongfang Hu · Yao Yao · Xun Cao · Hao Zhu
|
ExHall D Poster #8 | |
LMO: Linear Mamba Operator for MRI Reconstruction
Poster Session 1
Wei Li · jiawei jiang · Jie Wu · Kaihao Yu · Jianwei Zheng
|
ExHall D Poster #473 | |
Learning Visual Generative Priors without Text
Poster Session 2
Shuailei Ma · Kecheng Zheng · Ying Wei · Wei Wu · Fan Lu · Yifei Zhang · Chen-Wei Xie · Biao Gong · Jiapeng Zhu · Yujun Shen
|
ExHall D Poster #256 | |
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
Poster Session 4
Jiapeng Zhu · Ceyuan Yang · Kecheng Zheng · Yinghao Xu · Zifan Shi · Yifei Zhang · Qifeng Chen · Yujun Shen
|
ExHall D Poster #250 | |
OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation
Poster Session 1
Pengfei Zhou · Xiaopeng Peng · Jiajun Song · Chuanhao Li · Zhaopan Xu · Yue Yang · Ziyao Guo · Hao Zhang · Yuqi Lin · Yefei He · Lirui Zhao · Shuo Liu · Tianhua Li · Yuxuan Xie · Xiaojun Chang · Yu Qiao · Wenqi Shao · Kaipeng Zhang
|
ExHall D Poster #245 | |
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
Poster Session 4
Fan Lu · Wei Wu · Kecheng Zheng · Shuailei Ma · Biao Gong · Jiawei Liu · Wei Zhai · Yang Cao · Yujun Shen · Zheng-Jun Zha
|
ExHall D Poster #363 | |
Contextual AD Narration with Interleaved Multimodal Sequence
Poster Session 2
Hanlin Wang · Zhan Tong · Kecheng Zheng · Yujun Shen · Limin Wang
|
ExHall D Poster #287 | |
Mimir: Improving Video Diffusion Models for Precise Text Understanding
Poster Session 5
Shuai Tan · Biao Gong · Yutong Feng · Kecheng Zheng · DanDan Zheng · Shuwei Shi · Yujun Shen · Jingdong Chen · Ming Yang
|
ExHall D Poster #283 | |
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Poster Session 5
Shuwei Shi · Biao Gong · Xi Chen · DanDan Zheng · Shuai Tan · Zizheng Yang · Yuyuan Li · Jingwen He · Kecheng Zheng · Jingdong Chen · Ming Yang · Yinqiang Zheng
|
ExHall D Poster #172 | |
FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering
Poster Session 1
Chengyue Huang · Brisa Maneechotesuwan · Shivang Chopra · Zsolt Kira
|
ExHall D Poster #356 | |
Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Poster Session 2
Yiheng Li · Yang Yang · Zichang Tan · Huan Liu · Weihua Chen · Xu Zhou · Zhen Lei
|
ExHall D Poster #369 | |
Bayesian Test-Time Adaptation for Vision-Language Models
Poster Session 6
Lihua Zhou · Mao Ye · Shuaifeng Li · Nianxin Li · Xiatian Zhu · Lei Deng · Hongbin Liu · Zhen Lei
|
ExHall D Poster #368 | |
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization
Poster Session 4
Siyuan Li · Luyuan Zhang · Zedong Wang · Juanxi Tian · Cheng Tan · Zicheng Liu · Chang Yu · Qingsong Xie · Haonan Lu · Haoqian Wang · Zhen Lei
|
ExHall D Poster #373 | |
DaCapo: Score Distillation as Stacked Bridge for Fast and High-quality 3D Editing
Poster Session 4
Yufei Huang · Bangyan Liao · Yuqi Hu · Haitao Lin · Lirong Wu · Siyuan Li · Cheng Tan · Zicheng Liu · Yunfan Liu · Zelin Zang · Chang Yu · Zhen Lei
|
ExHall D Poster #43 | |
ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary
Poster Session 1
Zeqi Gu · Yin Cui · Max Li · Fangyin Wei · Yunhao Ge · Jinwei Gu · Ming-Yu Liu · Abe Davis · Yifan Ding
|
ExHall D Poster #261 | |
Model Diagnosis and Correction via Linguistic and Implicit Attribute Editing
Poster Session 3
Xuanbai Chen · Xiang Xu · Zhihua Li · Tianchen Zhao · Pietro Perona · Qin ZHANG · Yifan Xing
|
ExHall D Poster #347 | |
MagicQuill: An Intelligent Interactive Image Editing System
Poster Session 3
Zichen Liu · Yue Yu · Hao Ouyang · Qiuyu Wang · Ka Leong Cheng · Wen Wang · Zhiheng Liu · Qifeng Chen · Yujun Shen
|
ExHall D Poster #231 | |
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Poster Session 2
Kai Chen · Yunhao Gou · Runhui Huang · Zhili Liu · Daxin Tan · Jing Xu · Chunwei Wang · Yi Zhu · yihan zeng · Kuo Yang · Dingdong WANG · Kun Xiang · Haoyuan Li · Haoli Bai · Jianhua Han · Xiao-Hui Li · Weike Jin · Nian Xie · Yu Zhang · James Kwok · Hengshuang Zhao · Xiaodan Liang · Dit-Yan Yeung · Xiao Chen · Zhenguo Li · Wei Zhang · Qun Liu · Lanqing Hong · Lu Hou · Hang Xu
|
ExHall D Poster #1 | |
Dual Consolidation for Pre-Trained Model-Based Domain-Incremental Learning
Poster Session 4
Da-Wei Zhou · Zi-Wen Cai · Han-Jia Ye · Lijun Zhang · De-Chuan Zhan
|
ExHall D Poster #451 | |
Improved Video VAE for Latent Video Diffusion Model
Poster Session 4
Pingyu Wu · Kai Zhu · Yu Liu · Liming Zhao · Wei Zhai · Yang Cao · Zheng-Jun Zha
|
ExHall D Poster #221 | |
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large Language Models
Poster Session 3
Hao Yin · Guangzong Si · Zilei Wang
|
ExHall D Poster #381 | |
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
Poster Session 2
Zhihang Liu · Chen-Wei Xie · Pandeng Li · Liming Zhao · Longxiang Tang · Yun Zheng · Chuanbin Liu · Hongtao Xie
|
ExHall D Poster #305 | |
SkillMimic: Learning Basketball Interaction Skills from Demonstrations
Yinhuai Wang · Qihan Zhao · Runyi Yu · Hok Wai Tsui · Ailing Zeng · Jing Lin · Zhengyi Luo · Jiwen Yu · Xiu Li · Qifeng Chen · Jian Zhang · Lei Zhang · Ping Tan
|
ExHall D Poster #166 | |
Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing
Poster Session 4
Chen Liao · Yan Shen · Dan Li · Zhongli Wang
|
ExHall D Poster #209 | |
Towards Continual Universal Segmentation
Poster Session 6
Zihan Lin · Zilei Wang · Xu Wang
|
ExHall D Poster #312 | |
R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning
Poster Session 6
Lijun Sheng · Jian Liang · Zilei Wang · Ran He
|
ExHall D Poster #364 | |
Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference
Poster Session 2
Hao Yin · Guangzong Si · Zilei Wang
|
ExHall D Poster #382 | |
CASP: Consistency-aware Audio-induced Saliency Prediction Model for Omnidirectional Video
Poster Session 3
Zhaolin Wan · Han Qin · Zhiyang Li · Xiaopeng Fan · Wangmeng Zuo · Debin Zhao
|
ExHall D Poster #187 | |
Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers
Poster Session 4
Haoran You · Connelly Barnes · Yuqian Zhou · Yan Kang · Zhenbang Du · Wei Zhou · Lingzhi Zhang · Yotam Nitzan · Xiaoyang Liu · Zhe Lin · Eli Shechtman · Sohrab Amirghodsi · Yingyan (Celine) Lin
|
ExHall D Poster #216 | |
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
Poster Session 4
Yuxuan Wang · Yueqian Wang · Bo Chen · Tong Wu · Dongyan Zhao · Zilong Zheng
|
ExHall D Poster #299 | |
ReCap: Better Gaussian Relighting with Cross-Environment Captures
Poster Session 5
Jingzhi Li · Zongwei Wu · Eduard Zamfir · Radu Timofte
|
ExHall D Poster #25 | |
Complexity Experts are Task-Discriminative Learners for Any Image Restoration
Poster Session 3
Eduard Zamfir · Zongwei Wu · Nancy Mehta · Yuedong Tan · Danda Paudel · Yulun Zhang · Radu Timofte
|
ExHall D Poster #201 | |
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
Poster Session 6
Sitong Gong · Yunzhi Zhuge · Lu Zhang · Zongxin Yang · Pingping Zhang · Huchuan Lu
|
ExHall D Poster #290 | |
HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation
Poster Session 4
Hongwei Zheng · Han Li · Wenrui Dai · Ziyang Zheng · Chenglin Li · Junni Zou · Hongkai Xiong
|
ExHall D Poster #94 | |
Stabilizing and Accelerating Autofocus with Expert Trajectory Regularized Deep Reinforcement Learning
Poster Session 6
Shouhang Zhu · Chenglin Li · Yuankun Jiang · Li Wei · Nuowen Kan · Ziyang Zheng · Wenrui Dai · Junni Zou · Hongkai Xiong
|
ExHall D Poster #24 | |
PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval
Poster Session 4
Qiang Zou · Shuli Cheng · Jiayi Chen
|
ExHall D Poster #366 | |
Improve Representation for Imbalanced Regression through Geometric Constraints
Poster Session 1
Zijian Dong · Yilei Wu · Chongyao Chen · Yingtian Zou · Yichi Zhang · Juan Helen Zhou
|
ExHall D Poster #470 | |
Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution
Poster Session 2
Hang Xu · Jie Huang · Wei Yu · Jiangtong Tan · Zhen Zou · Feng Zhao
|
ExHall D Poster #205 | |
Leveraging SD Map to Augment HD Map-based Trajectory Prediction
Poster Session 4
Zhiwei Dong · Ran Ding · Wei Li · Zhang Peng · Guobin Tang · Jia Guo
|
ExHall D Poster #134 | |
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation
Poster Session 5
Zehuan Huang · Yuanchen Guo · Xingqiao An · Yunhan Yang · Yangguang Li · Zi-Xin Zou · Ding Liang · Xihui Liu · Yan-Pei Cao · Lu Sheng
|
ExHall D Poster #250 | |
EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting
Poster Session 2
Suzhen Wang · Weijie Chen · Wei Zhang · Minda Zhao · Lincheng Li · Rongsheng Zhang · Zhipeng Hu · Xin Yu
|
ExHall D Poster #13 | |
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
Poster Session 5
Wanhua Li · Renping Zhou · Jiawei Zhou · Yingwei Song · Johannes Herter · Minghan Qin · Gao Huang · Hanspeter Pfister
|
ExHall D Poster #91 | |
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
Poster Session 2
Liang Pan · Zeshi Yang · Zhiyang Dou · Wenjia Wang · Buzhen Huang · Bo Dai · Taku Komura · Jingbo Wang
|
ExHall D Poster #159 | |
UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning
Weiqi Yan · Lvhai Chen · Huaijia Kou · Shengchuan Zhang · Yan Zhang · Liujuan Cao
|
ExHall D Poster #404 | |
Evolving High-Quality Rendering and Reconstruction in a Unified Framework with Contribution-Adaptive Regularization
Poster Session 4
You Shen · Zhipeng Zhang · Xinyang Li · Yansong Qu · Yu Lin · Shengchuan Zhang · Liujuan Cao
|
ExHall D Poster #48 | |
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Poster Session 2
Sicheng Zuo · Wenzhao Zheng · Yuanhui Huang · Jie Zhou · Jiwen Lu
|
ExHall D Poster #134 | |
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization
Poster Session 1
Junying Wang · Jingyuan Liu · Xin Sun · Krishna Kumar Singh · ZHIXIN SHU · He Zhang · Jimei Yang · Nanxuan Zhao · Tuanfeng Y. Wang · Simon Su Chen · Ulrich Neumann · Jae Shin Yoon
|
ExHall D Poster #20 | |
Free-viewpoint Human Animation with Pose-correlated Reference Selection
Fa-Ting Hong · Zhan Xu · Haiyang Liu · Qinjie Lin · Luchuan Song · ZHIXIN SHU · Yang Zhou · Duygu Ceylan · Dan Xu
|
ExHall D Poster #5 | |
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces
Poster Session 1
Sumit Chaturvedi · Mengwei Ren · Yannick Hold-Geoffroy · Jingyuan Liu · Julie Dorsey · ZHIXIN SHU
|
ExHall D Poster #19 | |
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
Poster Session 4
Hanwen Jiang · Zexiang Xu · Desai Xie · Chen Ziwen · Haian Jin · Fujun Luan · ZHIXIN SHU · Kai Zhang · Sai Bi · Xin Sun · Jiuxiang Gu · Qixing Huang · Georgios Pavlakos · Hao Tan
|
ExHall D Poster #57 | |
Goku: Flow Based Video Generative Foundation Models
Shoufa Chen · Chongjian GE · Yuqi Zhang · Yida Zhang · Fengda Zhu · Hao Yang · Hongxiang Hao · hui wu · Zhichao Lai · Yifei Hu · Ting-Che Lin · Shilong Zhang · Fu Li · Chuan Li · Xing Wang · Yanghua Peng · Peize Sun · Ping Luo · Yi Jiang · Zehuan Yuan · BINGYUE PENG · Xiaobing Liu
|
ExHall D Poster #235 | |
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
Poster Session 2
Yifan Pu · Yiming Zhao · Zhicong Tang · Ruihong Yin · Haoxing Ye · Yuhui Yuan · Dong Chen · Jianmin Bao · Sirui Zhang · Yanbin Wang · Lin Liang · Lijuan Wang · Ji Li · Xiu Li · Zhouhui Lian · Gao Huang · Baining Guo
|
ExHall D Poster #247 | |
Arbitrary-steps Image Super-resolution via Diffusion Inversion
Poster Session 5
Zongsheng Yue · Kang Liao · Chen Change Loy
|
ExHall D Poster #199 | |
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds
Poster Session 2
Zhenggang Tang · Yuchen Fan · Dilin Wang · Hongyu Xu · Rakesh Ranjan · Alexander G. Schwing · Zhicheng Yan
|
ExHall D Poster #57 | |
D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-based Affective Recognition
Poster Session 4
Haoran Wang · Xinji Mai · Zeng Tao · Xuan Tong · Junxiong Lin · Yan Wang · Jiawen Yu · Shaoqi Yan · Ziheng Zhou · Wenqiang Zhang
|
ExHall D Poster #326 | |
Less Attention is More: Prompt Transformer for Generalized Category Discovery
Poster Session 6
Wei Zhang · Baopeng Zhang · Zhu Teng · Wenxin Luo · Junnan Zou · Jianping Fan
|
ExHall D Poster #400 | |
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection
Poster Session 5
Gensheng Pei · Tao Chen · Yujia Wang · Xinhao Cai · Xiangbo Shu · Tianfei Zhou · Yazhou Yao
|
ExHall D Poster #366 | |
Scaling up Image Segmentation across Data and Tasks
Poster Session 1
Pei Wang · Zhaowei Cai · Hao Yang · Ashwin Swaminathan · R. Manmatha · Stefano Soatto
|
ExHall D Poster #422 | |
PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Mobius Spatial Augmentation
Poster Session 1
Zidong Cao · Jinjing Zhu · Weiming Zhang · Hao Ai · Haotian Bai · Hengshuang Zhao · Lin Wang
|
ExHall D Poster #76 | |
Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives
Poster Session 5
Alex Hanson · Allen Tu · Geng Lin · Vasu Singla · Matthias Zwicker · Tom Goldstein
|
ExHall D Poster #46 | |
PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting
Poster Session 2
Alex Hanson · Allen Tu · Vasu Singla · Bethmage Mayuka Jayawardhana · Matthias Zwicker · Tom Goldstein
|
ExHall D Poster #48 | |
CADRef: Robust Out-of-Distribution Detection via Class-Aware Decoupled Relative Feature Leveraging
Poster Session 1
Zhiwei Ling · Yachen Chang · Hailiang Zhao · Xinkui Zhao · Kingsum Chow · Shuiguang Deng
|
ExHall D Poster #459 | |
Material Anything: Generating Materials for Any 3D Object via Diffusion
Xin Huang · Tengfei Wang · Ziwei Liu · Qing Wang
|
ExHall D Poster #38 | |
Generative Gaussian Splatting for Unbounded 3D City Generation
Poster Session 2
Haozhe Xie · Zhaoxi Chen · Fangzhou Hong · Ziwei Liu
|
ExHall D Poster #64 | |
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes
Poster Session 6
Xiang Xu · Lingdong Kong · hui shuai · Liang Pan · Ziwei Liu · Qingshan Liu
|
ExHall D Poster #116 | |
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
Poster Session 6
Zhaoxi Chen · Jiaxiang Tang · Yuhao Dong · Ziang Cao · Fangzhou Hong · Yushi Lan · Tengfei Wang · Haozhe Xie · Tong Wu · Shunsuke Saito · Liang Pan · Dahua Lin · Ziwei Liu
|
ExHall D Poster #40 | |
AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers
Poster Session 3
Jiazhi Guan · Kaisiyuan Wang · Zhiliang Xu · Quanwei Yang · Yasheng SUN · Shengyi He · Borong Liang · Yukang Cao · Yingying Li · Haocheng Feng · Errui Ding · Jingdong Wang · Youjian Zhao · Hang Zhou · Ziwei Liu
|
ExHall D Poster #3 | |
EgoLife: Towards Egocentric Life Assistant
Poster Session 6
Jingkang Yang · Shuai Liu · Hongming Guo · Yuhao Dong · Xiamengwei Zhang · Sicheng Zhang · Pengyun Wang · Zitang Zhou · Binzhu Xie · Ziyue Wang · Bei Ouyang · Zhengyu Lin · Marco Cominelli · Zhongang Cai · Bo Li · Yuanhan Zhang · Peiyuan Zhang · Fangzhou Hong · Joerg Widmer · Francesco Gringoli · Lei Yang · Ziwei Liu
|
ExHall D Poster #259 | |
Learning Bijective Surface Parameterization for Inferring Signed Distance Functions from Sparse Point Clouds with Grid Deformation
Poster Session 5
Takeshi Noda · Chao Chen · Junsheng Zhou · Weiqi Zhang · Yu-Shen Liu · Zhizhong Han
|
ExHall D Poster #104 | |
A Unified Model for Compressed Sensing MRI Across Undersampling Patterns
Poster Session 5
Armeet Singh Jatyani · Jiayun Wang · Aditi Chandrashekar · Zihui Wu · Miguel Liu-Schiaffini · Bahareh Tolooshams · Anima Anandkumar
|
ExHall D Poster #477 | |
MAGE: Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model
Poster Session 3
Haoyuan Wang · Zhenwei Wang · Xiaoxiao Long · Cheng Lin · Gerhard Hancke · Rynson W.H. Lau
|
ExHall D Poster #32 | |
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation
Poster Session 4
Zhiwei Yang · Yucong Meng · Kexue Fu · feilong tang · Shuo Wang · Zhijian Song
|
ExHall D Poster #421 | |
LOCORE: Image Re-ranking with Long-Context Sequence Modeling
Poster Session 2
Zilin Xiao · Pavel Suma · Ayush Sachdeva · Hao-Jen Wang · Giorgos Kordopatis-Zilos · Giorgos Tolias · Vicente Ordonez
|
ExHall D Poster #401 | |
IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner
Poster Session 2
Yuyang Huang · Yabo Chen · Li Ding · Xiaopeng Zhang · Wenrui Dai · Junni Zou · Hongkai Xiong · Qi Tian
|
ExHall D Poster #182 | |
Watermarking One for All: A Robust Watermarking Scheme Against Partial Image Theft
Poster Session 2
Gaozhi Liu · Silu Cao · Zhenxing Qian · Xinpeng Zhang · Sheng Li · Wanli Peng
|
ExHall D Poster #272 | |
Beyond Generation: A Diffusion-based Low-level Feature Extractor for Detecting AI-generated Images
Poster Session 2
Nan Zhong · Haoyu Chen · Yiran Xu · Zhenxing Qian · Xinpeng Zhang
|
ExHall D Poster #275 | |
Unified Medical Lesion Segmentation via Self-referring Indicator
Poster Session 2
Shijie Chang · Xiaoqi Zhao · Lihe Zhang · Tiancheng Wang
|
ExHall D Poster #480 | |
PIAD: Pose and Illumination agnostic Anomaly Detection
Poster Session 1
Kaichen Yang · Junjie Cao · Zeyu Bai · Zhixun Su · Andrea Tagliasacchi
|
ExHall D Poster #437 | |
Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly Detection
Poster Session 2
Wei Luo · Yunkang Cao · Haiming Yao · Xiaotian Zhang · Jianan Lou · Yuqi Cheng · Weiming Shen · Wenyong Yu
|
ExHall D Poster #438 | |
Incremental Object Keypoint Learning
Poster Session 5
Mingfu Liang · Jiahuan Zhou · Xu Zou · Ying Wu
|
ExHall D Poster #416 | |
DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving
Zhenhua Xu · Yan Bai · Yujia Zhang · Zhuoling Li · Fei Xia · Kwan-Yee K. Wong · Jianqiang Wang · Hengshuang Zhao
|
ExHall D Poster #138 | |
DORNet: A Degradation Oriented and Regularized Network for Blind Depth Super-Resolution
Poster Session 4
Zhengxue Wang · Zhiqiang Yan · Jinshan Pan · Guangwei Gao · Kai Zhang · Jian Yang
|
ExHall D Poster #46 | |
Hunyuan-Portrait: Implicit Condition Control for Enhanced Portrait Animation
Poster Session 4
Zunnan Xu · Zhentao Yu · Zixiang Zhou · Jun Zhou · Xiaoyu Jin · Fa-Ting Hong · Xiaozhong Ji · Junwei Zhu · Chengfei Cai · Shiyu Tang · Qin Lin · Xiu Li · qinglin lu
|
ExHall D Poster #5 | |
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding
Poster Session 3
Zichen Liu · Kunlun Xu · Bing Su · Xu Zou · Yuxin Peng · Jiahuan Zhou
|
ExHall D Poster #299 | |
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Poster Session 1
Xin Yan · Yuxuan Cai · Qiuyue Wang · Yuan Zhou · Wenhao Huang · Huan Yang
|
ExHall D Poster #289 | |
SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
Poster Session 6
chenkai zhang · Yiming Lei · Zeming Liu · Haitao Leng · Shaoguo Liu · Tingting Gao · Qingjie Liu · Yunhong Wang
|
ExHall D Poster #273 | |
From Poses to Identity: Training-Free Person Re-Identification via Feature Centralization
Poster Session 5
Chao Yuan · Guiwei Zhang · Changxiao Ma · Tianyi Zhang · Guanglin Niu
|
ExHall D Poster #323 | |
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Poster Session 2
Xinhao Liu · Jintong Li · Yicheng Jiang · Niranjan Sujay · Zhicheng Yang · Juexiao Zhang · John Abanes · Jing Zhang · Chen Feng
|
ExHall D Poster #144 | |
SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs
Poster Session 3
Guibiao Liao · Qing Li · Zhenyu Bao · Guoping Qiu · KANGLIN LIU
|
ExHall D Poster #58 | |
End-to-End HOI Reconstruction Transformer with Graph-based Encoding
Zhenrong Wang · Qi Zheng · Sihan Ma · Maosheng Ye · Yibing Zhan · Dongjiang Li
|
ExHall D Poster #147 | |
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding
Poster Session 6
feilong tang · Chengzhi Liu · Zhongxing Xu · Ming Hu · Zile Huang · Haochen Xue · Ziyang Chen · Zelin Peng · Zhiwei Yang · Sijin Zhou · Wenxue Li · Yulong Li · Wenxuan Song · Shiyan Su · Wei Feng · Jionglong Su · Mingquan Lin · Yifan Peng · Xuelian Cheng · Imran Razzak · Zongyuan Ge
|
ExHall D Poster #272 | |
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
Poster Session 1
Yikun Liu · Pingan Chen · jiayin cai · Xiaolong Jiang · Yao Hu · Jiangchao Yao · Yanfeng Wang · Weidi Xie
|
ExHall D Poster #366 | |
DCEvo: Discriminative Cross-Dimensional Evolutionary Learning for Infrared and Visible Image Fusion
Poster Session 1
Jinyuan Liu · Bowei Zhang · Qingyun Mei · Xingyuan Li · Yang Zou · Zhiying Jiang · Long Ma · Risheng Liu · Xin Fan
|
ExHall D Poster #193 | |
DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-Resolution
Poster Session 2
Xingyuan Li · Zirui Wang · Yang Zou · Zhixin Chen · Jun Ma · Zhiying Jiang · Long Ma · Jinyuan Liu
|
ExHall D Poster #207 | |
Human Motion Instruction Tuning
Poster Session 4
Lei Li · Sen Jia · Jianhao Wang · Zhongyu Jiang · Feng Zhou · Ju Dai · Tianfang Zhang · Zongkai Wu · Jenq-Neng Hwang
|
ExHall D Poster #170 | |
RealEdit: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations
Poster Session 3
Peter Sushko · Ayana Bharadwaj · Zhi Yang Lim · Vasily Ilin · Ben Caffee · Dongping Chen · Reza Salehi · Cheng-Yu Hsieh · Ranjay Krishna
|
ExHall D Poster #263 | |
Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models
Poster Session 5
Zhaoyi Liu · Huan Zhang
|
ExHall D Poster #384 | |
Continuous Adverse Weather Removal via Degradation-Aware Distillation
Poster Session 6
Xin Lu · Jie Xiao · Yurui Zhu · Xueyang Fu
|
ExHall D Poster #185 | |
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
Poster Session 4
Zeyue Tian · Zhaoyang Liu · Ruibin Yuan · Jiahao Pan · Qifeng Liu · Xu Tan · Qifeng Chen · Wei Xue · Yike Guo
|
ExHall D Poster #286 | |
Cross-modal Causal Relation Alignment for Video Question Grounding
weixing chen · Yang Liu · Binglin Chen · Jiandong Su · Yongsen Zheng · Liang Lin
|
ExHall D Poster #293 | |
MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation
Poster Session 3
Mingcheng Li · Xiaolu Hou · Ziyang Liu · Dingkang Yang · Ziyun Qian · Jiawei Chen · Jinjie Wei · Yue Jiang · Qingyao Xu · Lihua Zhang
|
ExHall D Poster #249 | |
Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content
Poster Session 3
Zicheng Zhang · Tengchuan Kou · Chunyi Li · Shushi Wang · Wei Sun · Wei Wang · Xiaoyu Li · ZongYu Wang · Xuezhi Cao · Xiongkuo Min · Xiaohong Liu · Guangtao Zhai
|
ExHall D Poster #358 | |
Multitwine: Multi-Object Compositing with Text and Layout Control
Gemma Canet Tarrés · Zhe Lin · Zhifei Zhang · He Zhang · Andrew Gilbert · John Collomosse · Soo Ye Kim
|
ExHall D Poster #260 | |
Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing
Poster Session 5
Ruiyi Wang · Yushuo Zheng · Zicheng Zhang · Chunyi Li · Shuaicheng Liu · Guangtao Zhai · Xiaohong Liu
|
ExHall D Poster #193 | |
Cross-modal Information Flow in Multimodal Large Language Models
Poster Session 4
Zhi Zhang · Srishti Yadav · Fengze Han · Ekaterina Shutova
|
ExHall D Poster #379 | |
Image Quality Assessment: From Human to Machine Preference
Chunyi Li · Yuan Tian · Xiaoyue Ling · Zicheng Zhang · Haodong Duan · Haoning Wu · Ziheng Jia · Xiaohong Liu · Xiongkuo Min · Guo Lu · Weisi Lin · Guangtao Zhai
|
ExHall D Poster #210 | |
Continuous Space-Time Video Resampling with Invertible Motion Steganography
Poster Session 1
Yuantong zhang · Zhenzhong Chen
|
ExHall D Poster #183 | |
Fitted Neural Lossless Image Compression
Poster Session 5
Zhe Zhang · Zhenzhong Chen · Shan Liu
|
ExHall D Poster #209 | |
HomoGen: Enhanced Video Inpainting via Homography Propagation and Diffusion
Poster Session 5
Ding Ding · Yueming Pan · Ruoyu Feng · Qi Dai · Kai Qiu · Jianmin Bao · Chong Luo · Zhenzhong Chen
|
ExHall D Poster #180 | |
Boost the Inference with Co-training: A Depth-guided Mutual Learning Framework for Semi-supervised Medical Polyp Segmentation
Poster Session 2
Yuxin Li · Zihao Zhu · Yuxiang Zhang · Yifan Chen · Zhibin Yu
|
ExHall D Poster #478 | |
Improving the Training of Data-Efficient GANs via Quality Aware Dynamic Discriminator Rejection Sampling
Poster Session 6
Zhaoyu Zhang · Yang Hua · Guanxiong Sun · Hui Wang · Seán F. McLoone
|
ExHall D Poster #434 | |
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Xi Chen · Zhifei Zhang · He Zhang · Yuqian Zhou · Soo Ye Kim · Qing Liu · Yijun Li · Jianming Zhang · Nanxuan Zhao · Yilin Wang · Hui Ding · Zhe Lin · Hengshuang Zhao
|
ExHall D Poster #176 | |
Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
Poster Session 2
Teng Hu · Jiangning Zhang · Ran Yi · Jieyu Weng · Yabiao Wang · Xianfang Zeng · Xuezhucun Xue · Lizhuang Ma
|
ExHall D Poster #379 | |
PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model
Poster Session 2
Mingju Gao · Yike Pan · Huan-ang Gao · Zongzheng Zhang · Wenyi Li · Hao Dong · Hao Tang · Li Yi · Hao Zhao
|
ExHall D Poster #156 | |
Balanced Direction from Multifarious Choices: Arithmetic Meta-Learning for Domain Generalization
Poster Session 6
Xiran Wang · Jian Zhang · Lei Qi · Yinghuan Shi
|
ExHall D Poster #424 | |
ODA-GAN: Orthogonal Decoupling Alignment GAN Assisted by Weakly-supervised Learning for Virtual Immunohistochemistry Staining
Poster Session 5
Tong Wang · Mingkang Wang · Zhongze Wang · Hongkai Wang · Qi Xu · Fengyu Cong · Hongming Xu
|
ExHall D Poster #469 | |
MonSter: Marry Monodepth to Stereo Unleashes Power
JunDa Cheng · Longliang Liu · Gangwei Xu · Xianqi Wang · Zhaoxing Zhang · Yong Deng · Jinliang Zang · Yurui Chen · zhipeng cai · Xin Yang
|
ExHall D Poster #82 | |
Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction
Poster Session 1
Cecilia Curreli · Dominik Muhle · Abhishek Saroha · Zhenzhang Ye · Riccardo Marin · Daniel Cremers
|
ExHall D Poster #158 | |
One Model for ALL: Low-Level Task Interaction Is a Key to Task-Agnostic Image Fusion
Poster Session 6
Chunyang Cheng · Tianyang Xu · Zhenhua Feng · Xiaojun Wu · Zhangyong Tang · Hui Li · Zhang Zeyang · Sara Atito · Muhammad Awais · Josef Kittler
|
ExHall D Poster #184 | |
One-for-More: Continual Diffusion Model for Anomaly Detection
Poster Session 1
Xiaofan Li · Xin Tan · Zhuo Chen · Zhizhong Zhang · Ruixin Zhang · Rizen Guo · Guannan Jiang · Yulong Chen · Yanyun Qu · Lizhuang Ma · Yuan Xie
|
ExHall D Poster #440 | |
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
Poster Session 2
Dongliang Luo · Hanshen Zhu · Ziyang Zhang · Dingkang Liang · Xudong Xie · Yuliang Liu · Xiang Bai
|
ExHall D Poster #377 | |
Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion
Poster Session 5
Yuxi Mi · Zhizhou Zhong · Yuge Huang · Qiuyang Yuan · Xuan Zhao · Jianqing Xu · Shouhong Ding · ShaoMing Wang · Rizen Guo · Shuigeng Zhou
|
ExHall D Poster #15 | |
Exploring Contextual Attribute Density in Referring Expression Counting
Poster Session 4
Zhicheng Wang · Zhiyu Pan · Zhan Peng · Jian Cheng · Liwen Xiao · Wei Jiang · Zhiguo Cao
|
ExHall D Poster #360 | |
DEAL: Data-Efficient Adversarial Learning for High-Quality Infrared Imaging
Poster Session 6
Zhu Liu · Zijun Wang · Jinyuan Liu · Fanqi Meng · Long Ma · Risheng Liu
|
ExHall D Poster #194 | |
Identity-Clothing Similarity Modeling for Unsupervised Clothing Change Person Re-Identification
Poster Session 4
Zhiqi Pang · Junjie Wang · Lingling Zhao · Chunyu Wang
|
ExHall D Poster #329 | |
Escaping Plato's Cave: Towards the Alignment of 3D and Text Latent Spaces
Poster Session 4
Souhail Hadgi · Luca Moschella · Andrea Santilli · Diego Gomez · Qixing Huang · Emanuele Rodolà · Simone Melzi · Maks Ovsjanikov
|
ExHall D Poster #383 | |
Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning
Poster Session 3
Juntae Lee · Munawar Hayat · Sungrack Yun
|
ExHall D Poster #448 | |
Object-aware Sound Source Localization via Audio-Visual Scene Understanding
Poster Session 2
Sung Jin Um · Dongjin Kim · Sangmin Lee · Jung Uk Kim
|
ExHall D Poster #284 | |
Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation
Poster Session 6
Shuling Zhao · Fa-Ting Hong · Xiaoshui Huang · Dan Xu
|
ExHall D Poster #3 | |
Structure from Collision
Takuhiro Kaneko
|
ExHall D Poster #44 | |
Adapter Merging with Centroid Prototype Mapping for Scalable Class-Incremental Learning
Poster Session 1
Takuma Fukuda · Hiroshi Kera · Kazuhiko Kawamoto
|
ExHall D Poster #451 | |
CRISP: Object Pose and Shape Estimation with Test-Time Adaptation
Jingnan Shi · Rajat Talak · Harry Zhang · David Jin · Luca Carlone
|
ExHall D Poster #96 | |
EvOcc: Accurate Semantic Occupancy for Automated Driving Using Evidence Theory
Poster Session 6
Jonas Kälble · Sascha Wirges · Maxim Tatarchenko · Eddy Ilg
|
ExHall D Poster #125 | |
RoadSocial: A Diverse VideoQA Dataset and Benchmark for Road Event Understanding from Social Video Narratives
Poster Session 4
Chirag Parikh · Deepti Rawat · Rakshitha R. T. · Tathagata Ghosh · Ravi Kiran Sarvadevabhatla
|
ExHall D Poster #306 | |
Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures
Poster Session 1
Guoxing Sun · Rishabh Dabral · Heming Zhu · Pascal Fua · Christian Theobalt · Marc Habermann
|
ExHall D Poster #37 | |
VidSeg: Training-free Video Semantic Segmentation based on Diffusion Models
Poster Session 5
Qian Wang · Abdelrahman Eldesokey · Mohit Mendiratta · Fangneng Zhan · Adam Kortylewski · Christian Theobalt · Peter Wonka
|
ExHall D Poster #183 | |
A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations
Poster Session 1
Theo Bodrito · Olivier Flasseur · Julien Mairal · Jean Ponce · Maud Langlois · Anne-Marie Lagrange
|
ExHall D Poster #99 | |
Tuning the Frequencies: Robust Training for Sinusoidal Neural Networks
Tiago Novello · Diana Aldana Moreno · André Araujo · Luiz Velho
|
ExHall D Poster #278 | |
Flash-Split: 2D Reflection Removal with Flash Cues and Latent Diffusion Separation
Poster Session 2
Tianfu Wang · Mingyang Xie · Haoming Cai · Sachin Shah · Christopher Metzler
|
ExHall D Poster #23 | |
3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer
Poster Session 1
Jiajun Deng · Tianyu He · Li Jiang · Tianyu Wang · Feras Dayoub · Ian Reid
|
ExHall D Poster #343 | |
Using Diffusion Priors for Video Amodal Segmentation
Poster Session 5
Kaihua Chen · Deva Ramanan · Tarasha Khurana
|
ExHall D Poster #174 | |
Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation
Poster Session 6
Joohyun Kwon · Hanbyel Cho · Junmo Kim
|
ExHall D Poster #68 | |
ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions
Poster Session 6
Tomas Soucek · Prajwal Gatti · Michael Wray · Ivan Laptev · Dima Damen · Josef Sivic
|
ExHall D Poster #122 | |
ESCAPE: Equivariant Shape Completion via Anchor Point Encoding
Poster Session 2
Burak Bekci · Nassir Navab · Federico Tombari · Mahdi Saleh
|
ExHall D Poster #105 | |
LoRACLR: Contrastive Adaptation for Customization of Diffusion Models
Poster Session 3
Enis Simsar · Thomas Hofmann · Federico Tombari · Pinar Yanardag
|
ExHall D Poster #242 | |
Test-Time Visual In-Context Tuning
Poster Session 4
Jiahao Xie · Alessio Tonioni · Nathalie Rauschmayr · Federico Tombari · Bernt Schiele
|
ExHall D Poster #400 | |
Pose Priors from Language Models
Poster Session 2
Sanjay Subramanian · Evonne Ng · Lea Müller · Dan Klein · Shiry Ginosar · Trevor Darrell
|
ExHall D Poster #169 | |
Scaling Vision Pre-Training to 4K Resolution
Baifeng Shi · Boyi Li · Han Cai · Yao Lu · Sifei Liu · Marco Pavone · Jan Kautz · Song Han · Trevor Darrell · Pavlo Molchanov · Danny Yin
|
ExHall D Poster #406 | |
Towards Efficient Foundation Model for Zero-shot Amodal Segmentation
Poster Session 4
Zhaochen Liu · Limeng Qiao · Xiangxiang Chu · Lin Ma · Tingting Jiang
|
ExHall D Poster #424 | |
MET3R: Measuring Multi-View Consistency in Generated Images
Poster Session 2
Mohammad Asim · Christopher Wewer · Thomas Wimmer · Bernt Schiele · Jan Lenssen
|
ExHall D Poster #56 | |
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism
Poster Session 1
Zhixiong Nan · Xianghong Li · Tao Xiang · Jifeng Dai
|
ExHall D Poster #434 | |
Focusing on Tracks for Online Multi-Object Tracking
Poster Session 3
Kyujin Shim · Kangwook Ko · YuJin Yang · Changick Kim
|
ExHall D Poster #100 | |
Physical Plausibility-aware Trajectory Prediction via Locomotion Embodiment
Poster Session 3
Hiromu Taketsugu · Takeru Oba · Takahiro Maeda · Shohei Nobuhara · Norimichi Ukita
|
ExHall D Poster #160 | |
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments
Poster Session 4
Ege Özsoy · Chantal Pellegrini · Tobias Czempiel · Felix Tristram · Kun yuan · David Bani-Harouni · Ulrich Eck · Benjamin Busam · Matthias Keicher · Nassir Navab
|
ExHall D Poster #341 | |
DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching
Poster Session 3
Emanuele Aiello · Umberto Michieli · Diego Valsesia · Mete Ozay · Enrico Magli
|
ExHall D Poster #174 | |
MammAlps: A Multi-view Video Behavior Monitoring Dataset of Wild Mammals in the Swiss Alps
Valentin Gabeff · Haozhe Qi · Brendan Flaherty · Gencer Sumbul · Alexander Mathis · Devis Tuia
|
ExHall D Poster #306 | |
Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
Poster Session 4
Arun Reddy · Alexander Martin · Eugene Yang · Andrew Yates · Kate Sanders · Kenton Murray · Reno Kriz · Celso M. de Melo · Benjamin Van Durme · Rama Chellappa
|
ExHall D Poster #370 | |
MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval
Poster Session 5
Reno Kriz · Kate Sanders · David Etter · Kenton Murray · Cameron Carpenter · Hannah Recknor · Jimena Guallar-Blasco · Alexander Martin · Eugene Yang · Benjamin Van Durme
|
ExHall D Poster #299 | |
PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields
Poster Session 3
Sean Wu · Shamik Basu · Tim Broedermann · Luc Van Gool · Christos Sakaridis
|
ExHall D Poster #31 | |
One2Any: One-Reference 6D Pose Estimation for Any Object
Poster Session 2
Mengya Liu · Siyuan Li · Ajad Chhatkuli · Prune Truong · Luc Van Gool · Federico Tombari
|
ExHall D Poster #103 | |
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation
Poster Session 5
Nicolas Dufour · Vicky Kalogeiton · David Picard · Loic Landrieu
|
ExHall D Poster #186 | |
SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving
Su Sun · Cheng Zhao · Zhuoyang Sun · Yingjie Chen · Mei Chen
|
ExHall D Poster #127 | |
Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning
Poster Session 3
Isma Hadji · Mehdi Noroozi · Victor Escorcia · Anestis Zaganidis · Brais Martinez · Georgios Tzimiropoulos
|
ExHall D Poster #204 | |
Learning from Streaming Video with Orthogonal Gradients
Poster Session 3
Tengda Han · Dilara Gokay · Joseph Heyward · Chuhan Zhang · Daniel Zoran · Viorica Patraucean · Joao Carreira · Dima Damen · Andrew Zisserman
|
ExHall D Poster #286 | |
Precise Event Spotting in Sports Videos: Solving Long-Range Dependency and Class Imbalance
Poster Session 1
Sanchayan Santra · Vishal Chudasama · Pankaj Wasnik · Vineeth Balasubramanian
|
ExHall D Poster #287 | |
Towards a Universal Synthetic Video Detector: From Face or Background Manipulations to Fully AI-Generated Content
Poster Session 6
Rohit Kundu · Hao Xiong · Vishal Mohanty · Athula Balachandran · Amit K. Roy-Chowdhury
|
ExHall D Poster #179 | |
Distilling Multi-modal Large Language Models for Autonomous Driving
Poster Session 6
Deepti Hegde · Rajeev Yasarla · Hong Cai · Shizhong Han · Apratim Bhattacharyya · Shweta Mahajan · Litian Liu · Risheek Garrepalli · Vishal M. Patel · Fatih Porikli
|
ExHall D Poster #135 | |
Attention IoU: Examining Biases in CelebA using Attention Maps
Poster Session 1
Aaron Serianni · Tyler Zhu · Olga Russakovsky · Vikram V. Ramaswamy
|
ExHall D Poster #405 | |
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
Poster Session 4
Duc-Hai Pham · Tung Do · Phong Nguyen · Binh-Son Hua · Khoi Nguyen · Rang Nguyen
|
ExHall D Poster #119 | |
LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion
Poster Session 4
Muchen Li · Sammy Christen · Chengde Wan · Yujun Cai · Renjie Liao · Leonid Sigal · Shugao Ma
|
ExHall D Poster #155 | |
Positive2Negative: Breaking the Information-Lossy Barrier in Self-Supervised Single Image Denoising
Poster Session 4
Tong Li · Lizhi Wang · Zhiyuan Xu · Lin Zhu · Wanxuan Lu · Hua Huang
|
ExHall D Poster #202 | |
Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network
Poster Session 5
Xingyu Qiu · Mengying Yang · Xinghua Ma · Fanding Li · Dong Liang · Gongning Luo · wei wang · Kuanquan Wang · Shuo Li
|
ExHall D Poster #207 | |
Reversible Decoupling Network for Single Image Reflection Removal
Poster Session 6
Hao Zhao · Mingjia Li · Qiming Hu · Xiaojie Guo
|
ExHall D Poster #23 | |
Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis
Poster Session 5
Boming Miao · Chunxiao Li · Xiaoxiao Wang · Andi Zhang · Rui Sun · Zizhe Wang · Yao Zhu
|
ExHall D Poster #241 | |
Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models
Poster Session 2
Sangwon Jang · June Suk Choi · Jaehyeong Jo · Kimin Lee · Sung Ju Hwang
|
ExHall D Poster #270 | |
PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
Poster Session 4
Qiyao Xue · Xiangyu Yin · Boyuan Yang · Wei Gao
|
ExHall D Poster #290 | |
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis
Poster Session 6
Weiguang Zhao · Rui Zhang · Qiufeng Wang · Guangliang Cheng · Kaizhu Huang
|
ExHall D Poster #310 | |
GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation
Poster Session 6
Weihang Li · Hongli XU · Junwen Huang · HyunJun Jung · Kuan-Ting Yu · Nassir Navab · Benjamin Busam
|
ExHall D Poster #96 | |
Z-Magic: Zero-shot Multiple Attributes Guided Image Creator
Poster Session 4
Yingying Deng · Xiangyu He · Fan Tang · Weiming Dong
|
ExHall D Poster #247 | |
FoundationStereo: Zero-Shot Stereo Matching
Poster Session 2
Bowen Wen · Matthew Trepte · Oluwaseun Joseph Aribido · Jan Kautz · Orazio Gallo · Stan Birchfield
|
ExHall D Poster #81 | |
Any6D: Model-free 6D Pose Estimation of Novel Object
Poster Session 3
Taeyeop Lee · Bowen Wen · Minjun Kang · Gyuree Kang · In So Kweon · Kuk-Jin Yoon
|
ExHall D Poster #95 | |
Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation
Poster Session 3
Jiaxin Cai · Jingze Su · Qi Li · Wenjie Yang · Shu Wang · Tiesong Zhao · Shengfeng He · Wenxi Liu
|
ExHall D Poster #410 | |
CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework
Poster Session 2
Yanlong Xu · Haoxuan Qu · Jun Liu · Wenxiao Zhang · Xun Yang
|
ExHall D Poster #121 | |
Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models
Poster Session 6
Namhyuk Ahn · KiYoon Yoo · Wonhyuk Ahn · Daesik Kim · Seung-Hun Nam
|
ExHall D Poster #251 | |
Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision
Poster Session 3
Jinneyong Kim · Seung-Hwan Baek
|
ExHall D Poster #80 | |
Dual Exposure Stereo for Extended Dynamic Range 3D Imaging
Poster Session 2
Juhyung Choi · Jinneyong Kim · Seokjun Choi · Jinwoo Lee · Samuel Brucker · Mario Bijelic · Felix Heide · Seung-Hwan Baek
|
ExHall D Poster #83 | |
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
Poster Session 3
Cheng Zhang · Haofei Xu · Qianyi Wu · Camilo Cruz Gambardella · Dinh Phung · Jianfei Cai
|
ExHall D Poster #74 | |
Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing
Poster Session 1
Shengzhi Wang · Yingkang Zhong · Jiangchuan Mu · Kai WU · Mingliang Xiong · Wen Fang · Mingqing Liu · Hao Deng · Bin He · Gang Li · Qingwen Liu
|
ExHall D Poster #179 | |
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation
Poster Session 5
Yuyang Peng · Shishi Xiao · Keming Wu · Qisheng Liao · Bohan CHEN · Kevin Lin · Danqing Huang · Ji Li · Yuhui Yuan
|
ExHall D Poster #247 | |
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
Hanlin Wang · Hao Ouyang · Qiuyu Wang · Wen Wang · Ka Leong Cheng · Qifeng Chen · Yujun Shen · Limin Wang
|
ExHall D Poster #175 | |
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Poster Session 6
Weijia Wu · Mingyu Liu · Zeyu Zhu · Haoen Feng · Xi Xia · Wen Wang · Kevin Qinghong Lin · Chunhua Shen · Mike Zheng Shou
|
ExHall D Poster #271 | |
Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning
Poster Session 6
Sherry X. Chen · Misha Sra · Pradeep Sen
|
ExHall D Poster #224 | |
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation
Poster Session 4
Diljeet Jagpal · Xi Chen · Vinay P. Namboodiri
|
ExHall D Poster #231 | |
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes
Poster Session 6
Yuji Wang · Haoran Xu · Yong Liu · Jiaze Li · Yansong Tang
|
ExHall D Poster #264 | |
Shape and Texture: What Influences Reliable Optical Flow Estimation?
Poster Session 6
Libo Long · Xiao Hu · Jochen Lang
|
ExHall D Poster #164 | |
Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters
Poster Session 6
Yuan Wang · Ouxiang Li · Tingting Mu · Yanbin Hao · Kuien Liu · Xiang Wang · Xiangnan He
|
ExHall D Poster #247 | |
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Poster Session 4
Caoshuo Li · Tanzhe Li · Xiaobin Hu · Donghao Luo · Taisong Jin
|
ExHall D Poster #415 | |
Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction
Seungtae Nam · Xiangyu Sun · Gyeongjin Kang · Younggeun Lee · Seungjun Oh · Eunbyung Park
|
ExHall D Poster #50 | |
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
Poster Session 3
Xiao-Hui Li · Fei Yin · Cheng-Lin Liu
|
ExHall D Poster #418 | |
Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility
Yidi Li · Jun Xiao · Zhengda Lu · Yiqun Wang · Haiyong Jiang
|
ExHall D Poster #263 | |
Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations
Poster Session 5
Haitong Liu · Kuofeng Gao · Yang Bai · Jinmin Li · Jinxiao Shan · Tao Dai · Shu-Tao Xia
|
ExHall D Poster #290 | |
FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
Poster Session 1
Haosen Yang · Adrian Bulat · Isma Hadji · Hai X. Pham · Xiatian Zhu · Georgios Tzimiropoulos · Brais Martinez
|
ExHall D Poster #219 | |
Improving Gaussian Splatting with Localized Points Management
Haosen Yang · Chenhao Zhang · Wenqing Wang · Marco Volino · Adrian Hilton · Li Zhang · Xiatian Zhu
|
ExHall D Poster #61 | |
DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos
Poster Session 5
Zijia Lu · ASM Iftekhar · Gaurav Mittal · Tianjian Meng · Xiawei Wang · Cheng Zhao · Rohith Kukkala · Ehsan Elhamifar · Mei Chen
|
ExHall D Poster #291 | |
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Large Model Enhancement
Poster Session 1
Qianhan Feng · Wenshuo Li · Tong Lin · Xinghao Chen
|
ExHall D Poster #383 | |
Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Poster Session 1
Xianing Chen · Si Huo · Borui Jiang · Hailin Hu · Xinghao Chen
|
ExHall D Poster #428 | |
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Baorui Ma · Huachen Gao · Haoge Deng · Zhengxiong Luo · Tiejun Huang · Lulu Tang · Xinlong Wang
|
ExHall D Poster #172 | |
SketchVideo: Sketch-based Video Generation and Editing
Poster Session 5
Feng-Lin Liu · Hongbo Fu · Xintao Wang · Weicai Ye · Pengfei Wan · Di ZHANG · Lin Gao
|
ExHall D Poster #222 | |
PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation
Poster Session 5
Xinting Hu · Haoran Wang · Jan Lenssen · Bernt Schiele
|
ExHall D Poster #264 | |
Exploration-Driven Generative Interactive Environments
Poster Session 6
Nedko Savov · Naser Kazemi · Mohammad Mahdi · Danda Paudel · Xi Wang · Luc Van Gool
|
ExHall D Poster #137 | |
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
ruojun xu · Weijie Xi · Xiaodi Wang · Yongbo Mao · Zach Cheng
|
ExHall D Poster #235 | |
PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection
Poster Session 1
Jianan Ye · Weiguang Zhao · Xi Yang · Guangliang Cheng · Kaizhu Huang
|
ExHall D Poster #110 | |
SoftShadow: Leveraging Soft Masks for Penumbra-Aware Shadow Removal
Poster Session 5
Xinrui Wang · Lanqing Guo · Xiyu Wang · Siyu Huang · Bihan Wen
|
ExHall D Poster #206 | |
LLaVA-Critic: Learning to Evaluate Multimodal Models
Poster Session 3
Tianyi Xiong · Xiyao Wang · Dong Guo · Qinghao Ye · Haoqi Fan · Quanquan Gu · Heng Huang · Chunyuan Li
|
ExHall D Poster #283 | |
ACL: Activating Capability of Linear Attention for Image Restoration
Poster Session 4
Yubin Gu · Yuan Meng · Jiayi Ji · Xiaoshuai Sun
|
ExHall D Poster #201 | |
MOS: Modeling Object-Scene Associations in Generalized Category Discovery
Poster Session 3
Zhengyuan Peng · Jinpeng Ma · Zhimin Sun · Ran Yi · Haichuan Song · Xin Tan · Lizhuang Ma
|
ExHall D Poster #428 | |
Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset
Poster Session 2
Yiqun Mei · Mingming He · Li Ma · Julien Philip · Wenqi Xian · David M George · Xueming Yu · Gabriel Dedic · Ahmet Levent Taşel · Ning Yu · Vishal M. Patel · Paul Debevec
|
ExHall D Poster #6 | |
Rectified Diffusion Guidance for Conditional Generation
Poster Session 3
Mengfei Xia · Nan Xue · Yujun Shen · Ran Yi · Tieliang Gong · Yong-Jin Liu
|
ExHall D Poster #259 | |
PHGC: Procedural Heterogeneous Graph Completion for Natural Language Task Verification in Egocentric Videos
Poster Session 2
Xun Jiang · Zhiyi Huang · Xing Xu · Jingkuan Song · Fumin Shen · Heng Tao Shen
|
ExHall D Poster #309 | |
Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images
Junxian Wu · Minheng Chen · Xinyi Ke · Tianwang Xun · Xiaoming Jiang · Hongyu Zhou · Lizhi Shao · Youyong Kong
|
||
When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning
Poster Session 5
Yang Liu · Qianqian Xu · Peisong Wen · Siran Dai · Qingming Huang
|
ExHall D Poster #288 | |
Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection
Poster Session 6
wenqiao Li · Yao Gu · Xintao Chen · Xiaohao Xu · Ming Hu · Xiaonan Huang · Yingna Wu
|
ExHall D Poster #408 | |
EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild
Poster Session 2
Yumeng Liu · Xiaoxiao Long · Zemin Yang · Yuan Liu · Marc Habermann · Christian Theobalt · Yuexin Ma · Wenping Wang
|
ExHall D Poster #160 | |
CocoER: Aligning Multi-Level Feature by Competition and Coordination for Emotion Recognition
Poster Session 6
Xuli Shen · Hua Cai · Weilin Shen · Qing Xu · Dingding Yu · Weifeng Ge · Xiangyang Xue
|
ExHall D Poster #328 | |
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
Xin Zhang · Robby T. Tan
|
ExHall D Poster #370 | |
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations
Poster Session 1
Xunzhi Zheng · Dan Xu
|
ExHall D Poster #77 | |
HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution
Poster Session 1
Yuxuan Jiang · Ho Man Kwan · Jasmine Peng · Ge Gao · Fan Zhang · Xiaoqing Zhu · Joel Sole · David Bull
|
ExHall D Poster #200 | |
Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization
Poster Session 4
Siyan Dong · Shuzhe Wang · Shaohui Liu · Lulu Cai · Qingnan Fan · Juho Kannala · Yanchao Yang
|
ExHall D Poster #87 | |
SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation
Poster Session 3
Dekai Zhu · Yan Di · Stefan Gavranovic · Slobodan Ilic
|
ExHall D Poster #111 | |
Attribute-formed Class-specific Concept Space: Endowing Language Bottleneck Model with Better Interpretability and Scalability
Poster Session 6
Jianyang Zhang · Qianli Luo · Guowu Yang · Wenjing Yang · Weide Liu · Guosheng Lin · Fengmao Lv
|
ExHall D Poster #397 | |
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation
Poster Session 2
Silin Cheng · Yang Liu · Xinwei He · Sebastien Ourselin · Lei Tan · Luo
|
ExHall D Poster #363 | |
Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning
Poster Session 4
Buzhen Huang · Chen Li · Chongyang Xu · Dongyue Lu · Jinnan Chen · Yangang Wang · Gim Hee Lee
|
ExHall D Poster #160 | |
Sonata: Self-Supervised Learning of Reliable Point Representations
Xiaoyang Wu · Daniel DeTone · Duncan Frost · Tianwei Shen · Chris Xie · Nan Yang · Jakob Engel · Richard Newcombe · Hengshuang Zhao · Julian Straub
|
ExHall D Poster #109 | |
IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation
Poster Session 1
Yiren Song · Pei Yang · Hai Ci · Mike Zheng Shou
|
ExHall D Poster #273 | |
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering
Poster Session 5
Zhen Yang · Zhuo Tao · Qi Chen · Yuankai Qi · Liang Li · Anton van den Hengel · Qingming Huang
|
ExHall D Poster #356 | |
Uncertain Multimodal Intention and Emotion Understanding in the Wild
Poster Session 5
Qu Yang · QingHongYa Shi · Tongxin Wang · Mang Ye
|
ExHall D Poster #351 | |
LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models
Poster Session 6
Jian Liang · Wenke Huang · Guancheng Wan · Qu Yang · Mang Ye
|
ExHall D Poster #329 | |
On the Out-Of-Distribution Generalization of Large Multimodal Models
Poster Session 2
Xingxuan Zhang · Jiansheng Li · Wenjing Chu · Junjia Hai · Renzhe Xu · Yuqing Yang · Shikai Guan · Jiazheng Xu · Liping Jing · Peng Cui
|
ExHall D Poster #471 | |
MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
Poster Session 4
Zifan Wang · Ziqing Chen · Junyu Chen · Jilong Wang · Yuxin Yang · Yunze Liu · Xueyi Liu · He Wang · Li Yi
|
ExHall D Poster #143 | |
Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking
Poster Session 3
Hongkai Wei · Yang Yang · Shijie Sun · Mingtao Feng · Xiangyu Song · Qi Lei · Hongli Hu · Rong Wang · Huansheng Song · Naveed Akhtar · Ajmal Mian
|
ExHall D Poster #309 | |
Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression
Poster Session 1
Lucas Relic · Roberto Azevedo · Yang Zhang · Markus Gross · Christopher Schroers
|
ExHall D Poster #217 | |
Towards Practical Real-Time Neural Video Compression
Poster Session 3
Zhaoyang Jia · Bin Li · Jiahao Li · Wenxuan Xie · Linfeng Qi · Houqiang Li · Yan Lu
|
ExHall D Poster #180 | |
Bridge Frame and Event: Common Spatiotemporal Fusion for High-Dynamic Scene Optical Flow
Poster Session 6
Hanyu Zhou · Haonan Wang · Haoyue Liu · Yuxing Duan · Yi Chang · Luxin Yan
|
ExHall D Poster #165 | |
Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAV Target Detection
Houzhang Fang · Xiaolin Wang · Zengyang Li · Lu Wang · Qingshan Li · Yi Chang · Luxin Yan
|
ExHall D Poster #121 | |
TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion
Poster Session 4
Haoyue Liu · Jinghan Xu · Yi Chang · Hanyu Zhou · Haozhi Zhao · Lin Wang · Luxin Yan
|
ExHall D Poster #176 | |
HOP: Heterogeneous Topology-based Multimodal Entanglement for Co-Speech Gesture Generation
Poster Session 1
Hongye Cheng · Tianyu Wang · Guangsi Shi · Zexing Zhao · Yanwei Fu
|
ExHall D Poster #69 | |
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
Poster Session 2
Chenjie Cao · Chaohui Yu · Shang Liu · Fan Wang · Xiangyang Xue · Yanwei Fu
|
ExHall D Poster #58 | |
CustAny: Customizing Anything from A Single Example
Poster Session 5
Lingjie Kong · Kai Wu · Chengming Xu · Xiaobin Hu · Wenhui Han · Jinlong Peng · Donghao Luo · Mengtian Li · Jiangning Zhang · Chengjie Wang · Yanwei Fu
|
ExHall D Poster #246 | |
Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors
Poster Session 5
Weilong Yan · Ming Li · Li Haipeng · Shuwei Shao · Robby T. Tan
|
ExHall D Poster #79 | |
Building Vision Models upon Heat Conduction
Poster Session 2
Zhaozhi Wang · Yue Liu · Yunjie Tian · Yunfan Liu · Yaowei Wang · Qixiang Ye
|
ExHall D Poster #413 | |
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
Poster Session 3
Jingzhou Luo · Yang Liu · Weixing Chen · Zhen Li · Yaowei Wang · Guanbin Li · Liang Lin
|
ExHall D Poster #337 | |
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
Poster Session 4
Niu Lian · Jun Li · Jinpeng Wang · Ruisheng Luo · Yaowei Wang · Shu-Tao Xia · Bin Chen
|
ExHall D Poster #295 | |
Unity in Diversity: Video Editing via Gradient-Latent Purification
Poster Session 5
Junyu Gao · Kunlin Yang · Xuan Yao · Yufan Hu
|
ExHall D Poster #224 | |
VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing
Poster Session 5
Juan Luis Gonzalez Bello · Xu Yao · Alex Whelan · Kyle Olszewski · Hyeongwoo Kim · Pablo Garrido
|
ExHall D Poster #175 | |
Multi-modal Medical Diagnosis via Large-small Model Collaboration
Poster Session 6
Wanyi Chen · Zihua Zhao · Jiangchao Yao · Ya Zhang · Jiajun Bu · Haishuai Wang
|
ExHall D Poster #442 | |
VSNet: Focusing on the Linguistic Characteristics of Sign Language
Poster Session 5
Yuhao Li · Xinyue Chen · Hongkai Li · Xiaorong Pu · Peng Jin · Yazhou Ren
|
ExHall D Poster #315 | |
Volumetrically Consistent 3D Gaussian Rasterization
Chinmay Talegaonkar · Yash Belhe · Ravi Ramamoorthi · Nicholas Antipa
|
ExHall D Poster #28 | |
DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
Poster Session 3
Miaowei Wang · Yibo Zhang · Rui Ma · Weiwei Xu · Changqing Zou · Daniel Morris
|
ExHall D Poster #67 | |
Do Computer Vision Foundation Models Learn the Low-level Characteristics of the Human Visual System?
Yancheng Cai · Fei Yin · Dounia Hammou · Rafal Mantiuk
|
ExHall D Poster #404 | |
Sound Bridge: Associating Egocentric and Exocentric Videos via Audio Cues
Poster Session 6
Sihong Huang · Jiaxin Wu · Xiaoyong Wei · Yi Cai · Dongmei Jiang · Yaowei Wang
|
ExHall D Poster #265 | |
Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM
Poster Session 3
Yizhou Huang · Yihua Cheng · Kezhi Wang
|
ExHall D Poster #136 | |
Resilient Sensor Fusion Under Adverse Sensor Failures via Multi-Modal Expert Fusion
Poster Session 2
Konyul Park · Yecheol Kim · Daehun Kim · Jun Won Choi
|
ExHall D Poster #129 | |
DiC: Rethinking Conv3x3 Designs in Diffusion Models
Poster Session 1
Yuchuan Tian · Jing Han · Chengcheng Wang · Yuchen Liang · Chao Xu · Hanting Chen
|
ExHall D Poster #220 | |
Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models
Poster Session 5
Yankai Jiang · Peng Zhang · Donglin Yang · Yuan Tian · Hai Lin · Xiaosong Wang
|
ExHall D Poster #474 | |
Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception
Poster Session 1
Ruotian Peng · Haiying He · Yake Wei · Yandong Wen · Di Hu
|
ExHall D Poster #361 | |
TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification
Poster Session 5
Dongyoon Yang · Jihu Lee · Yongdai Kim
|
ExHall D Poster #454 | |
S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting
Poster Session 1
Yecong Wan · Mingwen Shao · Yuanshuo Cheng · Wangmeng Zuo
|
ExHall D Poster #51 | |
Learning on Model Weights using Tree Experts
Poster Session 4
Eliahu Horwitz · Bar Cavia · Jonathan Kahana · Yedid Hoshen
|
ExHall D Poster #444 | |
Towards Million-Scale Adversarial Robustness Evaluation With Stronger Individual Attacks
Poster Session 6
Yong Xie · Weijie Zheng · Hanxun Huang · Guangnan Ye · Xingjun Ma
|
ExHall D Poster #436 | |
StyleMaster: Stylize Your Video with Artistic Generation and Translation
Poster Session 1
Zixuan Ye · Huijuan Huang · Xintao Wang · Pengfei Wan · Di Zhang · Wenhan Luo
|
ExHall D Poster #236 | |
GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs
Poster Session 4
Yi Fang · Bowen Jin · Jiacheng Shen · Sirui Ding · Qiaoyu Tan · Jiawei Han
|
ExHall D Poster #349 | |
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
Poster Session 2
Xueqing Wu · Yuheng Ding · Bingxuan Li · Pan Lu · Da Yin · Kai-Wei Chang · Nanyun Peng
|
ExHall D Poster #396 | |
SFDM: Robust Decomposition of Geometry and Reflectance for Realistic Face Rendering from Sparse-view Images
Poster Session 6
Daisheng Jin · Jiangbei Hu · Baixin Xu · Yuxin Dai · Chen Qian · Ying He
|
ExHall D Poster #21 | |
From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling
Poster Session 4
Jinhong Lin · Cheng-En Wu · Huanran Li · Jifan Zhang · Yu Hen Hu · Pedro Morgado
|
ExHall D Poster #403 | |
Distinguish Then Exploit: Source-free Open Set Domain Adaptation via Weight Barcode Estimation and Sparse Label Assignment
Poster Session 1
Weiming Liu · Jun Dan · Fan Wang · Xinting Liao · Junhao Dong · Hua Yu · Shunjie Dong · Lianyong Qi
|
ExHall D Poster #455 | |
DART: Disease-aware Image-Text Alignment and Self-correcting Re-alignment for Trustworthy Radiology Report Generation
Poster Session 3
Sang-Jun Park · Keun-Soo Heo · Dong-Hee Shin · Young-Han Son · Ji-Hye Oh · Tae-Eui Kam
|
ExHall D Poster #472 | |
Effective Cloud Removal for Remote Sensing Images by an Improved Mean-Reverting Denoising Model with Elucidated Design Space
Poster Session 4
Yi Liu · Wengen Li · Jihong Guan · Shuigeng Zhou · Yichao Zhang
|
ExHall D Poster #195 | |
HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation
Poster Session 6
Trong-Thuan Nguyen · Pha Nguyen · Jackson Cothren · Alper Yilmaz · Khoa Luu
|
ExHall D Poster #287 | |
MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception
Poster Session 2
Wenzhuo Liu · Wenshuo Wang · Yicheng Qiao · Qiannan Guo · Jiayin Zhu · Pengfei Li · Zilong Chen · Huiming Yang · Zhiwei Li · Lening Wang · Tiao Tan · Huaping Liu
|
ExHall D Poster #143 | |
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Poster Session 1
Xiaozhong Ji · Xiaobin Hu · Zhihong Xu · Junwei Zhu · Chuming Lin · Qingdong He · Jiangning Zhang · Donghao Luo · Yi Chen · Qin Lin · Qinglin Lu · Chengjie Wang
|
ExHall D Poster #3 | |
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation
Poster Session 1
Yichen Xie · Runsheng Xu · Tong He · Jyh-Jing Hwang · Katie Z Luo · Jingwei Ji · Hubert Lin · Letian Chen · Yiren Lu · Zhaoqi Leng · Dragomir Anguelov · Mingxing Tan
|
ExHall D Poster #136 | |
Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency
Poster Session 5
Yikai Wang · Chenjie Cao · Junqiu Yu · Ke Fan · Xiangyang Xue · Yanwei Fu
|
ExHall D Poster #208 | |
ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning
Poster Session 1
Zhenyang Liu · Yikai Wang · Sixiao Zheng · Tongying Pan · Longfei Liang · Yanwei Fu · Xiangyang Xue
|
ExHall D Poster #338 | |
AnimateAnything: Consistent and Controllable Animation for Video Generation
Poster Session 6
Guojun Lei · Chi Wang · Rong Zhang · Yikai Wang · Hong Li · Weiwei Xu
|
ExHall D Poster #169 | |
MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction
Poster Session 1
Xiaohao Xu · Feng Xue · Shibo Zhao · Yike Pan · Sebastian Scherer · Xiaonan Huang
|
ExHall D Poster #64 | |
POSTA: A Go-to Framework for Customized Artistic Poster Generation
Poster Session 6
Haoyu Chen · Xiaojie Xu · Wenbo Li · Jingjing Ren · Tian Ye · Songhua Liu · Ying-Cong Chen · Lei Zhu · Xinchao Wang
|
ExHall D Poster #241 | |
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos
Yuzheng Liu · Siyan Dong · Shuzhe Wang · Yingda Yin · Yanchao Yang · Qingnan Fan · Baoquan Chen
|
ExHall D Poster #78 | |
Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection
Poster Session 3
Jikang Cheng · Zhiyuan Yan · Ying Zhang · Li Hao · Jiaxin Ai · Qin Zou · Chen Li · Zhongyuan Wang
|
ExHall D Poster #313 | |
From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective
Poster Session 4
Chen Zhao · Zhizhou Chen · Yunzhe Xu · Enxuan Gu · Jian Li · Zili Yi · Qian Wang · Jian Yang · Ying Tai
|
ExHall D Poster #203 | |
Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method
Poster Session 1
Pan Yin · Kaiyu Li · Xiangyong Cao · Jing Yao · Lei Liu · Xueru Bai · Feng Zhou · Deyu Meng
|
ExHall D Poster #127 | |
Towards Cost-Effective Learning: A Synergy of Semi-Supervised and Active Learning
Poster Session 2
Tianxiang Yin · Ningzhong Liu · Han Sun
|
ExHall D Poster #456 | |
DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes
Poster Session 2
Jinxiu Liu · Shaoheng Lin · Yinxiao Li · Ming-Hsuan Yang
|
ExHall D Poster #67 | |
Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation
Xiaoying Xing · Avinab Saha · Junfeng He · Susan Hao · Paul Vicol · Moonkyung Ryu · Gang Li · Sahil Singla · Sarah Young · Yinxiao Li · Feng Yang · Deepak Ramachandran
|
ExHall D Poster #259 | |
Calibrated Multi-Preference Optimization for Aligning Diffusion Models
Poster Session 4
Kyungmin Lee · Xiaohang Li · Qifei Wang · Junfeng He · Junjie Ke · Ming-Hsuan Yang · Irfan Essa · Jinwoo Shin · Feng Yang · Yinxiao Li
|
ExHall D Poster #257 | |
ReNeg: Learning Negative Embedding with Reward Guidance
Xiaomin Li · Yixuan Liu · Takashi Isobe · Xu Jia · Qinpeng Cui · Dong Zhou · Dong Li · You He · Huchuan Lu · Zhongdao Wang · Emad Barsoum
|
ExHall D Poster #249 | |
DL2G: Degradation-guided Local-to-Global Restoration for Eyeglass Reflection Removal
Poster Session 4
Yizhilv · Xiao Lu · Hong Ding · Jingbo Hu · Zhi Jiang · Chunxia Xiao
|
ExHall D Poster #19 | |
Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions
Poster Session 1
Ting-Hsuan Liao · Yi Zhou · Yu Shen · Chun-Hao P. Huang · Saayan Mitra · Jia-Bin Huang · Uttaran Bhattacharya
|
ExHall D Poster #162 | |
UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping
Poster Session 3
Wenbo Wang · Fangyun Wei · Lei Zhou · Xi Chen · Lin Luo · Xiaohan Yi · Yizhong Zhang · Yaobo Liang · Chang Xu · Yan Lu · Jiaolong Yang · Baining Guo
|
ExHall D Poster #149 | |
SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens
Poster Session 4
Chi Su · Xiaoxuan Ma · Jiajun Su · Yizhou Wang
|
ExHall D Poster #93 | |
FreeCloth: Free-form Generation Enhances Challenging Clothed Human Modeling
Hang Ye · Xiaoxuan Ma · Hai Ci · Wentao Zhu · Yizhou Wang
|
ExHall D Poster #12 | |
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Jinlu Zhang · Yixin Chen · Zan Wang · Jie Yang · Yizhou Wang · Siyuan Huang
|
ExHall D Poster #157 | |
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
Poster Session 5
Jinhui Yi · Syed Talal Wasim · Yanan Luo · Muzammal Naseer · Jürgen Gall
|
ExHall D Poster #296 | |
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
Poster Session 4
Zongjian Li · Bin Lin · Yang Ye · Liuhan Chen · Xinhua Cheng · Shenghai Yuan · Li Yuan
|
ExHall D Poster #188 | |
Generalizing Deepfake Video Detection with Plug-and-Play: Video-Level Blending and Spatiotemporal Adapter Tuning
Poster Session 3
Zhiyuan Yan · Yandan Zhao · Shen Chen · Mingyi Guo · Xinghe Fu · Taiping Yao · Shouhong Ding · Yunsheng Wu · Li Yuan
|
ExHall D Poster #188 | |
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
Poster Session 6
Haicheng Wang · Chen Ju · Weixiong Lin · Mengting Chen · Shuai Xiao · Yixuan Huang · Chang Liu · Mingshuai Yao · Jinsong Lan · Ying Chen · Qingwen Liu · Yanfeng Wang
|
ExHall D Poster #349 | |
Rethinking Diffusion for Text-Driven Human Motion Generation: Redundant Representations, Evaluation, and Masked Autoregression
Poster Session 6
Zichong Meng · Yiming Xie · Xiaogang Peng · Zeyu Han · Huaizu Jiang
|
ExHall D Poster #161 | |
UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery
Poster Session 4
Dianmo Sheng · Dongdong Chen · Zhentao Tan · Qiankun Liu · Qi Chu · Tao Gong · Bin Liu · Jing Han · Wenbin Tu · Shengwei Xu · Nenghai Yu
|
ExHall D Poster #419 | |
Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression
Poster Session 3
Zhenqi Dai · Ting Liu · Yanning Zhang
|
ExHall D Poster #48 | |
Dual-Granularity Semantic Guided Sparse Routing Diffusion Model for General Pansharpening
Poster Session 3
Yinghui Xing · Qu Li Tao · Shizhou Zhang · Di Xu · Yingkun Yang · Yanning Zhang
|
ExHall D Poster #192 | |
Knowledge Memorization and Rumination for Pre-trained Model-based Class-Incremental Learning
Poster Session 4
Zijian Gao · Wangwang Jia · Xingxing Zhang · Dulan Zhou · Kele Xu · Feng Dawei · Yong Dou · Xinjun Mao · Huaimin Wang
|
ExHall D Poster #449 | |
Temporal Action Detection Model Compression by Progressive Block Drop
Poster Session 6
Xiaoyong Chen · Yong Guo · Jiaming Liang · Sitong Zhuang · Runhao Zeng · Xiping Hu
|
ExHall D Poster #294 | |
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models
Poster Session 6
Wentao Qu · Jing Wang · Yongshun Gong · Xiaoshui Huang · Liang Xiao
|
ExHall D Poster #112 | |
GRAE-3DMOT: Geometry Relation-Aware Encoder for Online 3D Multi-Object Tracking
Poster Session 3
Hyunseop Kim · Hyo-Jun Lee · Yonguk Lee · Jinu Lee · Hanul Kim · Yeong Jun Koh
|
ExHall D Poster #101 | |
SOAP: Vision-Centric 3D Semantic Scene Completion with Scene-Adaptive Decoder and Occluded Region-Aware View Projection
Poster Session 4
Hyo-Jun Lee · Yeong Jun Koh · Hanul Kim · Hyunseop Kim · Yonguk Lee · Jinu Lee
|
ExHall D Poster #127 | |
Vision-Language Gradient Descent-driven All-in-One Deep Unfolding Networks
Poster Session 2
Haijin Zeng · Xiangming Wang · Yongyong Chen · Jingyong Su · Jie Liu
|
ExHall D Poster #206 | |
Binarized Mamba-Transformer for Lightweight Quad Bayer HybridEVS Demosaicing
Poster Session 2
Shiyang Zhou · Haijin Zeng · Yunfan Lu · Tong Shao · Ke Tang · Yongyong Chen · Jie Liu · Jingyong Su
|
ExHall D Poster #329 | |
CASP: Compression of Large Multimodal Models Based on Attention Sparsity
Poster Session 2
Mohsen Gholami · Mohammad Akbari · Kevin Cannons · Yong Zhang
|
ExHall D Poster #381 | |
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models
Poster Session 2
Saeed Ranjbar Alvar · Gursimran Singh · Mohammad Akbari · Yong Zhang
|
ExHall D Poster #383 | |
TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features
Poster Session 5
Dana Cohen-Bar · Daniel Cohen-Or · Gal Chechik · Yoni Kasten
|
ExHall D Poster #34 | |
Automatic Spectral Calibration of Hyperspectral Images: Method, Dataset and Benchmark
Poster Session 6
Zhuoran Du · Shaodi You · Cheng Cheng · Shikui Wei
|
ExHall D Poster #182 | |
Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild
Poster Session 5
Junhyeong Cho · Kim Youwang · Hunmin Yang · Tae-Hyun Oh
|
ExHall D Poster #164 | |
DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models
Poster Session 3
Radu Alexandru Rosu · Keyu Wu · Yao Feng · Youyi Zheng · Michael J. Black
|
ExHall D Poster #18 | |
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting
Poster Session 4
Jiaxin Zhang · Junjun Jiang · Youyu Chen · Kui Jiang · Xianming Liu
|
ExHall D Poster #337 | |
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
Poster Session 2
Seokhyeon Hong · Chaelin Kim · Serin Yoon · Junghyun Nam · Sihun Cha · Junyong Noh
|
ExHall D Poster #172 | |
Gazing Into Missteps: Leveraging Eye-Gaze for Unsupervised Mistake Detection in Egocentric Videos of Skilled Human Activities
Poster Session 2
Michele Mazzamuto · Antonino Furnari · Yoichi Sato · Giovanni Maria Farinella
|
ExHall D Poster #281 | |
Post-pre-training for Modality Alignment in Vision-Language Foundation Models
Poster Session 1
Shin'ya Yamaguchi · Dewei Feng · Sekitoshi Kanai · Kazuki Adachi · Daiki Chijiwa
|
ExHall D Poster #390 | |
Pippo: High-Resolution Multi-View Humans from a Single Image
Poster Session 4
Yash Kant · Ethan Weber · Jin Kyu Kim · Rawal Khirodkar · Zhaoen Su · Julieta Martinez · Igor Gilitschenski · Shunsuke Saito · Timur Bagautdinov
|
ExHall D Poster #55 | |
Playing the Fool: Jailbreaking LLMs and Multimodal LLMs with Out-of-Distribution Strategy
Poster Session 6
Joonhyun Jeong · Seyun Bae · Yeonsung Jung · Jaeryong Hwang · Eunho Yang
|
ExHall D Poster #362 | |
Preserve or Modify? Context-Aware Evaluation for Balancing Preservation and Modification in Text-Guided Image Editing
Poster Session 5
Yoonjeon Kim · Soohyun Ryu · Yeonsung Jung · Hyunkoo Lee · Joowon Kim · June Yong Yang · Jaeryong Hwang · Eunho Yang
|
ExHall D Poster #231 | |
ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices
Poster Session 1
Hao Yu · Tangyu Jiang · Shuning Jia · Shannan Yan · Shunning Liu · Haolong Qian · Guanghao Li · Shuting Dong · Chun Yuan
|
ExHall D Poster #416 | |
Reversing Flow for Image Restoration
Poster Session 2
Haina Qin · Wenyang Luo · Bing Li · Weiming Hu · Libin Wang · DanDan Zheng · Jingdong Chen · Ming Yang
|
ExHall D Poster #208 | |
Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition
Poster Session 1
Wen Yin · Yong Wang · Guiduo Duan · Dongyang Zhang · Xin Hu · Yuan-Fang Li · Tao He
|
ExHall D Poster #354 | |
Consistent and Controllable Image Animation with Motion Diffusion Models
Poster Session 2
Xin Ma · Yaohui Wang · Gengyun Jia · Xinyuan Chen · Tien-Tsin Wong · Yuan-Fang Li · Cunjian Chen
|
ExHall D Poster #184 | |
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
Poster Session 4
Edson Araujo · Andrew Rouditchenko · Yuan Gong · Saurabhchand Bhati · Samuel Thomas · Brian Kingsbury · Leonid Karlinsky · Rogerio Feris · James Glass · Hilde Kuehne
|
ExHall D Poster #287 | |
Detecting Adversarial Data Using Perturbation Forgery
Poster Session 3
Qian Wang · Chen Li · Yuchen Luo · Hefei Ling · Shijuan Huang · Ruoxi Jia · Ning Yu
|
ExHall D Poster #312 | |
Complementary Advantages: Exploiting Cross-Field Frequency Correlation for NIR-Assisted Image Denoising
Poster Session 3
Yuchen Wang · Hongyuan Wang · Lizhi Wang · Xin Wang · Lin Zhu · Wanxuan Lu · Hua Huang
|
ExHall D Poster #194 | |
One is Plenty: A Polymorphic Feature Interpreter for Immutable Heterogeneous Collaborative Perception
Poster Session 1
Yuchen Xia · Quan Yuan · Guiyang Luo · Xiaoyuan Fu · Yang Li · Xuanhan Zhu · Tianyou Luo · Siheng Chen · Jinglin Li
|
ExHall D Poster #133 | |
VolFormer: Explore More Comprehensive Cube Interaction for Hyperspectral Image Restoration and Beyond
Poster Session 6
Dabing Yu · Zheng Gao
|
ExHall D Poster #183 | |
ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate
Ming Yan · Xincheng Lin · Yuhua Luo · Shuqi Fan · Yudi Dai · Qixin Zhong · Lincai Zhong · Yuexin Ma · Lan Xu · Chenglu Wen · Siqi Shen · Cheng Wang
|
ExHall D Poster #159 | |
DroneSplat: 3D Gaussian Splatting for Robust 3D Reconstruction from In-the-Wild Drone Imagery
Jiadong Tang · Yu Gao · Dianyi Yang · Liqi Yan · Yufeng Yue · Yi Yang
|
ExHall D Poster #62 | |
TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model
Poster Session 4
Cheng Yang · Yang Sui · Jinqi Xiao · Lingyi Huang · Yu Gong · Chendi Li · Jinghua Yan · Yu Bai · Ponnuswamy Sadayappan · Xia Hu · Bo Yuan
|
ExHall D Poster #381 | |
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding
Poster Session 2
Yudong Han · Qingpei Guo · Liyuan Pan · Liu Liu · Yu Guan · Ming Yang
|
ExHall D Poster #299 | |
Leveraging Global Stereo Consistency for Category-Level Shape and 6D Pose Estimation from Stereo Images
Poster Session 4
Junning Qiu · Minglei Lu · Fei Wang · Yu Guo · Yonggen Ling
|
ExHall D Poster #97 | |
Diffusion-based Realistic Listening Head Generation via Hybrid Motion Modeling
Yinuo Wang · Yanbo Fan · Xuan Wang · Yu Guo · Fei Wang
|
ExHall D Poster #3 | |
Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning
Poster Session 5
Can Küçüksözen · Yucel Yemez
|
ExHall D Poster #415 | |
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
Poster Session 3
Yuhan Wang · Fangzhou Hong · Shuai Yang · Liming Jiang · Wayne Wu · Chen Change Loy
|
ExHall D Poster #61 | |
UNIALIGN: Scaling Multimodal Alignment within One Unified Model
Poster Session 6
Bo Zhou · Liulei Li · Yujia Wang · Huafeng Liu · Yazhou Yao · Wenguan Wang
|
ExHall D Poster #335 | |
VITED: Video Temporal Evidence Distillation
Poster Session 2
Yujie Lu · Yale Song · Lorenzo Torresani · William Yang Wang · Tushar Nagarajan
|
ExHall D Poster #298 | |
Adaptive Rectangular Convolution for Remote Sensing Pansharpening
Poster Session 4
Xueyang Wang · Zhixin Zheng · Jiandong Shao · Yule Duan · Liang-Jian Deng
|
ExHall D Poster #197 | |
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera
Poster Session 6
Yuliang Guo · Sparsh Garg · S. Mahdi H. Miangoleh · Xinyu Huang · Liu Ren
|
ExHall D Poster #81 | |
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
Poster Session 2
Yue Chen · Xingyu Chen · Anpei Chen · Gerard Pons-Moll · Yuliang Xiu
|
ExHall D Poster #93 | |
Ref-GS: Directional Factorization for 2D Gaussian Splatting
Poster Session 6
Youjia Zhang · Anpei Chen · Yumin Wan · Zikai Song · Junqing Yu · Yawei Luo · Wei Yang
|
ExHall D Poster #30 | |
Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes
Poster Session 6
Lihan Jiang · Kerui Ren · Mulin Yu · Linning Xu · Junting Dong · Tao Lu · Feng Zhao · Dahua Lin · Bo Dai
|
ExHall D Poster #62 | |
AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models
Poster Session 6
Kwan Yun · Seokhyeon Hong · Chaelin Kim · Junyong Noh
|
ExHall D Poster #159 | |
H2ST: Hierarchical Two-Sample Tests for Continual Out-of-Distribution Detection
Poster Session 3
Yuhang Liu · Wenjie Zhao · Yunhui Guo
|
ExHall D Poster #456 | |
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
Poster Session 3
Kwan Yun · Chaelin Kim · Hangyeul Shin · Junyong Noh
|
ExHall D Poster #16 | |
Task-Specific Gradient Adaptation for Few-Shot One-Class Classification
Poster Session 6
Yunlong Li · Xiabi Liu · Liyuan Pan · Yuchen Ren
|
ExHall D Poster #422 | |
ASHiTA: Automatic Scene-grounded HIerarchical Task Analysis
Poster Session 6
Yun Chang · Leonor Fermoselle · Duy Ta · Bernadette Bucher · Luca Carlone · Jiuguang Wang
|
ExHall D Poster #316 | |
Attribute-Missing Multi-view Graph Clustering
Poster Session 5
Bowen Zhao · Qianqian Wang · Zhengming Ding · Quanxue Gao
|
ExHall D Poster #461 | |
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
Poster Session 4
Yuncong Yang · Han Yang · Jiachen Zhou · Peihao Chen · Hongxin Zhang · Yilun Du · Chuang Gan
|
ExHall D Poster #141 | |
ROICtrl: Boosting Instance Control for Visual Generation
Poster Session 5
Yuchao Gu · Yipin Zhou · Yunfan Ye · Yixin Nie · Licheng Yu · Pingchuan Ma · Kevin Qinghong Lin · Mike Zheng Shou
|
ExHall D Poster #251 | |
HuPerFlow: A Comprehensive Benchmark for Human vs. Machine Motion Estimation Comparison
Poster Session 5
Yung-Hao Yang · Zitang Sun · Taiki Fukiage · Shin'ya Nishida
|
ExHall D Poster #166 | |
On Denoising Walking Videos for Gait Recognition
Poster Session 3
Dongyang Jin · Chao Fan · Jingzhe Ma · Jingkai Zhou · Weihua Chen · Shiqi Yu
|
ExHall D Poster #162 | |
NeISF++: Neural Incident Stokes Field for Polarized Inverse Rendering of Conductors and Dielectrics
Poster Session 6
Chenhao Li · Taishi Ono · Takeshi Uemori · Sho Nitta · Hajime Mihara · Alexander Gatto · Hajime Nagahara · Yusuke Moriuchi
|
ExHall D Poster #31 | |
PromptHMR: Promptable Human Mesh Recovery
Poster Session 1
Yufu Wang · Yu Sun · Priyanka Patel · Kostas Daniilidis · Michael J. Black · Muhammed Kocabas
|
ExHall D Poster #91 | |
UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units
Huakun Liu · Hiroki Ota · Xin Wei · Yutaro Hirao · Monica Perusquia-Hernandez · Hideaki Uchiyama · Kiyoshi Kiyokawa
|
ExHall D Poster #165 | |
FedMIA: An Effective Membership Inference Attack Exploiting "All for One" Principle in Federated Learning
Poster Session 4
Gongxi Zhu · Donghao Li · Hanlin Gu · Yuan Yao · Lixin Fan · Yuxing Han
|
ExHall D Poster #460 | |
A Physics-Informed Blur Learning Framework for Imaging Systems
Poster Session 3
Liqun Chen · Yuxuan Li · Jun Dai · Jinwei Gu · Tianfan Xue
|
ExHall D Poster #24 | |
Segment Any-Quality Images with Generative Latent Space Enhancement
Poster Session 1
Guangqian Guo · Yong Guo · Xuehui Yu · Wenbo Li · Yaoxing Wang · Shan Gao
|
ExHall D Poster #207 | |
Sparse2DGS: Geometry-Prioritized Gaussian Splatting for Surface Reconstruction from Sparse Views
Poster Session 3
Jiang Wu · Rui Li · Yu Zhu · Rong Guo · Jinqiu Sun · Yanning Zhang
|
ExHall D Poster #62 | |
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving
Poster Session 1
Zebin Xing · Xingyu Zhang · Yang Hu · Bo Jiang · Tong He · Qian Zhang · Xiaoxiao Long · Wei Yin
|
ExHall D Poster #134 | |
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
Poster Session 2
Kaiyue Sun · Kaiyi Huang · Xian Liu · Yue Wu · Zihan Xu · Zhenguo Li · Xihui Liu
|
ExHall D Poster #290 | |
VoCo-LLaMA: Towards Vision Compression with Large Language Models
Poster Session 6
Xubing Ye · Yukang Gan · Xiaoke Huang · Yixiao Ge · Yansong Tang
|
ExHall D Poster #353 | |
ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models
Poster Session 5
Xubing Ye · Yukang Gan · Yixiao Ge · Xiao-Ping Zhang · Yansong Tang
|
ExHall D Poster #376 | |
Jailbreaking the Non-Transferable Barrier via Test-Time Data Disguising
Poster Session 6
Yongli Xiang · Ziming Hong · Lina Yao · Dadong Wang · Tongliang Liu
|
ExHall D Poster #433 | |
Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos
Poster Session 5
Chiara Plizzari · Alessio Tonioni · Yongqin Xian · Achin Kulshrestha · Federico Tombari
|
ExHall D Poster #297 | |
LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning
Poster Session 6
Peng Wu · Xiankai Lu · Hao Hu · Yongqin Xian · Jianbing Shen · Wenguan Wang
|
ExHall D Poster #398 | |
GASP: Gaussian Avatars with Synthetic Priors
Poster Session 1
Jack Saunders · Charlie Hewitt · Yanan Jian · Marek Kowalski · Tadas Baltrusaitis · Yiye Chen · Darren Cosker · Virginia Estellers · Nicholas Gydé · Vinay P. Namboodiri · Benjamin E Lundell
|
ExHall D Poster #10 | |
GG-SSMs: Graph-Generating State Space Models
Poster Session 6
Nikola Zubic · Davide Scaramuzza
|
ExHall D Poster #257 | |
Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis
Poster Session 2
Yousef Yeganeh · Ioannis Charisiadis · Marta Hasny · Martin Hartenberger · Björn Ommer · Nassir Navab · Azade Farshad · Ehsan Adeli
|
ExHall D Poster #222 | |
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation
Poster Session 3
Yuying Ge · Yizhuo Li · Yixiao Ge · Ying Shan
|
ExHall D Poster #282 | |
A Simple Data Augmentation for Feature Distribution Skewed Federated Learning
Poster Session 5
Yunlu Yan · Huazhu Fu · Yuexiang Li · Jinheng Xie · Jun Ma · Guang Yang · Lei Zhu
|
ExHall D Poster #451 | |
Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning
Zichen Tian · Yaoyao Liu · Qianru Sun
|
ExHall D Poster #188 | |
Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks
Poster Session 5
Junying Wang · Hongyuan Zhang · Yuan Yuan
|
ExHall D Poster #259 | |
Birth and Death of a Rose
Poster Session 6
Chen Geng · Yunzhi Zhang · Shangzhe Wu · Jiajun Wu
|
ExHall D Poster #11 | |
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
Poster Session 5
Rui Qian · Shuangrui Ding · Xiaoyi Dong · Pan Zhang · Yuhang Zang · Yuhang Cao · Dahua Lin · Jiaqi Wang
|
ExHall D Poster #289 | |
Probing the Mid-level Vision Capabilities of Self-Supervised Learning
Poster Session 6
Xuweiyi Chen · Markus Marks · Zezhou Cheng
|
ExHall D Poster #377 | |
Conical Visual Concentration for Efficient Large Vision-Language Models
Poster Session 3
Long Xing · Qidong Huang · Xiaoyi Dong · Jiajie Lu · Pan Zhang · Yuhang Zang · Yuhang Cao · Conghui He · Jiaqi Wang · Feng Wu · Dahua Lin
|
ExHall D Poster #378 | |
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
Poster Session 4
Junbo Niu · Yifei Li · Ziyang Miao · Chunjiang Ge · Zhou Yuanhang · Qihao He · Xiaoyi Dong · Haodong Duan · Shuangrui Ding · Rui Qian · Pan Zhang · Yuhang Zang · Yuhang Cao · Conghui He · Jiaqi Wang
|
ExHall D Poster #297 | |
SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning
Poster Session 5
Ren Wang · Haoliang Sun · Yuxiu Lin · Chuanhui Zuo · Yongshun Gong · Yilong Yin · Wenjia Meng
|
ExHall D Poster #460 | |
GaussianIP: Identity-Preserving Realistic 3D Human Generation via Human-Centric Diffusion Prior
Poster Session 1
Zichen Tang · Yuan Yao · Miaomiao Cui · Liefeng Bo · Hongyu Yang
|
ExHall D Poster #17 | |
A4A: Adapter for Adapter Transfer via All-for-All Mapping for Cross-Architecture Models
Poster Session 4
Keyu Tu · Mengqi Huang · Zhuowei Chen · Zhendong Mao
|
ExHall D Poster #258 | |
Dragin3D: Image Editing by Dragging in 3D Space
Poster Session 5
Weiran Guang · Xiaoguang Gu · Mengqi Huang · Zhendong Mao
|
ExHall D Poster #43 | |
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation
Poster Session 3
Hao Du · Bo Wu · Yan Lu · Zhendong Mao
|
ExHall D Poster #301 | |
FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation
Poster Session 1
Fengyi Fu · Lei Zhang · Mengqi Huang · Zhendong Mao
|
ExHall D Poster #239 | |
MambaIC: State Space Models for High-Performance Learned Image Compression
Poster Session 4
Fanhu Zeng · Hao Tang · Yihua Shao · Siyu Chen · Ling Shao · Yan Wang
|
ExHall D Poster #213 | |
BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models
Poster Session 6
Zenghui Yuan · Jiawen Shi · Pan Zhou · Neil Zhenqiang Gong · Lichao Sun
|
ExHall D Poster #361 | |
ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation
Poster Session 3
Lingan Zeng · Guohong Huang · Yi-Lin Wei · Shengbo Gu · Yu-Ming Tang · Jingke Meng · Wei-Shi Zheng
|
ExHall D Poster #163 | |
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
Poster Session 6
Ruijie Lu · Yixin Chen · Junfeng Ni · Baoxiong Jia · Yu Liu · Diwen Wan · Gang Zeng · Siyuan Huang
|
ExHall D Poster #60 | |
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
Poster Session 2
Yu Cao · Zengqun Zhao · Ioannis Patras · Shaogang Gong
|
ExHall D Poster #224 | |
Six-CD: Benchmarking Concept Removals for Text-to-image Diffusion Models
Poster Session 6
Jie Ren · Kangrui Chen · Yingqian Cui · Shenglai Zeng · Hui Liu · Yue Xing · Jiliang Tang · Lingjuan Lyu
|
ExHall D Poster #248 | |
NADER: Neural Architecture Design via Multi-Agent Collaboration
Poster Session 1
Zekang Yang · Wang ZENG · Sheng Jin · Chen Qian · Ping Luo · Wentao Liu
|
ExHall D Poster #411 | |
SAMBLE: Shape-Specific Point Cloud Sampling for an Optimal Trade-Off Between Local Detail and Global Uniformity
Poster Session 1
Chengzhi Wu · Yuxin Wan · Hao Fu · Julius Pfrommer · Zeyun Zhong · Junwei Zheng · Jiaming Zhang · Jürgen Beyerer
|
ExHall D Poster #109 | |
SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
Shaoan Xie · Lingjing Kong · Yujia Zheng · Yu Yao · Zeyu Tang · Eric P. Xing · Guangyi Chen · Kun Zhang
|
ExHall D Poster #348 | |
Event-based Video Super-Resolution via State Space Models
Poster Session 3
Zeyu Xiao · Xinchao Wang
|
ExHall D Poster #182 | |
NoPain: No-box Point Cloud Attack via Optimal Transport Singular Boundary
Poster Session 1
Zezeng Li · Xiaoyu Du · Na Lei · Liming Chen · Weimin Wang
|
ExHall D Poster #317 | |
Automated Proof of Polynomial Inequalities via Reinforcement Learning
Poster Session 1
Banglong Liu · Niuniu Qi · Xia Zeng · Lydia Dehbi · Zhengfeng Yang
|
ExHall D Poster #467 | |
Learning-enabled Polynomial Lyapunov Function Synthesis via High-Accuracy Counterexample-Guided Framework
Poster Session 2
Hanrui Zhao · Niuniu Qi · Mengxin Ren · Banglong Liu · Shuming Shi · Zhengfeng Yang
|
ExHall D Poster #467 | |
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
Yifei Qian · Zhongliang Guo · Bowen Deng · Chun Tong Lei · Shuai Zhao · Chun Pong Lau · Xiaopeng Hong · Michael Pound
|
ExHall D Poster #410 | |
Instant Adversarial Purification with Adversarial Consistency Distillation
Poster Session 5
Chun Tong Lei · Hon Ming Yam · Zhongliang Guo · Yifei Qian · Chun Pong Lau
|
ExHall D Poster #316 | |
UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation
Poster Session 6
Yichong Lu · Yichi Cai · Shangzhan Zhang · Hongyu Zhou · Haoji Hu · Huimin Yu · Andreas Geiger · Yiyi Liao
|
ExHall D Poster #130 | |
CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images
Poster Session 3
Chen Cheng · Jiacheng Wei · Tianrun Chen · Chi Zhang · Xiaofeng Yang · Shangzhan Zhang · Bingchen Yang · Chuan-Sheng Foo · Guosheng Lin · Qixing Huang · Fayao Liu
|
ExHall D Poster #40 | |
IndoorGS: Geometric Cues Guided Gaussian Splatting for Indoor Scene Reconstruction
Poster Session 1
Cong Ruan · Yuesong Wang · Bin Zhang · Lili Ju · Tao Guan
|
ExHall D Poster #63 | |
FedAWA: Adaptive Optimization of Aggregation Weights in Federated Learning Using Client Vectors
Poster Session 6
Changlong Shi · He Zhao · Bingjie Zhang · Mingyuan Zhou · Dandan Guo · Yi Chang
|
ExHall D Poster #431 | |
Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior
Poster Session 1
Haitao Wu · Qing Li · Changqing Zhang · Zhen He · Xiaomin Ying
|
ExHall D Poster #196 | |
HVI: A New Color Space for Low-light Image Enhancement
Poster Session 2
Qingsen Yan · Yixu Feng · Cheng Zhang · Guansong Pang · Kangbiao Shi · Peng Wu · Wei Dong · Jinqiu Sun · Yanning Zhang
|
ExHall D Poster #22 | |
MotionPRO: Exploring the Role of Pressure in Human MoCap and Beyond
Shenghao Ren · Yi Lu · Jiayi Huang · Jiayi Zhao · He Zhang · Tao Yu · Qiu Shen · Xun Cao
|
ExHall D Poster #152 | |
SVDC: Consistent Direct Time-of-Flight Video Depth Completion with Frequency Selective Fusion
Poster Session 4
Xuan Zhu · Jijun Xiang · Xianqi Wang · Longliang Liu · Yu Wang · Hong Zhang · Fei Guo · Xin Yang
|
ExHall D Poster #75 | |
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild
Poster Session 3
Rolandos Alexandros Potamias · Jinglei Zhang · Jiankang Deng · Stefanos Zafeiriou
|
ExHall D Poster #153 | |
DTOS: Dynamic Time Object Sensing with Large Multimodal Model
Poster Session 3
Jirui Tian · Jinrong Zhang · Shenglan Liu · Luhao Xu · Zhixiong Huang · Gao Huang
|
ExHall D Poster #302 | |
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos
Poster Session 4
Tiantian Geng · Jinrui Zhang · Qingni Wang · Teng Wang · Jinming Duan · Feng Zheng
|
ExHall D Poster #302 | |
RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation
Poster Session 6
Mingfei Han · Liang Ma · Kamila Zhumakhanova · Ekaterina Radionova · Jingyi Zhang · Xiaojun Chang · Xiaodan Liang · Ivan Laptev
|
ExHall D Poster #136 | |
The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
Poster Session 2
Changan Chen · Juze Zhang · Shrinidhi Kowshika Lakshmikanth · Yusu Fang · Ruizhi Shao · Gordon Wetzstein · Li Fei-Fei · Ehsan Adeli
|
ExHall D Poster #73 | |
ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression
Poster Session 2
Wei Jiang · Junru Li · Kai Zhang · Li Zhang
|
ExHall D Poster #188 | |
Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes
Poster Session 4
Kaiwei Zhang · Dandan Zhu · Xiongkuo Min · Guangtao Zhai
|
ExHall D Poster #35 | |
Handling Spatial-Temporal Data Heterogeneity for Federated Continual Learning via Tail Anchor
Poster Session 1
Hao Yu · Xin Yang · Le Zhang · Hanlin Gu · Tianrui Li · Lixin Fan · Qiang Yang
|
ExHall D Poster #450 | |
Structure-Aware Correspondence Learning for Relative Pose Estimation
Yihan Chen · Wenfei Yang · Huan Ren · Shifeng Zhang · Tianzhu Zhang · Feng Wu
|
ExHall D Poster #93 | |
Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion
Poster Session 6
ZhiFei Chen · Tianshuo Xu · Wenhang Ge · Leyi Wu · Dongyu Yan · Jing He · Luozhou Wang · Lu Zeng · Shunsi Zhang · Ying-Cong Chen
|
ExHall D Poster #33 | |
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Poster Session 2
Jiantao Lin · Xin Yang · Meixi Chen · Xu Yingjie · Dongyu Yan · Leyi Wu · Xinli Xu · Lie XU · Shunsi Zhang · Ying-Cong Chen
|
ExHall D Poster #41 | |
EVOS: Efficient Implicit Neural Training via EVOlutionary Selector
Poster Session 6
Weixiang Zhang · Shuzhao Xie · Chengwei Ren · Siyi Xie · Chen Tang · Shijia Ge · Mingzi Wang · Zhi Wang
|
ExHall D Poster #414 | |
Taming Teacher Forcing for Masked Autoregressive Video Generation
Poster Session 2
Deyu Zhou · Quan Sun · Yuang Peng · Kun Yan · Runpei Dong · Duomin Wang · Zheng Ge · Nan Duan · Xiangyu Zhang
|
ExHall D Poster #192 | |
LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene
Poster Session 4
Xiaoyu Zhang · Weihong Pan · Chong Bao · Xiyu Zhang · Xiaojun Xiang · Hanqing Jiang · Hujun Bao
|
ExHall D Poster #26 | |
AniMo: Species-Aware Model for Text-Driven Animal Motion Generation
Poster Session 1
Xuan Wang · Kai Ruan · Xing Zhang · Gaoang Wang
|
ExHall D Poster #163 | |
Question-Aware Gaussian Experts for Audio-Visual Question Answering
Hongyeob Kim · Inyoung Jung · Dayoon Suh · Youjia Zhang · Sangmin Lee · Sungeun Hong
|
ExHall D Poster #290 | |
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting
Poster Session 1
Ziyi Wang · Yanran Zhang · Jie Zhou · Jiwen Lu
|
ExHall D Poster #107 | |
BHViT: Binarized Hybrid Vision Transformer
Poster Session 1
Tian Gao · Yu Zhang · Zhiyuan Zhang · Huajun Liu · Kaijie Yin · Cheng-Zhong Xu · Hui Kong
|
ExHall D Poster #323 | |
Knowledge Bridger: Towards Training-Free Missing Modality Completion
Poster Session 5
Guanzhou Ke · Shengfeng He · Xiao-Li Wang · Bo Wang · Guoqing Chao · Yuanyang Zhang · Yi Xie · HeXing Su
|
ExHall D Poster #464 | |
BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training
Poster Session 6
Xuanpu Zhang · Dan Song · Pengxin Zhan · Tianyu Chang · Jianhao Zeng · Qing-Guo Chen · Weihua Luo · An-An Liu
|
ExHall D Poster #20 | |
DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation
Poster Session 1
Zhixuan Liang · Yao Mu · Yixiao Wang · Fei Ni · Tianxing Chen · Wenqi Shao · Wei Zhan · Masayoshi Tomizuka · Ping Luo · Mingyu Ding
|
ExHall D Poster #147 | |
DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes
Poster Session 2
Chensheng Peng · Chengwei Zhang · Yixiao Wang · Chenfeng Xu · Yichen Xie · Wenzhao Zheng · Kurt Keutzer · Masayoshi Tomizuka · Wei Zhan
|
ExHall D Poster #135 | |
CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians
Poster Session 4
Chongjian GE · Chenfeng Xu · Yuanfeng Ji · Chensheng Peng · Masayoshi Tomizuka · Ping Luo · Mingyu Ding · Varun Jampani · Wei Zhan
|
ExHall D Poster #261 | |
UniScene: Unified Occupancy-centric Driving Scene Generation
Poster Session 3
Bohan Li · Jiazhe Guo · Hongsi Liu · Yingshuang Zou · Yikang Ding · Xiwu Chen · Hu ZHU · Feiyang Tan · Chi Zhang · Tiancai Wang · Shuchang Zhou · Li Zhang · Xiaojuan Qi · Hao Zhao · Mu Yang · Wenjun Zeng · Xin Jin
|
ExHall D Poster #128 | |
RDD: Robust Feature Detector and Descriptor using Deformable Transformer
Poster Session 2
Gonglin Chen · Tianwen Fu · Haiwei Chen · Wenbin Teng · Hanyuan Xiao · Yajie Zhao
|
ExHall D Poster #97 | |
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Poster Session 6
Jiayi Guo · Zhao Junhao · Chaoqun Du · Yulin Wang · Chunjiang Ge · Zanlin Ni · Shiji Song · Humphrey Shi · Gao Huang
|
ExHall D Poster #417 | |
Full-DoF Egomotion Estimation for Event Cameras Using Geometric Solvers
Ji Zhao · Banglei Guan · Zibin Liu · Laurent Kneip
|
ExHall D Poster #83 | |
CCIN: Compositional Conflict Identification and Neutralization for Composed Image Retrieval
Likai Tian · Jian Zhao · Zechao Hu · Zhengwei Yang · Hao Li · Lei Jin · Zheng Wang · Xuelong Li
|
ExHall D Poster #362 | |
V2V3D: View-to-View Denoised 3D Reconstruction for Light Field Microscopy
Poster Session 6
Jiayin Zhao · Zhenqi Fu · Tao Yu · Hui Qiao
|
ExHall D Poster #25 | |
Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning
Poster Session 1
Cheng Chen · Yunpeng Zhai · Yifan Zhao · Jinyang Gao · Bolin Ding · Jia Li
|
ExHall D Poster #348 | |
OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints
Mingjie Pan · Jiyao Zhang · Tianshu Wu · Yinghao Zhao · Wenlong Gao · Hao Dong
|
ExHall D Poster #150 | |
GraphI2P: Image-to-Point Cloud Registration with Exploring Pattern of Correspondence via Graph Learning
Poster Session 5
Lin Bie · Shouan Pan · Siqi Li · Yining Zhao · Yue Gao
|
ExHall D Poster #106 | |
SEEN-DA: SEmantic ENtropy guided Domain-aware Attention for Domain Adaptive Object Detection
Poster Session 5
Haochen Li · Rui Zhang · Hantao Yao · Xin Zhang · Yifan Hao · Xinkai Song · Shaohui Peng · Yongwei Zhao · Zhao Chen · Yanjun Wu · Ling Li
|
ExHall D Poster #422 | |
Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability
Poster Session 2
Yingdong Shi · Changming Li · Yifan Wang · Yongxiang Zhao · Anqi Pang · Sibei Yang · Jingyi Yu · Kan Ren
|
ExHall D Poster #269 | |
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Poster Session 5
Linke Ouyang · Yuan Qu · Hongbin Zhou · Jiawei Zhu · Rui Zhang · Qunshu Lin · Bin Wang · Zhiyuan Zhao · Man Jiang · Xiaomeng Zhao · Jin Shi · Fan Wu · Pei Chu · Minghao Liu · Zhenxiang Li · Chao Xu · Bo Zhang · Botian Shi · Zhongying Tu · Conghui He
|
ExHall D Poster #364 | |
Probability Density Geodesics in Image Diffusion Latent Space
Poster Session 6
Qingtao Yu · Jaskirat Singh · Zhaoyuan Yang · Peter Henry Tu · Jing Zhang · Richard Hartley · Hongdong Li · Dylan Campbell
|
ExHall D Poster #173 | |
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models
Poster Session 2
Zhejun Zhang · Peter Karkus · Maximilian Igl · Wenhao Ding · Yuxiao Chen · Boris Ivanovic · Marco Pavone
|
ExHall D Poster #334 | |
All-Optical Nonlinear Diffractive Deep Network for Ultrafast Image Denoising
Xiaoling Zhou · Zhemg Lee · Wei Ye · Rui Xie · Wenbo Zhang · Guanju Peng · Zongze Li · Shikun Zhang
|
ExHall D Poster #196 | |
Learning Affine Correspondences by Integrating Geometric Constraints
Poster Session 6
Pengju Sun · Banglei Guan · Zhenbao Yu · Yang Shang · Qifeng Yu · Daniel Barath
|
ExHall D Poster #85 | |
LaTexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending
Jian Jin · Zhenbo Yu · Yang Shen · Zhenyong Fu · Jian Yang
|
ExHall D Poster #242 | |
STDD: Spatio-Temporal Dual Diffusion for Video Generation
Poster Session 3
Shuaizhen Yao · Xiaoya Zhang · Xin Liu · Mengyi Liu · Zhen Cui
|
ExHall D Poster #183 | |
Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection
Poster Session 4
Fuyun Wang · Tong Zhang · Yuanzhi Wang · Yide Qiu · Xin Liu · Xu Guo · Zhen Cui
|
ExHall D Poster #439 | |
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation
Poster Session 5
Dingcheng Zhen · Shunshun Yin · Shiyang Qin · Hou Yi · Ziwei Zhang · Siyuan Liu · Gan Qi · Ming Tao
|
ExHall D Poster #3 | |
SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts
Shijia Zhao · Qiming Xia · Xusheng Guo · Pufan Zou · Maoji Zheng · Hai Wu · Chenglu Wen · Cheng Wang
|
ExHall D Poster #308 | |
FedCALM: Conflict-aware Layer-wise Mitigation for Selective Aggregation in Deeper Personalized Federated Learning
Poster Session 3
Hao Zheng · Zhigang Hu · Boyu Wang · Liu Yang · Meiguang Zheng · Aikun Xu
|
ExHall D Poster #459 | |
Towards Precise Scaling Laws for Video Diffusion Transformers
Poster Session 4
Yuanyang Yin · Yaqi Zhao · Mingwu Zheng · Ke Lin · Jiarong Ou · Rui Chen · Victor Shea-Jay Huang · Jiahao Wang · Xin Tao · Pengfei Wan · Di ZHANG · Baoqun Yin · Wentao Zhang · Kun Gai
|
ExHall D Poster #224 | |
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content
Poster Session 2
Qiuheng Wang · Yukai Shi · Jiarong Ou · Rui Chen · Ke Lin · Jiahao Wang · Boyuan Jiang · Haotian Yang · Mingwu Zheng · Xin Tao · Fei Yang · Pengfei Wan · Di ZHANG
|
ExHall D Poster #292 | |
Language-Assisted Debiasing and Smoothing for Foundation Model-Based Semi-Supervised Learning
Poster Session 5
Na Zheng · Xuemeng Song · Xue Dong · Aashish Nikhil Ghosh · Liqiang Nie · Roger Zimmermann
|
ExHall D Poster #447 | |
Neuro-3D: Towards 3D Visual Decoding from EEG Signals
Poster Session 5
Zhanqiang Guo · Jiamin Wu · Yonghao Song · Jiahui Bu · Weijian Mai · Qihao Zheng · Wanli Ouyang · Chunfeng Song
|
ExHall D Poster #273 | |
Boosting Adversarial Transferability through Augmentation in Hypothesis Space
Poster Session 4
Yu Guo · Weiquan Liu · Qingshan Xu · Shijun Zheng · Shujun Huang · Yu Zang · Siqi Shen · Chenglu Wen · Cheng Wang
|
ExHall D Poster #322 | |
CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI
Poster Session 3
Siyuan Cheng · Lingjuan Lyu · Zhenting Wang · Xiangyu Zhang · Vikash Sehwag
|
ExHall D Poster #268 | |
CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-Scale Reinforcement Learning in Autonomous Driving
Poster Session 4
Dongkun Zhang · Jiaming Liang · Ke Guo · Sha Lu · Qi Wang · Rong Xiong · Zhenwei Miao · Yue Wang
|
ExHall D Poster #136 | |
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
Poster Session 1
Yunzhi Yan · Zhen Xu · Haotong Lin · Haian Jin · Haoyu Guo · Yida Wang · Kun Zhan · XianPeng Lang · Hujun Bao · Xiaowei Zhou · Sida Peng
|
ExHall D Poster #61 | |
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
Poster Session 2
Tao Xie · Xi Chen · Zhen Xu · Yiman Xie · Yudong Jin · Yujun Shen · Sida Peng · Hujun Bao · Xiaowei Zhou
|
ExHall D Poster #28 | |
FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction
Poster Session 5
Yifan Wang · Peishan Yang · Zhen Xu · Jiaming Sun · Zhanhua Zhang · Chen Yong · Hujun Bao · Sida Peng · Xiaowei Zhou
|
ExHall D Poster #66 | |
Towards Explainable and Unprecedented Accuracy in Matching Challenging Finger Crease Patterns
Zhenyu Zhou · Chengdong Dong · Ajay Kumar
|
ExHall D Poster #74 | |
All-Day Multi-Camera Multi-Target Tracking
Poster Session 4
Huijie Fan · Yu Qiao · Yihao Zhen · Tinghui Zhao · Baojie Fan · Qiang Wang
|
ExHall D Poster #103 | |
OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking
Poster Session 1
Xuanyu Zhang · Zecheng Tang · Zhipei Xu · Runyi Li · Youmin Xu · Bin Chen · Feng Gao · Jian Zhang
|
ExHall D Poster #272 | |
PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
Poster Session 1
Yangyu Huang · Tianyi Gao · Haoran Xu · Qihao Zhao · Yang Song · Zhipeng Gui · Tengchao Lv · Hao Chen · Lei Cui · Scarlett Li · Furu Wei
|
ExHall D Poster #355 | |
DreamTrack: Dreaming the Future for Multimodal Visual Object Tracking
Poster Session 2
Mingzhe Guo · Weiping Tan · Wenyu Ran · Liping Jing · Zhipeng Zhang
|
ExHall D Poster #176 | |
CorrBEV: Multi-View 3D Object Detection by Correlation Learning with Multi-modal Prototypes
Poster Session 6
Ziteng Xue · Mingzhe Guo · Heng Fan · Shihui Zhang · Zhipeng Zhang
|
ExHall D Poster #120 | |
Breaking the Memory Barrier of Contrastive Loss via Tile-Based Strategy
Zesen Cheng · Hang Zhang · Kehan Li · Sicong Leng · Zhiqiang Hu · Fei Wu · Deli Zhao · Xin Li · Lidong Bing
|
ExHall D Poster #444 | |
DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation
Poster Session 1
Zhiqiang Shen · Ammar Sherif · Zeyuan Yin · Shitong Shao
|
ExHall D Poster #443 | |
Prior-free 3D Object Tracking
Xiuqiang Song · Li Jin · Zhengxian Zhang · Jiachen Li · Fan Zhong · Guofeng Zhang · Xueying Qin
|
ExHall D Poster #96 | |
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation
Poster Session 6
Chuhao Chen · Zhiyang Dou · Chen Wang · Yiming Huang · Anjun Chen · Qiao Feng · Jiatao Gu · Lingjie Liu
|
ExHall D Poster #37 | |
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Edward LOO · Tianyu HUANG · Peng Li · Zhiyang Dou · Cheng Lin · Zhiming Cui · Zhen Dong · Sai-Kit Yeung · Wenping Wang · Yuan Liu
|
ExHall D Poster #168 | |
ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model
Poster Session 6
Shunlin Lu · Jingbo Wang · Zeyu Lu · Ling-Hao Chen · Wenxun Dai · Junting Dong · Zhiyang Dou · Bo Dai · Ruimao Zhang
|
ExHall D Poster #162 | |
Interleaved-Modal Chain-of-Thought
Poster Session 4
Jun Gao · Yongqi Li · Ziqiang Cao · Wenjie Li
|
ExHall D Poster #354 | |
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning
Poster Session 5
Gaojian Wang · Feng Lin · Tong Wu · Zhenguang Liu · Zhongjie Ba · Kui Ren
|
ExHall D Poster #319 | |
Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models
Poster Session 4
Zhenguang Liu · Chao Shuai · Shaojing Fan · Ziping Dong · Jinwu Hu · Zhongjie Ba · Kui Ren
|
ExHall D Poster #274 | |
PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
Poster Session 5
Longbin Ji · Lei Zhong · Pengfei Wei · Changjian Li
|
ExHall D Poster #163 | |
Mosaic of Modalities: A Comprehensive Benchmark for Multimodal Graph Learning
Poster Session 3
Jing Zhu · Yuhang Zhou · Shengyi Qian · Zhongmou He · Tong Zhao · Neil Shah · Danai Koutra
|
ExHall D Poster #341 | |
UCM-VeID V2: A Richer Dataset and A Pre-training Method for UAV Cross-Modality Vehicle Re-Identification
Poster Session 5
Xingyue Liu · Jiahao Qi · Chen Chen · Kangcheng Bin · Ping Zhong
|
ExHall D Poster #118 | |
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Poster Session 6
Qifan Yu · Wei Chow · Zhongqi Yue · Kaihang Pan · Yang Wu · Xiaoyang Wan · Juncheng Li · Siliang Tang · Hanwang Zhang · Yueting Zhuang
|
ExHall D Poster #214 | |
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
Poster Session 6
Kaihang Pan · w l · Zhongqi Yue · Tenglong Ao · Liyu Jia · Wei Zhao · Juncheng Li · Siliang Tang · Hanwang Zhang
|
ExHall D Poster #334 | |
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model
Poster Session 4
Yingying Fan · Quanwei Yang · Kaisiyuan Wang · Hang Zhou · Yingying Li · Haocheng Feng · Errui Ding · Yu Wu · Jingdong Wang
|
ExHall D Poster #167 | |
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
Poster Session 5
Jiahao Cui · Hui Li · Qingkun Su · Hanlin Shang · Kaihui Cheng · Yuqi Ma · Shan Mu · Hang Zhou · Jingdong Wang · Siyu Zhu
|
ExHall D Poster #4 | |
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
Ryugo Morita · Stanislav Frolov · Brian Bernhard Moser · Takahiro Shirakawa · Ko Watanabe · Andreas Dengel · Jinjia Zhou
|
ExHall D Poster #227 | |
Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding
Tianyu Chen · Xingcheng Fu · Yisen Gao · Haodong Qian · Yuecen Wei · Kun Yan · Haoyi Zhou · Jianxin Li
|
ExHall D Poster #376 | |
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
Poster Session 3
Jun Zhou · Jiahao Li · Zunnan Xu · Hanhui Li · Yiji Cheng · Fa-Ting Hong · Qin Lin · Qinglin Lu · Xiaodan Liang
|
ExHall D Poster #233 | |
Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions
Poster Session 4
Boran Wen · Dingbang Huang · Zichen Zhang · Jiahong Zhou · Jianbin Deng · Jingyu Gong · Yulong Chen · Lizhuang Ma · Yonglu Li
|
ExHall D Poster #156 | |
SVFR: A Unified Framework for Generalized Video Face Restoration
Poster Session 2
Zhiyao Wang · Xu Chen · Chengming Xu · Junwei Zhu · Xiaobin Hu · Jiangning Zhang · Chengjie Wang · Yuqi Liu · Yiyi Zhou · Rongrong Ji
|
ExHall D Poster #195 | |
Mamba-Reg: Vision Mamba Also Needs Registers
Poster Session 3
Feng Wang · Jiahao Wang · Sucheng Ren · Guoyizhe Wei · Jieru Mei · Wei Shao · Yuyin Zhou · Alan L. Yuille · Cihang Xie
|
ExHall D Poster #411 | |
Adventurer: Optimizing Vision Mamba Architecture Designs for Efficiency
Poster Session 6
Feng Wang · Timing Yang · Yaodong Yu · Sucheng Ren · Guoyizhe Wei · Angtian Wang · Wei Shao · Yuyin Zhou · Alan L. Yuille · Cihang Xie
|
ExHall D Poster #384 | |
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
Poster Session 4
Zhenglin Zhou · Fan Ma · Hehe Fan · Tat-seng Chua
|
ExHall D Poster #8 | |
Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration
Poster Session 5
Junyuan Deng · Xinyi Wu · Yongxing Yang · Congchao Zhu · Song Wang · Zhenyao Wu
|
ExHall D Poster #204 | |
COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts
Jiansheng Li · Xingxuan Zhang · Hao Zou · Yige Guo · Renzhe Xu · Yilong Liu · Chuzhao Zhu · Yue He · Peng Cui
|
ExHall D Poster #364 | |
Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective
Poster Session 5
Duowang Zhu · Xiaohu Huang · Haiyan Huang · Hao Zhou · Zhenfeng Shao
|
ExHall D Poster #286 | |
beta-FFT: Nonlinear Interpolation and Differentiated Training Strategies for Semi-Supervised Medical Image Segmentation
Poster Session 6
Ming Hu · Jianfu Yin · Zhuangzhuang Ma · Jianheng Ma · Feiyu Zhu · Bingbing Wu · Ya Wen · Meng Wu · C Hu · Bingliang Hu · Quan Wang
|
ExHall D Poster #449 | |
ReDiffDet: Rotation-equivariant Diffusion Model for Oriented Object Detection
Poster Session 5
Jiaqi Zhao · Zeyu Ding · Yong Zhou · Hancheng Zhu · Wen-Liang Du · Rui Yao
|
ExHall D Poster #325 | |
Mitigating Ambiguities in 3D Classification with Gaussian Splatting
Poster Session 6
Ruiqi Zhang · Hao Zhu · Jingyi Zhao · Qi Zhang · Xun Cao · Zhan Ma
|
ExHall D Poster #107 | |
Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction
Poster Session 3
Li Fang · Hao Zhu · Longlong Chen · Fei Hu · Long Ye · Zhan Ma
|
ExHall D Poster #54 | |
GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis
Poster Session 5
You Wang · Li Fang · Hao Zhu · Fei Hu · Long Ye · Zhan Ma
|
ExHall D Poster #29 | |
GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting
Poster Session 4
Zixuan Chen · Guangcong Wang · Jiahao Zhu · Jianhuang Lai · Xiaohua Xie
|
ExHall D Poster #45 | |
INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations
Poster Session 3
Yongming Zhu · Longhao Zhang · Zhengkun Rong · Tianshu Hu · Shuang Liang · Zhipengge
|
ExHall D Poster #2 | |
AniMer: Animal Pose and Shape Estimation Using Family Aware Transformer
Poster Session 4
Jin Lyu · Tianyi Zhu · Yi Gu · Li Lin · Pujin Cheng · Yebin Liu · Xiaoying Tang · Liang An
|
ExHall D Poster #161 | |
S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors
Poster Session 4
Xingyu Ren · Jiankang Deng · Yuhao Cheng · Wenhan Zhu · Yichao Yan · Xiaokang Yang · Stefanos Zafeiriou · Chao Ma
|
ExHall D Poster #18 | |
Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture
Poster Session 1
Xuanchen Li · Jianyu Wang · Yuhao Cheng · Yikun Zeng · Xingyu Ren · Wenhan Zhu · Weiming Zhao · Yichao Yan
|
ExHall D Poster #4 | |
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Poster Session 4
Shehan Munasinghe · Hanan Gani · Wenqi Zhu · Jiale Cao · Eric P. Xing · Fahad Shahbaz Khan · Salman Khan
|
ExHall D Poster #309 | |
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Poster Session 4
Henrique Morimitsu · Xiaobin Zhu · Roberto M. Cesar Jr · Xiangyang Ji · Xu-Cheng Yin
|
ExHall D Poster #191 | |
Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation
Hao Zhu · Yan Zhu · Jiayu Xiao · Tianxiang Xiao · Yike Ma · Yucheng Zhang · Feng Dai
|
ExHall D Poster #324 | |
Rethinking Query-based Transformer for Continual Image Segmentation
Poster Session 1
Yuchen Zhu · Cheng Shi · Dingyou Wang · Jiajin Tang · Zhengxuan Wei · Yu Wu · Guanbin Li · Sibei Yang
|
ExHall D Poster #424 | |
CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image
Jingshun Huang · Haitao Lin · Tianyu Wang · Yanwei Fu · Xiangyang Xue · Yi Zhu
|
ExHall D Poster #97 | |
EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights
Poster Session 4
Zhenghao Xing · Hao Chen · Binzhu Xie · Jiaqi Xu · Ziyu Guo · Xuemiao Xu · Jianye Hao · Chi-Wing Fu · Xiaowei Hu · Pheng-Ann Heng
|
ExHall D Poster #315 | |
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
Poster Session 3
Zhihe Yang · Xufang Luo · Dongqi Han · Yunjian Xu · Dongsheng Li
|
ExHall D Poster #373 | |
HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation
Poster Session 5
Kun Liu · Qi Liu · Xinchen Liu · Jie Li · Yongdong Zhang · Jiebo Luo · Xiaodong He · Wu Liu
|
ExHall D Poster #285 | |
Mask^2DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation
Poster Session 4
Tianhao Qi · Jianlong Yuan · Wanquan Feng · Shancheng Fang · Jiawei Liu · SiYu Zhou · Qian HE · Hongtao Xie · Yongdong Zhang
|
ExHall D Poster #291 | |
Incomplete Multi-modal Brain Tumor Segmentation via Learnable Sorting State Space Model
Poster Session 5
Zheyu Zhang · Yayuan Lu · Feipeng Ma · Yueyi Zhang · Huanjing Yue · Xiaoyan Sun
|
ExHall D Poster #475 | |
VISTREAM: Improving Computation Efficiency of Visual Streaming Perception via Law-of-Charge-Conservation Inspired Spiking Neural Network
Poster Session 2
Kang You · Ziling Wei · Jing Yan · Boning Zhang · Qinghai Guo · Yaoyu Zhang · Zhezhi He
|
ExHall D Poster #327 | |
SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization
Poster Session 4
Yi Du · Zhipeng Zhao · Shaoshu Su · Sharath Golluri · Haoze Zheng · Runmao Yao · Chen Wang
|
ExHall D Poster #109 | |
Bridging Modalities: Improving Universal Multimodal Retrieval by Multimodal Large Language Models
Poster Session 2
Xin Zhang · Yanzhao Zhang · Wen Xie · Mingxin Li · Ziqi Dai · Dingkun Long · Pengjun Xie · Meishan Zhang · Wenjie Li · Min Zhang
|
ExHall D Poster #372 | |
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Poster Session 1
Ziqi Pang · Tianyuan Zhang · Fujun Luan · Yunze Man · Hao Tan · Kai Zhang · William Freeman · Yu-Xiong Wang
|
ExHall D Poster #222 | |
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation
Poster Session 2
Lang Lin · Xueyang Yu · Ziqi Pang · Yu-Xiong Wang
|
ExHall D Poster #314 | |
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Poster Session 6
Zengqun Zhao · Ziquan Liu · Yu Cao · Shaogang Gong · Ioannis Patras
|
ExHall D Poster #246 | |
FSBench: A Figure Skating Benchmark for Advancing Artistic Sports Understanding
Poster Session 3
Rong Gao · Xin Liu · Zhuozhao Hu · Bohao Xing · Baiqiang XIA · Zitong YU · Heikki Kälviäinen
|
ExHall D Poster #281 | |
MoEdit: On Learning Quantity Perception for Multi-object Image Editing
Poster Session 1
Yanfeng Li · Ka-Hou Chan · Yue Sun · Chan-Tong Lam · Tong Tong · Zitong YU · Keren Fu · Xiaohong Liu · Tao Tan
|
ExHall D Poster #241 | |
EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models
Poster Session 2
Yinan Liang · Ziwei Wang · Xiuwei Xu · Jie Zhou · Jiwen Lu
|
ExHall D Poster #388 | |
Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
Wenxuan Guo · Xiuwei Xu · Ziwei Wang · Jianjiang Feng · Jie Zhou · Jiwen Lu
|
ExHall D Poster #333 | |
MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing
Poster Session 6
Cong Wang · Di Kang · Heyi Sun · SHENHAN QIAN · Zixuan Wang · Linchao Bao · Song-Hai Zhang
|
ExHall D Poster #7 | |
PrEditor3D: Fast and Precise 3D Shape Editing
Poster Session 1
Ziya Erkoc · Can Gümeli · Chaoyang Wang · Matthias Nießner · Angela Dai · Peter Wonka · Hsin-Ying Lee · Peiye Zhuang
|
ExHall D Poster #44 | |
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Poster Session 1
Ziyang Wang · Shoubin Yu · Elias Stengel-Eskin · Jaehong Yoon · Feng Cheng · Gedas Bertasius · Mohit Bansal
|
ExHall D Poster #297 | |
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation
Poster Session 1
Ziyang Xie · Zhizheng Liu · Zhenghao Peng · Wayne Wu · Bolei Zhou
|
ExHall D Poster #132 | |
ProtoDepth: Unsupervised Continual Depth Completion with Prototypes
Poster Session 2
Patrick Rim · Hyoungseob Park · Suchisrit Gangopadhyay · Ziyao Zeng · Younjoon Chung · Alex Wong
|
ExHall D Poster #85 | |
Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification
Poster Session 5
Yanghao Wang · Long Chen
|
ExHall D Poster #432 | |
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Poster Session 5
Zijing Hu · Fengda Zhang · Long Chen · Kun Kuang · Jiahui Li · Kaifeng Gao · Jun Xiao · Xin Wang · Wenwu Zhu
|
ExHall D Poster #245 | |
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Wei Chen · Lin Li · Yongqi Yang · Bin Wen · Fan Yang · Tingting Gao · Yu Wu · Long Chen
|
ExHall D Poster #258 | |
Motions as Queries: One-Stage Multi-Person Holistic Human Motion Capture
Poster Session 4
Kenkun Liu · Yurong Fu · Weihao Yuan · Jing Lin · Peihao Li · Xiaodong Gu · Lingteng Qiu · Haoqian Wang · Zilong Dong · Xiaoguang Han
|
ExHall D Poster #165 | |
AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
Poster Session 5
Lingteng Qiu · Shenhao Zhu · Qi Zuo · Xiaodong Gu · Yuan Dong · Junfei Zhang · Chao Xu · Zhe Li · Weihao Yuan · Liefeng Bo · Guanying Chen · Zilong Dong
|
ExHall D Poster #10 | |
PAVE: Patching and Adapting Video Large Language Models
Poster Session 1
Zhuoming Liu · Yiquan Li · Khoi D Nguyen · Yiwu Zhong · Yin Li
|
ExHall D Poster #300 | |
IterIS: Iterative Inference-Solving Alignment for LoRA Merging
Poster Session 1
Hongxu chen · Zhen Wang · Runshi Li · Bowei Zhu · Long Chen
|
ExHall D Poster #446 | |
LLM-driven Multimodal and Multi-Identity Listening Head Generation
Poster Session 3
Peiwen Lai · Weizhi Zhong · Yipeng Qin · Xiaohang Ren · Baoyuan Wang · Guanbin Li
|
ExHall D Poster #1 | |
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
Poster Session 1
Zhimin Liao · Ping Wei · Shuaijia Chen · Haoxuan Wang · Ziyang Ren
|
ExHall D Poster #126 | |
Show and Segment: Universal Medical Image Segmentation via In-Context Learning
Poster Session 4
Yunhe Gao · Di Liu · Zhuowei Li · Yunsheng Li · Dongdong Chen · Mu Zhou · Dimitris N. Metaxas
|
ExHall D Poster #478 | |
MLLM-as-a-Judge for Image Safety without Human Labeling
Zhenting Wang · Shuming Hu · Shiyu Zhao · Xiaowen Lin · Felix Juefei-Xu · Zhuowei Li · Ligong Han · Harihar Subramanyam · Li Chen · Jianfa Chen · nan jiang · Lingjuan Lyu · Shiqing Ma · Dimitris N. Metaxas · Ankit Jain
|
ExHall D Poster #384 | |
MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos
Poster Session 3
Zhengqi Li · Richard Tucker · Forrester Cole · Qianqian Wang · Linyi Jin · Vickie Ye · Angjoo Kanazawa · Aleksander Holynski · Noah Snavely
|
ExHall D Poster #78 | |
Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos
Poster Session 3
Linyi Jin · Richard Tucker · Zhengqi Li · David Fouhey · Noah Snavely · Aleksander Holynski
|
ExHall D Poster #88 | |
Can Generative Video Models Help Pose Estimation?
Ruojin Cai · Jason Y. Zhang · Philipp Henzler · Zhengqi Li · Noah Snavely · Ricardo Martin
|
ExHall D Poster #90 | |
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation
Poster Session 2
Yiyang Ma · Xingchao Liu · Xiaokang Chen · Wen Liu · Chengyue Wu · Zhiyu Wu · Zizheng Pan · Zhenda Xie · Haowei Zhang · Xingkai Yu · Liang Zhao · Yisong Wang · Jiaying Liu · Chong Ruan
|
ExHall D Poster #227 | |
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
Poster Session 3
Zhiyuan Ma · Xinyue Liang · Rongyuan Wu · Xiangyu Zhu · Zhen Lei · Lei Zhang
|
ExHall D Poster #37 | |
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach
Poster Session 1
Lingchen Sun · Rongyuan Wu · Zhiyuan Ma · Shuaizheng Liu · Qiaosi Yi · Lei Zhang
|
ExHall D Poster #204 | |
MVBoost: Boost 3D Reconstruction with Multi-View Refinement
Poster Session 5
Xiangyu Liu · Xiaomei Zhang · Zhiyuan Ma · Xiangyu Zhu · Zhen Lei
|
ExHall D Poster #58 | |
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Poster Session 1
Ziming Huang · Xurui Li · Haotian Liu · Feng Xue · Yuzhe Wang · Yu Zhou
|
ExHall D Poster #439 | |
SpiritSight Agent: Advanced GUI Agent with One Look
Poster Session 6
Zhiyuan Huang · Ziming Cheng · Junting Pan · Zhaohui Hou · Mingjie Zhan
|
ExHall D Poster #319 | |
Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition
Hongda Liu · Yunfan Liu · Min Ren · Hao Wang · Yunlong Wang · Zhenan Sun
|
ExHall D Poster #296 | |
Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction
Poster Session 6
Shiyu Zhao · Zhenting Wang · Felix Juefei-Xu · Xide Xia · Miao Liu · Xiaofang Wang · Mingfu Liang · Ning Zhang · Dimitris N. Metaxas · Licheng Yu
|
ExHall D Poster #356 | |
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Poster Session 4
Bolin Lai · Felix Juefei-Xu · Miao Liu · Xiaoliang Dai · Nikhil Mehta · Chenguang Zhu · Zeyi Huang · James Rehg · Sangmin Lee · Ning Zhang · Tong Xiao
|
ExHall D Poster #243 | |
Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation
Poster Session 6
Qiang Zhang · Mengsheng Zhao · Jiawei Liu · Fanrui Zhang · Yongchao Xu · Zheng-Jun Zha
|
ExHall D Poster #419 | |
IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos
Poster Session 5
Yuan Li · Ziqian Bai · Feitong Tan · Zhaopeng Cui · Sean Fanello · Yinda Zhang
|
ExHall D Poster #6 | |
AirRoom: Objects Matter in Room Reidentification
Poster Session 1
Runmao Yao · Yi Du · Zhuoqun Chen · Haoze Zheng · Chen Wang
|
ExHall D Poster #113 | |
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
Poster Session 3
Qi Zhu · Jiangwei Lao · Deyi Ji · Junwei Luo · Kang Wu · Yingying Zhang · Lixiang Ru · Jian Wang · Jingdong Chen · Ming Yang · Dong Liu · Feng Zhao
|
ExHall D Poster #391 | |
World-consistent Video Diffusion with Explicit 3D Modeling
Qihang Zhang · Shuangfei Zhai · Miguel Ángel Bautista · Kevin Miao · Alexander Toshev · Joshua Susskind · Jiatao Gu
|
ExHall D Poster #60 | |
MODfinity: Unsupervised Domain Adaptation with Multimodal Information Flow Intertwining
Poster Session 1
Shanglin Liu · Jianming Lv · Jingdan Kang · Huaidong Zhang · Zequan Liang · Shengfeng He
|
ExHall D Poster #471 | |
PhyS-EdiT: Physics-aware Semantic Image Editing with Text Description
Poster Session 2
Ziqi Cai · Shuchen Weng · Yifei Xia · Boxin Shi
|
ExHall D Poster #239 | |
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
Poster Session 3
Zining Wang · Tongkun Guan · Pei Fu · Chen Duan · Qianyi Jiang · Zhentao Guo · Shan Guo · Junfeng Luo · Wei Shen · Xiaokang Yang
|
ExHall D Poster #364 | |
Generative Inbetweening through Frame-wise Conditions-Driven Video Generation
Poster Session 6
Tianyi Zhu · Dongwei Ren · Qilong Wang · Xiaohe Wu · Wangmeng Zuo
|
ExHall D Poster #171 | |
Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels
Poster Session 5
Yongshuo Zong · Qin ZHANG · DONGSHENG An · Zhihua Li · Xiang Xu · Linghan Xu · Zhuowen Tu · Yifan Xing · Onkar Dabeer
|
ExHall D Poster #345 | |
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
Poster Session 3
Zhantao Yang · Ruili Feng · Keyu Yan · Huangji Wang · Zhicai Wang · Shangwen Zhu · Han Zhang · Jie Xiao · Pingyu Wu · Kai Zhu · Jixuan Chen · Chen-Wei Xie · Yue Yang · Hongyang Zhang · Yu Liu · Fan Cheng
|
ExHall D Poster #356 | |
SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis
Poster Session 5
Bangbang Zhou · Zuan Gao · Zixiao Wang · Boqiang Zhang · Yuxin Wang · Zhineng Chen · Hongtao Xie
|
ExHall D Poster #360 | |
BADGR: Bundle Adjustment Diffusion Conditioned by Gradients for Wide-Baseline Floor Plan Reconstruction
Yuguang Li · Ivaylo Boyadzhiev · Zixuan Liu · Linda Shapiro · Alex Colburn
|
ExHall D Poster #92 | |
Collaborative Tree Search for Enhancing Embodied Multi-Agent Collaboration
Poster Session 6
Lizheng Zu · Lin Lin · Song Fu · Na Zhao · Pan Zhou
|
ExHall D Poster #321 | |
Federated Learning with Domain Shift Eraser
Poster Session 1
Zheng Wang · Zihui Wang · Zheng Wang · Xiaoliang Fan · Cheng Wang
|
ExHall D Poster #460 | |
Homogeneous Dynamics Space for Heterogeneous Humans
Poster Session 6
Xinpeng Liu · Junxuan Liang · Chenshuo Zhang · Zixuan Cai · Cewu Lu · Yonglu Li
|
ExHall D Poster #154 | |
Linear Attention Modeling for Learned Image Compression
Poster Session 2
Donghui Feng · Zhengxue Cheng · Shen Wang · Ronghua Wu · Hongwei Hu · Guo Lu · Li Song
|
ExHall D Poster #215 | |
Decoupled Motion Expression Video Segmentation
Poster Session 3
Hao Fang · Runmin Cong · Xiankai Lu · Xiaofei Zhou · Sam Kwong · Wei Zhang
|
ExHall D Poster #303 | |
Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion
Poster Session 6
Zexin He · Tengfei Wang · Xin Huang · Xingang Pan · Ziwei Liu
|
ExHall D Poster #34 | |
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
Poster Session 4
Mark Boss · Zixuan Huang · Aaryaman Vasishta · Varun Jampani
|
ExHall D Poster #37 | |
Symmetry Strikes Back: From Single-Image Symmetry Detection to 3D Generation
Poster Session 1
Xiang Li · Zixuan Huang · Anh Thai · James Rehg
|
ExHall D Poster #54 | |
Detect Any Mirrors: Boosting Learning Reliability on Large-Scale Unlabeled Data with an Iterative Data Engine
Poster Session 5
Zhaohu Xing · Lihao Liu · Yijun Yang · Hongqiu Wang · Tian Ye · Sixiang Chen · Wenxue Li · Guang Liu · Lei Zhu
|
ExHall D Poster #423 | |
SnowMaster: Comprehensive Real-world Image Desnowing via MLLM with Multi-Model Feedback Optimization
Poster Session 1
Jianyu LAI · Sixiang Chen · yunlong lin · Tian Ye · Yun Liu · Song Fei · Zhaohu Xing · Hongtao Wu · Weiming Wang · Lei Zhu
|
ExHall D Poster #394 | |
Tightening Robustness Verification of MaxPool-based Neural Networks via Minimizing the Over-Approximation Zone
Poster Session 4
Yuan Xiao · Yuchen Chen · Shiqing Ma · Chunrong Fang · Tongtong Bai · Mingzheng Gu · Yuxin Cheng · Yanwei Chen · Zhenyu Chen
|
ExHall D Poster #465 | |
JTD-UAV: MLLM-Enhanced Joint Tracking and Description Framework for Anti-UAV Systems
Poster Session 1
Yifan Wang · Jian Zhao · Zhaoxin Fan · Xin Zhang · Xuecheng Wu · Yudian Zhang · Lei Jin · Xinyue Li · Gang Wang · Mengxi Jia · Ping Hu · Zheng Zhu · Xuelong Li
|
ExHall D Poster #137 | |
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter
Poster Session 4
Yaohua Zha · Yanzi Wang · Hang Guo · Jinpeng Wang · Tao Dai · Bin Chen · Zhihao Ouyang · Xue Yuerong · Ke Chen · Shu-Tao Xia
|
ExHall D Poster #111 | |
Adapting Pre-trained 3D Models for Point Cloud Video Understanding via Cross-frame Spatio-temporal Perception
Poster Session 3
Baixuan Lv · Yaohua Zha · Tao Dai · Xue Yuerong · Ke Chen · Shu-Tao Xia
|
ExHall D Poster #168 | |
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
Poster Session 5
Jinpeng Wang · Tianci Luo · Yaohua Zha · Yan Feng · Ruisheng Luo · Bin Chen · Tao Dai · Long Chen · Yaowei Wang · Shu-Tao Xia
|
ExHall D Poster #393 | |
MambaIRv2: Attentive State Space Restoration
Poster Session 6
Hang Guo · Yong Guo · Yaohua Zha · Yulun Zhang · Wenbo Li · Tao Dai · Shu-Tao Xia · Yawei Li
|
ExHall D Poster #186 | |
SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation
Poster Session 2
Hritam Basak · Zhaozheng Yin
|
ExHall D Poster #423 | |
Alignment, Mining and Fusion: Representation Alignment with Hard Negative Mining and Selective Knowledge Fusion for Medical Visual Question Answering
Poster Session 6
Yuanhao Zou · Zhaozheng Yin
|
ExHall D Poster #332 | |
Number it: Temporal Grounding Videos like Flipping Manga
Poster Session 3
Yongliang Wu · Xinting Hu · Yuyang Sun · Yizhou Zhou · Wenbo Zhu · Fengyun Rao · Bernt Schiele · Xu Yang
|
ExHall D Poster #297 | |
NVILA: Efficient Frontier Visual Language Models
Poster Session 1
Zhijian Liu · Ligeng Zhu · Baifeng Shi · Zhuoyang Zhang · Yuming Lou · Shang Yang · Haocheng Xi · Shiyi Cao · Yuxian Gu · Dacheng Li · Xiuyu Li · Haotian Tang · Yunhao Fang · Yukang Chen · Cheng-Yu Hsieh · De-An Huang · An-Chieh Cheng · Jinyi Hu · Sifei Liu · Ranjay Krishna · Pavlo Molchanov · Jan Kautz · Danny Yin · Song Han · Yao Lu
|
ExHall D Poster #377 | |
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
Poster Session 1
Qingqing Zhao · Yao Lu · Moo Jin Kim · Zipeng Fu · Zhuoyang Zhang · Yecheng Wu · Max Li · Qianli Ma · Song Han · Chelsea Finn · Ankur Handa · Tsung-Yi Lin · Gordon Wetzstein · Ming-Yu Liu · Donglai Xiang
|
ExHall D Poster #143 | |
Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement
Poster Session 5
Yuchen Ren · Zhengyu Zhao · Chenhao Lin · Bo Yang · Lu Zhou · Zhe Liu · Chao Shen
|
ExHall D Poster #385 | |
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP
Poster Session 3
Songlong Xing · Zhengyu Zhao · Nicu Sebe
|
ExHall D Poster #433 | |
Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection
Poster Session 3
Le Yang · Ziwei Zheng · Boxu Chen · Zhengyu Zhao · Chenhao Lin · Chao Shen
|
ExHall D Poster #382 | |
SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
Poster Session 1
Yushu Wu · Zhixing Zhang · Yanyu Li · Yanwu Xu · Anil Kag · Yang Sui · Huseyin Coskun · Ke Ma · Aleksei Lebedev · Ju Hu · Dimitris N. Metaxas · Yanzhi Wang · Sergey Tulyakov · Jian Ren
|
ExHall D Poster #221 | |
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs
Poster Session 1
Zicheng Zhang · Ziheng Jia · Haoning Wu · Chunyi Li · Zijian Chen · Yingjie Zhou · Wei Sun · Xiaohong Liu · Xiongkuo Min · Weisi Lin · Guangtao Zhai
|
ExHall D Poster #293 | |
A Focused Human Body Model for Accurate Anthropometric Measurements Extraction
Poster Session 5
Shuhang Chen · Xianliang Huang · Zhizhou Zhong · Jihong Guan · Shuigeng Zhou
|
ExHall D Poster #152 | |
Accurate Differential Operators for Hybrid Neural Fields
Poster Session 1
Aditya Chetan · Guandao Yang · Zichen Wang · Steve Marschner · Bharath Hariharan
|
ExHall D Poster #34 | |
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
Poster Session 6
ziang yan · Zhilin Li · Yinan He · Chenting Wang · Kunchang Li · Xinhao Li · Xiangyu Zeng · Zilei Wang · Yali Wang · Yu Qiao · Limin Wang · Yi Wang
|
ExHall D Poster #357 | |
High-quality Point Cloud Oriented Normal Estimation via Hybrid Angular and Euclidean Distance Encoding
Poster Session 1
Yuanqi Li · Jingcheng Huang · Hongshen Wang · Peiyuan Lv · Yansong Liu · Jiuming Zheng · Jie Guo · Yanwen Guo
|
ExHall D Poster #104 | |
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation
Poster Session 6
Seokil Ham · Hee-Seon Kim · Sangmin Woo · Changick Kim
|
ExHall D Poster #378 | |
Text Augmented Correlation Transformer For Few-shot Classification & Segmentation
Poster Session 5
Srinivasa Rao Nandam · Sara Atito · Zhenhua Feng · Josef Kittler · Muhammad Awais
|
ExHall D Poster #412 | |
ESC: Erasing Space Concept for Knowledge Deletion
Tae-Young Lee · Sundong Park · Minwoo Jeon · Hyoseok Hwang · Gyeong-Moon Park
|
ExHall D Poster #463 | |
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Poster Session 4
Jinlong Li · Cristiano Saltori · Fabio Poiesi · Nicu Sebe
|
ExHall D Poster #342 | |
HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting
Poster Session 1
Jingyu Lin · Jiaqi Gu · Lubin Fan · Bojian Wu · Yujing Lou · Renjie Chen · Ligang Liu · Jieping Ye
|
ExHall D Poster #58 | |
Layered Motion Fusion: Lifting Motion Segmentation to 3D in Egocentric Videos
Poster Session 4
Vadim Tschernezki · Diane Larlus · Andrea Vedaldi · Iro Laina
|
ExHall D Poster #175 | |
Twinner: Shining Light on Digital Twins in a Few Snaps
Poster Session 2
Jesus Zarzar · Tom Monnier · Roman Shapovalov · Andrea Vedaldi · David Novotny
|
ExHall D Poster #39 | |
VGGT: Visual Geometry Grounded Transformer
Poster Session 2
Jianyuan Wang · Minghao Chen · Nikita Karaev · Andrea Vedaldi · Christian Rupprecht · David Novotny
|
ExHall D Poster #86 | |
DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction
Ben Kaye · Tomas Jakab · Shangzhe Wu · Christian Rupprecht · Andrea Vedaldi
|
ExHall D Poster #100 | |
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes
Jan Held · Renaud Vandeghen · Abdullah J Hamdi · Anthony Cioppa · Adrien Deliege · Silvio Giancola · Andrea Vedaldi · Bernard Ghanem · Marc Van Droogenbroeck
|
ExHall D Poster #30 | |
PartGen: Part-level 3D Generation and Reconstruction with Multi-view Diffusion Models
Minghao Chen · Roman Shapovalov · Iro Laina · Tom Monnier · Jianyuan Wang · David Novotny · Andrea Vedaldi
|
ExHall D Poster #42 | |
A Unified Latent Schrödinger Bridge Diffusion Model for Unsupervised Anomaly Detection and Localization
Poster Session 5
Shilhora Akshay · Niveditha Lakshmi Narasimhan · Jacob George · Vineeth Balasubramanian
|
ExHall D Poster #429 | |
LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping
Poster Session 1
Pascal Chang · Sergio Sancho · Jingwei Tang · Markus Gross · Vinicius C. Azevedo
|
ExHall D Poster #215 | |
Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects
Poster Session 4
Amir Barda · Matheus Gadelha · Vladimir G. Kim · Noam Aigerman · Amit H. Bermano · Thibault Groueix
|
ExHall D Poster #40 | |
DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery
Poster Session 6
Utkarsh Mall · Cheng Perng Phoo · Mia Chiquier · Bharath Hariharan · Kavita Bala · Carl Vondrick
|
ExHall D Poster #297 | |
SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection
Poster Session 1
Phi Vu Tran
|
ExHall D Poster #431 | |
MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures
Poster Session 3
Lucas Morin · Valery Weber · Ahmed Nassar · Gerhard Ingmar Meijer · Luc Van Gool · Yawei Li · Peter W. J. Staar
|
ExHall D Poster #368 | |
Preserving Clusters in Prompt Learning for Unsupervised Domain Adaptation
Poster Session 4
Long Tung Vuong · Hoang Phan · Vy Vo · Anh Tuan Bui · Thanh-Toan Do · Trung Le · Dinh Phung
|
ExHall D Poster #398 | |
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation
Poster Session 1
Weiming Ren · Huan Yang · Jie Min · Cong Wei · Wenhu Chen
|
ExHall D Poster #346 | |
Beyond Single-Modal Boundary: Cross-Modal Anomaly Detection through Visual Prototype and Harmonization
Poster Session 2
Kai Mao · Ping Wei · Yiyang Lian · Yangyang Wang · Nanning Zheng
|
ExHall D Poster #437 | |
Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation
Poster Session 6
Ying Jin · Jinlong Peng · Qingdong He · Teng Hu · Jiafu Wu · Hao Chen · Haoxuan Wang · wenbing zhu · Mingmin Chi · Jun Liu · Yabiao Wang
|
ExHall D Poster #409 | |
Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression
Poster Session 4
Hsiang-Wei Huang · Fu-Chen Chen · Wenhao Chai · Che-Chun Su · Lu Xia · Sanghun Jung · Cheng-Yen Yang · Jenq-Neng Hwang · Min Sun · Cheng-Hao Kuo
|
ExHall D Poster #345 | |
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Poster Session 1
Rui Zhao · Weijia Mao · Mike Zheng Shou
|
ExHall D Poster #256 | |
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
Poster Session 3
Chengyou Jia · Changliang Xia · Zhuohang Dang · Weijia Wu · Hangwei Qian · Minnan Luo
|
ExHall D Poster #251 | |
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability
Poster Session 2
Weijie Zhou · Manli Tao · Chaoyang Zhao · Haiyun Guo · Honghui Dong · Ming Tang · Jinqiao Wang
|
ExHall D Poster #150 | |
LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models
Poster Session 6
Fan-Yun Sun · Weiyu Liu · Siyi Gu · Dylan Lim · Goutam Bhat · Federico Tombari · Manling Li · Nick Haber · Jiajun Wu
|
ExHall D Poster #317 | |
JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration
Poster Session 5
yunlong lin · Zixu Lin · Haoyu Chen · Panwang Pan · Chenxin Li · Sixiang Chen · Kairun Wen · Yeying Jin · Wenbo Li · Xinghao Ding
|
ExHall D Poster #126 | |
EMOE: Modality-Specific Enhanced Dynamic Emotion Experts
Poster Session 3
Yiyang Fang · Wenke Huang · Guancheng Wan · Kehua Su · Mang Ye
|
ExHall D Poster #350 | |
Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes
Ludwic Leonard · Nils Thuerey · rüdiger westermann
|
ExHall D Poster #30 | |
Generative Hard Example Augmentation for Semantic Point Cloud Segmentation
Poster Session 5
Qi Zhang · Jibin Peng · Zhao Huang · Wei Feng · Di Lin
|
ExHall D Poster #110 | |
Real-IAD D³: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection
Poster Session 3
wenbing zhu · Lidong Wang · Ziqing Zhou · Chengjie Wang · Yurui Pan · Ruoyi.Zhang · Zhuhao Chen · Linjie Cheng · Bin-Bin Gao · Jiangning Zhang · Zhenye Gan · Yuxie Wang · Yulong Chen · Bruce Qian · Mingmin Chi · Bo Peng · Lizhuang Ma
|
ExHall D Poster #437 | |
Generalizable Object Keypoint Localization from Generative Priors
Poster Session 4
Dongkai Wang · Jiang Duan · Liangjian Wen · Shiyu Xuan · Hao CHEN · Shiliang Zhang
|
ExHall D Poster #425 | |
Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution
Siwei Tu · Ben Fei · Weidong Yang · Fenghua Ling · Hao Chen · Zili Liu · Kun Chen · Hang Fan · Wanli Ouyang · Lei Bai
|
ExHall D Poster #181 | |
Towards Lossless Implicit Neural Representation via Bit Plane Decomposition
Poster Session 1
Woo Kyoung Han · Byeonghun Lee · Hyunmin Cho · Sunghoon Im · Kyong Hwan Jin
|
ExHall D Poster #198 | |
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation
Poster Session 4
Yiming Qin · Zhu Xu · Yang Liu
|
ExHall D Poster #262 | |
DefectFill: Realistic Defect Generation with Inpainting Diffusion Model for Visual Inspection
Jaewoo Song · Daemin Park · Kanghyun Baek · Sangyub Lee · Jooyoung Choi · Eunji Kim · Sungroh Yoon
|
ExHall D Poster #280 | |
Multi-Modal Aerial-Ground Cross-View Place Recognition with Neural ODEs
Poster Session 3
Sijie Wang · Rui She · Qiyu Kang · Siqi Li · Disheng Li · Tianyu Geng · Shangshu Yu · Wee Peng Tay
|
ExHall D Poster #103 | |
Three-view Focal Length Recovery From Homographies
Poster Session 3
Yaqing Ding · Viktor Kocur · Zuzana Berger Haladova · Qianliang Wu · Shen Cai · Jian Yang · Zuzana Kukelova
|
ExHall D Poster #82 | |
RobSense: A Robust Multi-modal Foundation Model for Remote Sensing with Static, Temporal, and Incomplete Data Adaptability
Poster Session 2
Minh Kha Do · Kang Han · Phu Lai · Khoa T. Phan · Wei Xiang
|
ExHall D Poster #197 | |
Generating Multimodal Driving Scenes via Next-Scene Prediction
Poster Session 2
Yanhao Wu · Haoyang Zhang · Tianwei Lin · Alan Huang · Shujie Luo · Rui Wu · Congpei Qiu · Wei Ke · Tong Zhang
|
ExHall D Poster #141 | |
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
Poster Session 3
Chenxin Tao · Shiqian Su · Xizhou Zhu · Chenyu Zhang · Zhe Chen · Jiawen Liu · Wenhai Wang · Lewei Lu · Gao Huang · Yu Qiao · Jifeng Dai
|
ExHall D Poster #374 | |
Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
Poster Session 4
Jiaming Zhang · Junhong Ye · Xingjun Ma · Yige Li · Yunfan Yang · Yunhao Chen · Jitao Sang · Dit-Yan Yeung
|
ExHall D Poster #390 | |
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
Poster Session 5
Chenyu Yang · Xuan Dong · Xizhou Zhu · Weijie Su · Jiahao Wang · Hao Tian · Zhe Chen · Wenhai Wang · Lewei Lu · Jifeng Dai
|
ExHall D Poster #373 | |
SCSA: A Plug-and-Play Semantic Continuous-Sparse Attention for Arbitrary Semantic Style Transfer
Chunnan Shang · Zhizhong Wang · Hongwei Wang · Xiangming Meng
|
ExHall D Poster #229 | |
Unified Dense Prediction of Video Diffusion
Poster Session 6
Lehan Yang · Lu Qi · Xiangtai Li · Sheng Li · Varun Jampani · Ming-Hsuan Yang
|
ExHall D Poster #269 | |
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene
Shengqiong Wu · Hao Fei · Jingkang Yang · Xiangtai Li · Juncheng Li · Hanwang Zhang · Tat-seng Chua
|
ExHall D Poster #335 | |
StoryGPT-V: Large Language Models as Consistent Story Visualizers
Poster Session 3
Xiaoqian Shen · Mohamed Elhoseiny
|
ExHall D Poster #250 | |
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
Poster Session 6
Bingda Tang · Sayak Paul · Boyang Zheng · Saining Xie
|
ExHall D Poster #231 | |
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
Poster Session 3
Jihan Yang · Shusheng Yang · Anjali W. Gupta · Rilyn Han · Li Fei-Fei · Saining Xie
|
ExHall D Poster #287 | |
Diffusion Model is Effectively Its Own Teacher
Poster Session 3
Xinyin Ma · Runpeng Yu · Songhua Liu · Gongfan Fang · Xinchao Wang
|
ExHall D Poster #215 | |
Scaling Down Text Encoders of Text-to-Image Diffusion Models
Poster Session 4
Lifu Wang · Daqing Liu · Xinchen Liu · Xiaodong He
|
ExHall D Poster #253 | |
AVQACL: A Novel Benchmark for Audio-Visual Question Answering Continual Learning
Poster Session 1
Kaixuan Wu · Xinde Li · Xinglin Li · Chuanfei Hu · Guoliang Wu
|
ExHall D Poster #295 | |
Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond
Poster Session 4
Guanyao Wu · Haoyu Liu · Hongming Fu · Yichuan Peng · Jinyuan Liu · Xin Fan · Risheng Liu
|
ExHall D Poster #198 | |
TAPT: Test-Time Adversarial Prompt Tuning for Robust Inference in Vision-Language Models
Poster Session 4
Xin Wang · Kai Chen · Jiaming Zhang · Jingjing Chen · Xingjun Ma
|
ExHall D Poster #391 | |
Imputation-free and Alignment-free: Incomplete Multi-view Clustering Driven by Consensus Semantic Learning
Poster Session 1
yuzhuo dai · Jiaqi Jin · Zhibin Dong · Siwei Wang · Xinwang Liu · En Zhu · Xihong Yang · Xinbiao Gan · Yu Feng
|
ExHall D Poster #469 | |
EASEMVC: Efficient Dual Selection Mechanism for Deep Multi-View Clustering
Poster Session 4
Baili Xiao · Zhibin Dong · KE LIANG · Suyuan Liu · Siwei Wang · Tianrui Liu · Xingchen Hu · En Zhu · Xinwang Liu
|
ExHall D Poster #467 | |
Deformable Radial Kernel Splatting
Poster Session 5
Yihua Huang · Mingxian Lin · Yangtian Sun · Ziyi Yang · Xiaoyang Lyu · Yan-Pei Cao · Xiaojuan Qi
|
ExHall D Poster #44 | |
Parameterized Blur Kernel Prior Learning for Local Motion Deblurring
Poster Session 5
Zhenxuan Fang · Fangfang Wu · Tao Huang · Le Dong · Weisheng Dong · Xin Li · Guangming Shi
|
ExHall D Poster #185 | |
IDEA-Bench: How Far are Generative Models from Professional Designing?
Poster Session 4
Chen Liang · Lianghua Huang · Jingwu Fang · Huanzhang Dou · Wei Wang · Zhi-Fan Wu · Yupeng Shi · Junge Zhang · Xin Zhao · Yu Liu
|
ExHall D Poster #264 | |
Transformers without Normalization
Poster Session 3
Jiachen Zhu · Xinlei Chen · Kaiming He · Yann LeCun · Zhuang Liu
|
ExHall D Poster #406 | |
Enhancing Dataset Distillation via Non-Critical Region Refinement
Poster Session 2
Minh-Tuan Tran · Trung Le · Xuan-May Le · Thanh-Toan Do · Dinh Phung
|
ExHall D Poster #442 | |
Few-shot Implicit Function Generation via Equivariance
Suizhi Huang · Xingyi Yang · Hongtao Lu · Xinchao Wang
|
ExHall D Poster #39 | |
POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation
Poster Session 6
Lanyun Zhu · Tianrun Chen · Qianxiong Xu · Xuanyi Liu · Deyi Ji · Haiyang Wu · De Soh Soh · Jun Liu
|
ExHall D Poster #391 | |
R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization
Poster Session 3
Xudong Jiang · Fangjinhua Wang · Silvano Galliani · Christoph Vogel · Marc Pollefeys
|
ExHall D Poster #85 | |
PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning
Poster Session 2
Song Wang · Xiaolu Liu · Lingdong Kong · Jianyun Xu · Chunyong Hu · Gongfan Fang · Wentong Li · Jianke Zhu · Xinchao Wang
|
ExHall D Poster #118 | |
Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution
Poster Session 2
Shijun Shi · Jing Xu · Lijing Lu · Zhihang Li · Kai Hu
|
ExHall D Poster #193 | |
Shift the Lens: Environment-Aware Unsupervised Camouflaged Object Detection
Poster Session 4
Ji Du · Fangwei Hao · Mingyang Yu · Desheng Kong · Jiesheng Wu · Bin Wang · Jing XU · Ping Li
|
ExHall D Poster #331 | |
MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation
Poster Session 6
Yukang Lin · Hokit Fung · Jianjin Xu · Zeping Ren · Adela S.M. Lau · Guosheng Yin · Xiu Li
|
ExHall D Poster #4 | |
Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models
Poster Session 1
Hao Cheng · Erjia Xiao · Jiayan Yang · Jiahang Cao · Qiang Zhang · Jize Zhang · Kaidi Xu · Jindong Gu · Renjing Xu
|
ExHall D Poster #271 | |
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation
Poster Session 4
Haotong Lin · Sida Peng · Jingxiao Chen · Songyou Peng · Jiaming Sun · Minghuan Liu · Hujun Bao · Jiashi Feng · Xiaowei Zhou · Bingyi Kang
|
ExHall D Poster #120 | |
SACB-Net: Spatial-awareness Convolutions for Medical Image Registration
Poster Session 1
Xinxing Cheng · Tianyang Zhang · Wenqi Lu · Qingjie Meng · Alejandro F Frangi · Jinming Duan
|
ExHall D Poster #484 | |
LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting
Poster Session 1
Xiaoyan Xing · Konrad Groh · Sezer Karaoglu · Theo Gevers · Anand Bhattad
|
ExHall D Poster #26 | |
Reasoning to Attend: Try to Understand How <SEG> Token Works
Poster Session 5
Rui Qian · Xin Yin · Dejing Dou
|
ExHall D Poster #353 | |
DRiVE: Diffusion-based Rigging Empowers Generation of Versatile and Expressive Characters
Poster Session 5
Mingze Sun · Junting Dong · Junhao Chen · Yurun Chen · Xinyu Jiang · Shiwei Mao · Puhua Jiang · Jingbo Wang · Bo Dai · Ruqi Huang
|
ExHall D Poster #12 | |
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces
Chenyangguang Zhang · Alexandros Delitzas · Fangjinhua Wang · Ruida Zhang · Xiangyang Ji · Marc Pollefeys · Francis Engelmann
|
ExHall D Poster #343 | |
UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image
Poster Session 5
Xingyu Liu · Gu Wang · Ruida Zhang · Chenyangguang Zhang · Federico Tombari · Xiangyang Ji
|
ExHall D Poster #93 | |
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation
Poster Session 5
Ziqin Huang · Gu Wang · Chenyangguang Zhang · Ruida Zhang · Xiu Li · Xiangyang Ji
|
ExHall D Poster #96 | |
DocVLM: Make Your VLM an Efficient Reader
Poster Session 6
Mor Shpigel Nacson · Aviad Aberdam · Roy Ganz · Elad Ben Avraham · Alona Golts · Yair Kittenplon · Shai Mazor · Ron Litman
|
ExHall D Poster #274 | |
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Poster Session 2
Qirui Jiao · Daoyuan Chen · Yilun Huang · Bolin Ding · Yaliang Li · Ying Shen
|
ExHall D Poster #374 | |
DynPose: Largely Improving the Efficiency of Human Pose Estimation by a Simple Dynamic Framework
Poster Session 1
Yalong Xu · Lin Zhao · Chen Gong · Guangyu Li · Di Wang · Nannan Wang
|
ExHall D Poster #92 | |
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Poster Session 1
Wataru Shimoda · Naoto Inoue · Daichi Haraguchi · Hayato Mitani · Seiichi Uchida · Kota Yamaguchi
|
ExHall D Poster #248 | |
Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation
Poster Session 5
Rong Qin · Xingyu Liu · Jinglei Shi · Liang Lin · Jufeng Yang
|
ExHall D Poster #473 | |
PS-Diffusion: Photorealistic Subject-Driven Image Editing with Disentangled Control and Attention
Poster Session 4
Weicheng Wang · Guoli Jia · Zhongqi Zhang · Liang Lin · Jufeng Yang
|
ExHall D Poster #239 | |
No Pains, More Gains: Recycling Sub-Salient Patches for Efficient High-Resolution Image Recognition
Rong Qin · Xin Liu · Xingyu Liu · Jiaxuan Liu · Jinglei Shi · Liang Lin · Jufeng Yang
|
ExHall D Poster #413 | |
LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model
Poster Session 1
Xi Wang · Hongzhen Li · Heng Fang · YICHEN PENG · Haoran Xie · Xi Yang · Chuntao Li
|
ExHall D Poster #263 | |
Multi-Label Prototype Visual Spatial Search for Weakly Supervised Semantic Segmentation
Songsong Duan · Xi Yang · Nannan Wang
|
ExHall D Poster #392 | |
Detecting Open World Objects via Partial Attribute Assignment
Poster Session 4
Muli Yang · Gabriel James Goenawan · Huaiyuan Qin · Kai Han · Xi Peng · Yanhua Yang · Hongyuan Zhu
|
ExHall D Poster #430 | |
Multi-modal Contrastive Learning with Negative Sampling Calibration for Phenotypic Drug Discovery
Poster Session 6
Jiahua Rao · hanjing Lin · Leyu Chen · Jiancong Xie · Shuangjia Zheng · Yuedong Yang
|
ExHall D Poster #441 | |
Image Referenced Sketch Colorization Based on Animation Creation Workflow
Poster Session 5
Dingkun Yan · Xinrui Wang · Zhuoru Li · Suguru Saito · Yusuke Iwasawa · Yutaka Matsuo · Jiaxian Guo
|
ExHall D Poster #223 | |
The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation
Poster Session 1
Yanis Benidir · Nicolas Gonthier · Clement Mallet
|
ExHall D Poster #191 | |
SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception
Poster Session 1
Yaniv Benny · Lior Wolf
|
ExHall D Poster #72 | |
Improving Transferable Targeted Attacks with Feature Tuning Mixup
Poster Session 5
Kaisheng Liang · Xuelong Dai · Yanjie Li · Dong Wang · Bin Xiao
|
ExHall D Poster #457 | |
SET: Spectral Enhancement for Tiny Object Detection
Poster Session 1
Huixin Sun · Runqi Wang · Yanjing Li · Linlin Yang · Shaohui Lin · Xianbin Cao · Baochang Zhang
|
ExHall D Poster #435 | |
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
Shenghao Fu · Qize Yang · Qijie Mo · Junkai Yan · Xihan Wei · Jingke Meng · Xiaohua Xie · Wei-Shi Zheng
|
ExHall D Poster #415 | |
Towards Consistent Multi-Task Learning: Unlocking the Potential of Task-Specific Parameters
Poster Session 2
Xiaohan Qin · Xiaoxing Wang · Junchi Yan
|
ExHall D Poster #447 | |
Revisiting Fairness in Multitask Learning: A Performance-Driven Approach for Variance Reduction
Poster Session 4
Xiaohan Qin · Xiaoxing Wang · Junchi Yan
|
ExHall D Poster #446 | |
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
Poster Session 4
Xiaomeng Chu · Jiajun Deng · Guoliang You · Yifan Duan · Houqiang Li · Yanyong Zhang
|
ExHall D Poster #121 | |
OccMamba: Semantic Occupancy Prediction with State Space Models
Poster Session 3
Heng Li · Yuenan Hou · Xiaohan Xing · Yuexin Ma · Xiao Sun · Yanyong Zhang
|
ExHall D Poster #126 | |
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Poster Session 5
Yifang Men · Yuan Yao · Miaomiao Cui · Liefeng Bo
|
ExHall D Poster #13 | |
Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking
Poster Session 2
chaocan xue · Bineng Zhong · Qihua Liang · Yaozong Zheng · Ning Li · Yuanliang Xue · Shuxiang Song
|
ExHall D Poster #130 | |
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Poster Session 2
Andong Deng · Tongjia Chen · Shoubin Yu · Taojiannan Yang · Lincoln Spencer · Yapeng Tian · Ajmal Mian · Mohit Bansal · Chen Chen
|
ExHall D Poster #311 | |
TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution
Poster Session 5
linwei dong · Qingnan Fan · Yihong Guo · Zhonghao Wang · Qi Zhang · Jinwei Chen · Yawei Luo · Changqing Zou
|
ExHall D Poster #202 | |
Towards Universal Soccer Video Understanding
Poster Session 2
Jiayuan Rao · Haoning Wu · Hao Jiang · Ya Zhang · Yanfeng Wang · Weidi Xie
|
ExHall D Poster #288 | |
One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models
Poster Session 5
Senmao Li · Lei Wang · Kai Wang · Tao Liu · Jiehang Xie · Joost van de Weijer · Fahad Shahbaz Khan · Shiqi Yang · Yaxing Wang · Jian Yang
|
ExHall D Poster #240 | |
Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning
Poster Session 1
Xueyi Ke · Satoshi Tsutsui · Yayun Zhang · Bihan Wen
|
ExHall D Poster #401 | |
Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?
Poster Session 4
Yanbo Wang · Jiyang Guan · Jian Liang · Ran He
|
ExHall D Poster #388 | |
Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning
Poster Session 6
Jiuyang Dong · Junjun Jiang · Kui Jiang · Jiahan Li · Yongbing Zhang
|
ExHall D Poster #447 | |
3D Gaussian Inpainting with Depth-Guided Cross-View Consistency
Poster Session 6
Sheng-Yu Huang · Zi-Ting Chou · Yu-Chiang Frank Wang
|
ExHall D Poster #52 | |
UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing
Poster Session 3
Yung-Hsuan Lai · Janek Ebbers · Yu-Chiang Frank Wang · François Germain · Michael J. Jones · Moitreya Chatterjee
|
ExHall D Poster #278 | |
Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration
JUNSEONG KIM · GeonU Kim · Kim Yu-Ji · Yu-Chiang Frank Wang · Jaesung Choe · Tae-Hyun Oh
|
ExHall D Poster #334 | |
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
Poster Session 3
Junha Lee · Chunghyun Park · Jaesung Choe · Yu-Chiang Frank Wang · Jan Kautz · Minsu Cho · Chris Choy
|
ExHall D Poster #330 | |
Segment Anything, Even Occluded
Poster Session 6
Wei-En Tai · Yu-Lin Shih · Cheng Sun · Yu-Chiang Frank Wang · Hwann-Tzong Chen
|
ExHall D Poster #309 | |
Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering
Poster Session 4
Cheng Sun · Jaesung Choe · Charles Loop · Wei-Chiu Ma · Yu-Chiang Frank Wang
|
ExHall D Poster #32 | |
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
Poster Session 4
Chi-Pin Huang · Yen-Siang Wu · Hung-Kai Chung · Kai-Po Chang · Fu-En Yang · Yu-Chiang Frank Wang
|
ExHall D Poster #172 | |
Learning Endogenous Attention for Incremental Object Detection
Poster Session 6
Xiang Song · Yuhang He · Jingyuan Li · Qiang Wang · Yihong Gong
|
ExHall D Poster #403 | |
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
Poster Session 1
Miran Heo · Min-Hung Chen · De-An Huang · Sifei Liu · Subhashree Radhakrishnan · Seon Joo Kim · Yu-Chiang Frank Wang · Ryo Hachiuma
|
ExHall D Poster #357 | |
Ferret: An Efficient Online Continual Learning Framework under Varying Memory Constraints
Poster Session 1
Yuhao Zhou · Yuxin Tian · Jindi Lv · Mingjia Shi · Yuanxi Li · Qing Ye · Shuhao Zhang · Jiancheng Lv
|
ExHall D Poster #448 | |
Docopilot: Improving Multimodal Models for Document-Level Understanding
Poster Session 1
Yuchen Duan · Zhe Chen · Yusong Hu · Weiyun Wang · Shenglong Ye · Botian Shi · Lewei Lu · Qibin Hou · Tong Lu · Hongsheng Li · Jifeng Dai · Wenhai Wang
|
ExHall D Poster #367 | |
OmniStyle: Filtering High Quality Style Transfer Data at Scale
Poster Session 2
Ye Wang · Ruiqi Liu · Jiang Lin · Fei Liu · Zili Yi · Yilin Wang · Rui Ma
|
ExHall D Poster #237 | |
OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit
Benquan Wang · Ruyi An · Jin-Kyu So · Sergei Kurdiumov · Eng Aik Chan · Giorgio Adamo · Yuhan Peng · Yewen Li · Bo An
|
ExHall D Poster #23 | |
Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need
Poster Session 1
Qiang Wang · Xiang Song · Yuhang He · Jizhou Han · Chenhao Ding · Xinyuan Gao · Yihong Gong
|
ExHall D Poster #447 | |
Dynamic Integration of Task-Specific Adapters for Class Incremental Learning
Poster Session 6
Jiashuo Li · Shaokun Wang · Bo Qian · Yuhang He · Xing Wei · Qiang Wang · Yihong Gong
|
ExHall D Poster #421 | |
One-Step Event-Driven High-Speed Autofocus
Poster Session 2
Yuhan Bao · Shaohua Gao · Wenyong Li · Kaiwei Wang
|
ExHall D Poster #75 | |
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation
Poster Session 3
Zhuguanyu Wu · Shihe Wang · Jiayi Zhang · Jiaxin Chen · Yunhong Wang
|
ExHall D Poster #405 | |
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
Poster Session 2
Zhuguanyu Wu · Jiayi Zhang · Jiaxin Chen · Jinyang Guo · Di Huang · Yunhong Wang
|
ExHall D Poster #411 | |
3D-SLNR: A Super Lightweight Neural Representation for Large-scale 3D Mapping
Poster Session 6
Chenhui Shi · Fulin Tang · Ning An · Yihong Wu
|
ExHall D Poster #103 | |
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers
Poster Session 5
Daiqing Qi · Handong Zhao · Jing Shi · Simon Jenni · Yifei Fan · Franck Dernoncourt · Scott Cohen · Sheng Li
|
ExHall D Poster #361 | |
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
Xuanchi Ren · Tianchang Shen · Jiahui Huang · Huan Ling · Yifan Lu · Merlin Nimier-David · Thomas Müller · Alexander Keller · Sanja Fidler · Jun Gao
|
ExHall D Poster #65 | |
Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation
Poster Session 5
Yiftach Edelstein · Or Patashnik · Dana Cohen-Bar · Lihi Zelnik-Manor
|
ExHall D Poster #39 | |
Reference-Based 3D-Aware Image Editing with Triplanes
Bahri Batuhan Bilecen · Yiğit Yalın · Ning Yu · Aysegul Dundar
|
ExHall D Poster #44 | |
Structure-from-Motion with a Non-Parametric Camera Model
Yihan Wang · Linfei Pan · Marc Pollefeys · Viktor Larsson
|
ExHall D Poster #81 | |
MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting
Poster Session 4
jun huang · Ting Liu · Yihang Wu · Xiaochao Qu · Luoqi Liu · Xiaolin Hu
|
ExHall D Poster #241 | |
Animate and Sound an Image
Poster Session 5
Xihua Wang · Ruihua Song · Chongxuan Li · Xin Cheng · Boyuan Li · Yihan Wu · Yuyue Wang · Hongteng Xu · Yunfeng Wang
|
ExHall D Poster #221 | |
Argus: A Compact and Versatile Foundation Model for Vision
Poster Session 1
Weiming Zhuang · Chen Chen · Zhizhong Li · Sina Sajadmanesh · Jingtao Li · Jiabo Huang · Vikash Sehwag · Vivek Sharma · Hirotaka Shinozaki · Felan Carlo Garcia · Yihao Zhan · Naohiro Adachi · Ryoji Eki · Michael Spranger · Peter Stone · Lingjuan Lyu
|
ExHall D Poster #408 | |
Estimating Body and Hand Motion in an Ego-sensed World
Poster Session 2
Brent Yi · Vickie Ye · Maya Zheng · Yunqi Li · Lea Müller · Georgios Pavlakos · Yi Ma · Jitendra Malik · Angjoo Kanazawa
|
ExHall D Poster #164 | |
HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation
Poster Session 1
Yiming Liang · Tianhan Xu · Yuta Kikuchi
|
ExHall D Poster #67 | |
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models
Poster Session 3
Taha Koleilat · Hojat Asgariandehkordi · Hassan Rivaz · Yiming Xiao
|
ExHall D Poster #394 | |
EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision
Poster Session 6
Yiming Zhao · Taein Kwon · Paul Streli · Marc Pollefeys · Christian Holz
|
ExHall D Poster #149 | |
TFCustom: Customized Image Generation with Time-Aware Frequency Feature Guidance
Mushui Liu · Dong She · Qihan Huang · Jiacheng Ying · Wanggui He · Jingxuan Pang · Yuanlei Hou · Siming Fu
|
ExHall D Poster #244 | |
RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance
Poster Session 3
Yuheng Jiang · Zhehao Shen · Chengcheng Guo · Yu Hong · Zhuo Su · Yingliang Zhang · Marc Habermann · Lan Xu
|
ExHall D Poster #66 | |
Robust Multimodal Survival Prediction with Conditional Latent Differentiation Variational AutoEncoder
Poster Session 2
Junjie Zhou · Jiao Tang · Yingli Zuo · Peng Wan · Daoqiang Zhang · WEI SHAO
|
ExHall D Poster #477 | |
MotiF: Making Text Count in Image Animation with Motion Focal Loss
Poster Session 2
Shijie Wang · Samaneh Azadi · Rohit Girdhar · Sai Saketh Rambhatla · Chen Sun · Xi Yin
|
ExHall D Poster #230 | |
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
Poster Session 2
Jiayi Gao · Zijin Yin · Changcheng Hua · Yuxin Peng · Kongming Liang · Zhanyu Ma · Jun Guo · Yang Liu
|
ExHall D Poster #175 | |
Zero-Shot Monocular Scene Flow Estimation in the Wild
Poster Session 5
Yiqing Liang · Abhishek Badki · Hang Su · James Tompkin · Orazio Gallo
|
ExHall D Poster #165 | |
Domain Adaptive Diabetic Retinopathy Grading with Model Absence and Flowing Data
Poster Session 6
Wenxin Su · Song Tang · Xiaofeng Liu · Xiaojing Yi · Mao Ye · Chunxiao Zu · Jiahao Li · Xiatian Zhu
|
ExHall D Poster #207 | |
Learning Visual Composition through Improved Semantic Guidance
Poster Session 1
Austin Stone · Hagen Soltau · Robert Geirhos · Xi Yi · Ye Xia · Bingyi Cao · Kaifeng Chen · Abhijit Ogale · Jonathon Shlens
|
ExHall D Poster #340 | |
Dynamic Motion Blending for Versatile Motion Editing
Poster Session 5
Nan Jiang · Hongjie Li · Ziye Yuan · Zimo He · Yixin Chen · Tengyu Liu · Yixin Zhu · Siyuan Huang
|
ExHall D Poster #159 | |
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Poster Session 1
Meng Lou · Yizhou Yu
|
ExHall D Poster #395 | |
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Poster Session 4
Yunxiang Fu · Meng Lou · Yizhou Yu
|
ExHall D Poster #313 | |
BOE-ViT: Boosting Orientation Estimation with Equivariance in Self-Supervised 3D Subtomogram Alignment
Poster Session 6
Runmin Jiang · Jackson Daggett · Shriya Pingulkar · Yizhou Zhao · Priyanshu Dhingra · Daniel Brown · Qifeng Wu · Xiangrui Zeng · Xingjian Li · Min Xu
|
ExHall D Poster #306 | |
Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping
Poster Session 1
Guannan Lai · Yujie Li · Xiangkun Wang · Junbo Zhang · Tianrui Li · Xin Yang
|
ExHall D Poster #452 | |
VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow
Poster Session 4
Yancong Lin · Shiming Wang · Liangliang Nan · Julian F. P. Kooij · Holger Caesar
|
ExHall D Poster #128 | |
ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning
Poster Session 2
Kailin Li · Puhao Li · Tengyu Liu · Yuyang Li · Siyuan Huang
|
ExHall D Poster #155 | |
LOD-GS: Achieving Levels of Detail using Scalable Gaussian Soup
Poster Session 1
Jianxiong Shen · Yue Qian · Xiaohang Zhan
|
ExHall D Poster #47 | |
Embodied Scene Understanding for Vision Language Models via MetaVQA
Poster Session 5
Weizhen Wang · Chenda Duan · Zhenghao Peng · Yuxin Liu · Bolei Zhou
|
ExHall D Poster #133 | |
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models
Poster Session 6
Runhui Huang · Xinpeng Ding · Chunwei Wang · Jianhua Han · Yulong Liu · Hengshuang Zhao · Hang Xu · Lu Hou · Wei Zhang · Xiaodan Liang
|
ExHall D Poster #351 | |
Revisiting Generative Replay for Class Incremental Object Detection
Poster Session 4
Shizhou Zhang · Xueqiang Lv · Yinghui Xing · Qirui Wu · Di Xu · Yanning Zhang
|
ExHall D Poster #432 | |
Low-Biased General Annotated Dataset Generation
Poster Session 5
Dengyang Jiang · Haoyu Wang · Lei Zhang · Wei Wei · Guang Dai · Mengmeng Wang · Jingdong Wang · Yanning Zhang
|
ExHall D Poster #389 | |
DIFFER: Disentangling Identity Features via Semantic Cues for Clothes-Changing Person Re-ID
Poster Session 3
Xin Liang · Yogesh S. Rawat
|
ExHall D Poster #318 | |
StdGEN: Semantic-Decomposed 3D Character Generation from Single Images
Poster Session 6
Yuze He · Yanning Zhou · Wang Zhao · Zhongkai Wu · Kaiwen Xiao · Yang Wei · Yong-Jin Liu · Xiao Han
|
ExHall D Poster #15 | |
Make It Count: Text-to-Image Generation with an Accurate Number of Objects
Poster Session 3
Lital Binyamin · Yoad Tewel · Hilit Segev · Eran Hirsch · Royi Rassin · Gal Chechik
|
ExHall D Poster #247 | |
STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding
Poster Session 1
Aaryan Garg · Akash Kumar · Yogesh S. Rawat
|
ExHall D Poster #307 | |
HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding
Poster Session 2
Shehreen Azad · Vibhav Vineet · Yogesh S. Rawat
|
ExHall D Poster #303 | |
Latent Space Imaging
Poster Session 6
Matheus Souza · Yidan Zheng · Kaizhang Kang · Yogeshwar Nath Mishra · Qiang Fu · Wolfgang Heidrich
|
ExHall D Poster #203 | |
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model
Yue Han · Jiangning Zhang · Junwei Zhu · Runze Hou · Xiaozhong Ji · Chuming Lin · Xiaobin Hu · Xuezhucun Xue · Yong Liu
|
ExHall D Poster #359 | |
Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
Poster Session 5
Feng Zhou · Ruiyang Liu · chen liu · Gaofeng He · Yonglu Li · Xiaogang Jin · Huamin Wang
|
ExHall D Poster #257 | |
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Poster Session 6
Zixuan Chen · Jiaxin Li · Junxuan Liang · Liming Tan · Yejie Guo · Cewu Lu · Yonglu Li
|
ExHall D Poster #291 | |
SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers
Poster Session 2
Nikaan Nikzad · YI LIAO · Yongsheng Gao · Jun Zhou
|
ExHall D Poster #415 | |
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE
Poster Session 6
YONGWEI CHEN · Yushi Lan · Shangchen Zhou · Tengfei Wang · Xingang Pan
|
ExHall D Poster #210 | |
PersonaBooth: Personalized Text-to-Motion Generation
Poster Session 5
Boeun Kim · Hea In Jeong · JungHoon Sung · Yihua Cheng · Jeongmin Lee · Ju Yong Chang · Sang-Il Choi · YOUNGGEUN CHOI · Saim Shin · Jungho Kim · Hyung Jin Chang
|
ExHall D Poster #161 | |
CoMapGS: Covisibility Map-based Gaussian Splatting for Sparse Novel View Synthesis
Poster Session 6
Youngkyoon Jang · Eduardo Pérez-Pellitero
|
ExHall D Poster #61 | |
Recovering Dynamic 3D Sketches from Videos
Poster Session 3
Jaeah Lee · Changwoon Choi · Young Min Kim · Jaesik Park
|
ExHall D Poster #169 | |
Subnet-Aware Dynamic Supernet Training for Neural Architecture Search
Poster Session 6
Jeimin Jeon · Youngmin Oh · Junghyup Lee · Donghyeon Baek · Dohyung Kim · Chanho Eom · Bumsub Ham
|
ExHall D Poster #381 | |
SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning
Seokju Yun · Seunghye Chae · Dongheon Lee · Youngmin Ro
|
ExHall D Poster #436 | |
PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers
Poster Session 5
WooJu Lee · Juhye Park · Dasol Hong · Changki Sung · Youngwoo Seo · DongWan Kang · Hyun Myung
|
ExHall D Poster #89 | |
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks
Poster Session 4
Dongshuo Yin · Leiyi Hu · Bin Li · Youqun Zhang · Xue Yang
|
ExHall D Poster #407 | |
RivuletMLP: An MLP-based Architecture for Efficient Compressed Video Quality Enhancement
Poster Session 2
Gang He · Weiran Wang · Guancheng Quan · Shihao Wang · Dajiang Zhou · Yunsong Li
|
ExHall D Poster #189 | |
FedCS: Coreset Selection for Federated Learning
Poster Session 3
Chenhe Hao · Weiying Xie · Daixun Li · Haonan Qin · Hangyu Ye · Leyuan Fang · Yunsong Li
|
ExHall D Poster #458 | |
NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training
Poster Session 2
Dar-Yen Chen · Hmrishav Bandyopadhyay · Kai Zou · Yi-Zhe Song
|
ExHall D Poster #218 | |
SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models
Poster Session 1
Subhadeep Koley · Tapas Kumar Dutta · Aneeshan Sain · Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Yi-Zhe Song
|
ExHall D Poster #229 | |
Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch
Poster Session 6
Aneeshan Sain · Subhajit Maity · Pinaki Nath Chowdhury · Subhadeep Koley · Ayan Kumar Bhunia · Yi-Zhe Song
|
ExHall D Poster #211 | |
Dense Dispersed Structured Light for Hyperspectral 3D Imaging of Dynamic Scenes
Poster Session 4
Suhyun Shin · Seungwoo Yoon · Ryota Maeda · Seung-Hwan Baek
|
ExHall D Poster #72 | |
SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds
Poster Session 6
Jinfeng Xu · Xianzhi Li · Yuan Tang · Xu Han · Qiao Yu · yixue Hao · Long Hu · Min Chen
|
ExHall D Poster #109 | |
LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians
Poster Session 6
Jiamin WU · Kenkun Liu · Han Gao · Xiaoke Jiang · Yuan Yao · Lei Zhang
|
ExHall D Poster #46 | |
OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad
Poster Session 5
Luyao Tang · Chaoqi Chen · Yuxuan Yuan · Zeyu Zhang · Yue Huang · Kun Zhang
|
ExHall D Poster #418 | |
FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video
Poster Session 6
Yue Gao · Hong-Xing Yu · Bo Zhu · Jiajun Wu
|
ExHall D Poster #32 | |
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting
Poster Session 4
Hengyu Liu · Yuehao Wang · Chenxin Li · Ruisi Cai · Kevin Wang · Wuyang Li · Pavlo Molchanov · Peihao Wang · Zhangyang Wang
|
ExHall D Poster #47 | |
Learning Extremely High Density Crowds as Active Matters
Poster Session 1
Feixiang He · Jiangbei Yue · Jialin Zhu · Armin Seyfried · Dan Casas · Julien Pettré · He Wang
|
ExHall D Poster #35 | |
3D Student Splatting and Scooping
Poster Session 5
Jialin Zhu · Jiangbei Yue · Feixiang He · He Wang
|
ExHall D Poster #337 | |
Sim-to-Real Causal Transfer: A Metric Learning Approach to Causally-Aware Interaction Representations
Poster Session 4
Ahmad Rahimi · Po-Chien Luan · Yuejiang Liu · Frano Rajič · Alex Alahi
|
ExHall D Poster #139 | |
4D-Fly: Fast 4D Reconstruction from a Single Monocular Video
Poster Session 4
Diankun Wu · Fangfu Liu · Yi-Hsin Hung · Yue Qian · Xiaohang Zhan · Yueqi Duan
|
ExHall D Poster #79 | |
A Unified Framework for Heterogeneous Semi-supervised Learning
Poster Session 3
Marzi Heidari · Abdullah Alchihabi · Hao Yan · Yuhong Guo
|
ExHall D Poster #452 | |
Tartan IMU: A Light Foundation Model for Inertial Positioning in Robotics
Poster Session 5
Shibo Zhao · Sifan Zhou · Raphael Blanchard · Yuheng Qiu · Wenshan Wang · Sebastian Scherer
|
ExHall D Poster #139 | |
Yo’Chameleon: Personalized Vision and Language Generation
Poster Session 3
Thao Nguyen · Krishna Kumar Singh · Jing Shi · Trung Bui · Yong Jae Lee · Yuheng Li
|
ExHall D Poster #362 | |
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector
Poster Session 6
Zechuan Li · Hongshan Yu · Yihao Ding · Jinhao Qiao · Basim Azam · Naveed Akhtar
|
ExHall D Poster #101 | |
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Jiangtong Tan · Hu Yu · Jie Huang · Jie Xiao · Feng Zhao
|
ExHall D Poster #172 | |
METASCENES: Towards Automated Replica Creation for Real-world 3D Scans
Poster Session 1
Huangyue Yu · Baoxiong Jia · Yixin Chen · Yandan Yang · Puhao Li · Rongpeng Su · Jiaxin Li · Qing Li · Wei Liang · Song-Chun Zhu · Tengyu Liu · Siyuan Huang
|
ExHall D Poster #140 | |
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
Poster Session 6
Yuhui Zhang · Yuchang Su · Yiming Liu · Xiaohan Wang · James Burgess · Elaine Sui · Chenyu Wang · Josiah Aklilu · Alejandro Lozano · Anjiang Wei · Ludwig Schmidt · Serena Yeung
|
ExHall D Poster #327 | |
GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill
Poster Session 4
Jieming Cui · Tengyu Liu · Ziyu Meng · Jiale Yu · Ran Song · Wei Zhang · Yixin Zhu · Siyuan Huang
|
ExHall D Poster #145 | |
EdgeDiff: Edge-aware Diffusion Network for Building Reconstruction from Point Clouds
Poster Session 4
Yujun Liu · Ruisheng Wang · Shangfeng Huang · GuoRong Cai
|
ExHall D Poster #114 | |
SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes
Poster Session 5
Cheng-De Fan · Chen-Wei Chang · Yi-Ruei Liu · Jie-Ying Lee · Jiun-Long Huang · Yu-Chee Tseng · Yu-Lun Liu
|
ExHall D Poster #27 | |
BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation
Poster Session 6
Yulu Pan · Ce Zhang · Gedas Bertasius
|
ExHall D Poster #267 | |
AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting
Poster Session 4
Chung-Ho Wu · Yang-Jung Chen · Ying-Huan Chen · Jie-Ying Lee · Bo-Hsu Ke · Chun-Wei Tuan Mu · Yichuan Huang · Chin-Yang Lin · Min-Hung Chen · Yen-Yu Lin · Yu-Lun Liu
|
ExHall D Poster #50 | |
GIFStream: 4D Gaussian-based Immersive Video with Feature Stream
Poster Session 5
Hao Li · Sicheng Li · Xiang Gao · Abudouaihati Batuer · Lu Yu · Yiyi Liao
|
ExHall D Poster #67 | |
WISE: A Framework for Gigapixel Whole-Slide-Image Lossless Compression
Poster Session 6
Yu Mao · Jun Wang · Nan Guan · Chun Jason Xue
|
ExHall D Poster #305 | |
RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head Avatars
Linzhou Li · Yumeng Li · Yanlin Weng · Youyi Zheng · Kun Zhou
|
ExHall D Poster #9 | |
DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis
Poster Session 6
Yuming Gu · Phong Tran · Yujian Zheng · Hongyi Xu · Heyuan Li · Adilbek Karmanov · Hao Li
|
ExHall D Poster #6 | |
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration
Poster Session 3
Haina Qin · Wenyang Luo · Zewen Chen · Yufan Liu · Bing Li · Weiming Hu · libin wang · DanDan Zheng · Yuming Li
|
ExHall D Poster #202 | |
PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering
Poster Session 2
Yifan Gao · Zihang Lin · Chuanbin Liu · Min Zhou · Tiezheng Ge · Bo Zheng · Hongtao Xie
|
ExHall D Poster #259 | |
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Poster Session 2
Yunseok Jang · Yeda Song · Sungryull Sohn · Lajanugen Logeswaran · Tiange Luo · Dong-Ki Kim · GyungHoon Bae · Honglak Lee
|
ExHall D Poster #308 | |
Event Ellipsometer: Event-based Mueller-Matrix Video Imaging
Ryota Maeda · Yunseong Moon · Seung-Hwan Baek
|
ExHall D Poster #72 | |
Olympus: A Universal Task Router for Computer Vision Tasks
Yuanze Lin · Yunsheng Li · Dongdong Chen · Weijian Xu · Ronald Clark · Philip H.S. Torr
|
ExHall D Poster #343 | |
GPVK-VL: Geometry-Preserving Virtual Keyframes for Visual Localization under Large Viewpoint Changes
Poster Session 4
Yunxuan Li · Lei Fan · Xiaoying Xing · Jianxiong Zhou · Ying Wu
|
ExHall D Poster #86 | |
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Poster Session 3
Shenghai Yuan · Jinfa Huang · Xianyi He · Yunyang Ge · Yujun Shi · Liuhan Chen · Jiebo Luo · Li Yuan
|
ExHall D Poster #222 | |
RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing
Poster Session 1
Zhipeng Huang · Wangbo Yu · Xinhua Cheng · ChengShu Zhao · Yunyang Ge · Mingyi Guo · Li Yuan · Yonghong Tian
|
ExHall D Poster #38 | |
MBQ: Modality-Balanced Quantization for Large Vision-Language Models
Poster Session 1
Shiyao Li · Yingchun Hu · Xuefei Ning · Xihui Liu · Ke Hong · xiaotao jia · Xiuhong Li · Yaqi Yan · PEI RAN · Guohao Dai · Shengen Yan · Huazhong Yang · Yu Wang
|
ExHall D Poster #382 | |
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction
Poster Session 6
Zijian He · Yuwei Ning · Yipeng Qin · Guangrun Wang · Sibei Yang · Liang Lin · Guanbin Li
|
ExHall D Poster #19 | |
Associative Transformer
Poster Session 1
Yuwei Sun · Hideya Ochiai · Zhirong Wu · Stephen Lin · Ryota Kanai
|
ExHall D Poster #417 | |
Adaptive Part Learning for Fine-Grained Generalized Category Discovery: A Plug-and-Play Enhancement
Poster Session 5
Qiyuan Dai · Hanzhuo Huang · Yu Wu · Sibei Yang
|
ExHall D Poster #420 | |
Implicit Bias Injection Attacks against Text-to-Image Diffusion Models
Poster Session 6
Huayang Huang · Xiangye Jin · Jiaxu Miao · Yu Wu
|
ExHall D Poster #249 | |
D^3: Scaling Up Deepfake Detection by Learning from Discrepancy
Poster Session 5
Yongqi Yang · Zhihao Qian · Ye Zhu · Olga Russakovsky · Yu Wu
|
ExHall D Poster #271 | |
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
Poster Session 6
Jinhong Deng · Yuhang Yang · Wen Li · Lixin Duan
|
ExHall D Poster #365 | |
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
Poster Session 1
Matt Deitke · Christopher Clark · Sangho Lee · Rohun Tripathi · Yue Yang · Jae Sung Park · Reza Salehi · Niklas Muennighoff · Kyle Lo · Luca Soldaini · Jiasen Lu · Taira Anderson · Erin Bransom · Kiana Ehsani · Huong Ngo · Yen-Sung Chen · Ajay Patel · Mark Yatskar · Chris Callison-Burch · Andrew Head · Rose Hendrix · Favyen Bastani · Eli VanderBilt · Nathan Lambert · Yvonne Chou · Arnavi Chheda-Kothary · Jenna Sparks · Sam Skjonsberg · Michael Schmitz · Aaron Sarnat · Byron Bischoff · Pete Walsh · Christopher Newell · Piper Wolters · Tanmay Gupta · Kuo-Hao Zeng · Jon Borchardt · Dirk Groeneveld · Crystal Nam · Sophie Lebrecht · Caitlin Wittlif · Carissa Schoenick · Oscar Michel · Ranjay Krishna · Luca Weihs · Noah A. Smith · Hannaneh Hajishirzi · Ross Girshick · Ali Farhadi · Aniruddha Kembhavi
|
ExHall D Poster #370 | |
ATP: Adaptive Threshold Pruning for Efficient Data Encoding in Quantum Neural Networks
Poster Session 4
Mohamed Afane · Gabrielle Ebbrecht · Ying Wang · Juntao Chen · Junaid Farooq
|
ExHall D Poster #440 | |
Can Text-to-Video Generation help Video-Language Alignment?
Poster Session 5
Luca Zanella · Massimiliano Mancini · Willi Menapace · Sergey Tulyakov · Yiming Wang · Elisa Ricci
|
ExHall D Poster #294 | |
PerLA: Perceptive 3D Language Assistant
Poster Session 3
Guofeng Mei · Wei Lin · Luigi Riz · Yujiao Wu · Fabio Poiesi · Yiming Wang
|
ExHall D Poster #355 | |
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions
Sirui Xu · Hung Yu Ling · Yu-Xiong Wang · Liangyan Gui
|
ExHall D Poster #155 | |
Floating No More: Object-Ground Reconstruction from a Single Image
Poster Session 6
Yunze Man · Yichen Sheng · Jianming Zhang · Liangyan Gui · Yu-Xiong Wang
|
ExHall D Poster #94 | |
InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation
Poster Session 2
Sirui Xu · Dongting Li · Yucheng Zhang · Xiyan Xu · Qi Long · Ziyin Wang · Yunzhi Lu · Shuchang Dong · Hezi Jiang · Akshat Gupta · Yu-Xiong Wang · Liangyan Gui
|
ExHall D Poster #162 | |
Video-Bench: Human-Aligned Video Generation Benchmark
Poster Session 4
Hui Han · Siyuan Li · Jiaqi Chen · Yiwen Yuan · Yuling Wu · Yufan Deng · Chak Tou Leong · Hanwen Du · Junchen Fu · Youhua Li · Jie Zhang · Chi Zhang · Li-jia Li · Yongxin Ni
|
ExHall D Poster #293 | |
Symbolic Representation for Any-to-Any Generative Tasks
Poster Session 6
Jiaqi Chen · Xiaoye Zhu · Yue Wang · Tianyang Liu · Xinhui Chen · Ying Chen · Chak Tou Leong · Yifei Ke · Joseph Liu · Yiwen Yuan · Julian McAuley · Li-jia Li
|
ExHall D Poster #157 | |
Visual Agentic AI for Spatial Reasoning with a Dynamic API
Poster Session 4
Damiano Marsili · Rohun Agrawal · Yisong Yue · Georgia Gkioxari
|
ExHall D Poster #347 | |
Self-Evolving Visual Concept Library using Vision-Language Critics
Poster Session 3
Atharva Sehgal · Patrick Yuan · Ziniu Hu · Yisong Yue · Jennifer J. Sun · Swarat Chaudhuri
|
ExHall D Poster #236 | |
GenVDM: Generating Vector Displacement Maps From a Single Image
Yuezhi Yang · Qimin Chen · Vladimir G. Kim · Siddhartha Chaudhuri · Qixing Huang · Zhiqin Chen
|
ExHall D Poster #44 | |
NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks
Chenyi Zhang · Ting Liu · Xiaochao Qu · Luoqi Liu · Yao Zhao · Yunchao Wei
|
ExHall D Poster #340 | |
One-Minute Video Generation with Test-Time Training
Poster Session 4
Jiarui Xu · Shihao Han · Karan Dalal · Daniel Koceja · Yue Zhao · Ka Chun Cheung · Yejin Choi · Jan Kautz · Yu Sun · Xiaolong Wang
|
ExHall D Poster #181 | |
Distilling Long-tailed Datasets
Poster Session 6
Zhenghao Zhao · Haoxuan Wang · Yuzhang Shang · Kai Wang · Yan Yan
|
ExHall D Poster #427 | |
Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining
Poster Session 2
Guanglu Dong · Tianheng Zheng · Yuanzhouhan Cao · Linbo Qing · Chao Ren
|
ExHall D Poster #201 | |
GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling
Poster Session 4
Yang Zheng · Menglei Chai · Delio Vicini · Yuxiao Zhou · Yinghao Xu · Leonidas Guibas · Gordon Wetzstein · Thabo Beeler
|
ExHall D Poster #17 | |
AIpparel: A Multimodal Foundation Model for Digital Garments
Poster Session 2
Kiyohiro Nakayama · Jan Ackermann · Timur Levent Kesdogan · Yang Zheng · Maria Korosteleva · Olga Sorkine-Hornung · Leonidas Guibas · Guandao Yang · Gordon Wetzstein
|
ExHall D Poster #264 | |
A Dataset for Semantic Segmentation in the Presence of Unknowns
Poster Session 1
Zakaria Laskar · Tomas Vojir · Matej Grcic · Iaroslav Melekhov · Shankar Gangisetty · Juho Kannala · Jiri Matas · Giorgos Tolias · C.V. Jawahar
|
ExHall D Poster #119 | |
ILIAS: Instance-Level Image retrieval At Scale
Poster Session 3
Giorgos Kordopatis-Zilos · Vladan Stojnić · Anna Manko · Pavel Suma · Nikolaos-Antonios Ypsilantis · Nikos Efthymiadis · Zakaria Laskar · Jiri Matas · Ondrej Chum · Giorgos Tolias
|
ExHall D Poster #395 | |
Re-thinking Temporal Search for Long-Form Video Understanding
Poster Session 2
Jinhui Ye · Zihan Wang · Haosen Sun · Keshigeyan Chandrasegaran · Zane Durante · Cristobal Eyzaguirre · Yonatan Bisk · Juan Carlos Niebles · Ehsan Adeli · Li Fei-Fei · Jiajun Wu · Manling Li
|
ExHall D Poster #306 | |
3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting
Poster Session 6
Qi Wu · Janick Martinez Esturo · Ashkan Mirzaei · Nicolas Moënne-Loccoz · Žan Gojčič
|
ExHall D Poster #28 | |
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation
Poster Session 3
Weinan Jia · Mengqi Huang · Nan Chen · Lei Zhang · Zhendong Mao
|
ExHall D Poster #211 | |
Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?
Poster Session 6
Zebin You · Xinyu Zhang · Hanzhong Guo · Jingdong Wang · Chongxuan Li
|
ExHall D Poster #250 | |
Movie Weaver: Tuning-Free Multi-Concept Video Personalization with Anchored Prompts
Poster Session 3
Feng Liang · Haoyu Ma · Zecheng He · Tingbo Hou · Ji Hou · Kunpeng Li · Xiaoliang Dai · Felix Juefei-Xu · Samaneh Azadi · Animesh Sinha · Peizhao Zhang · Peter Vajda · Diana Marculescu
|
ExHall D Poster #238 | |
Beyond Human Perception: Understanding Multi-Object World from Monocular View
Poster Session 1
Keyu Guo · Yongle Huang · Shijie Sun · Xiangyu Song · Mingtao Feng · Zedong Liu · Huansheng Song · Tiantian Wang · Jianxin Li · Naveed Akhtar · Ajmal Mian
|
ExHall D Poster #341 | |
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment
Poster Session 4
Darshana Saravanan · Varun Gupta · Darshan Singh S · Zeeshan Khan · Vineet Gandhi · Makarand Tapaswi
|
ExHall D Poster #298 | |
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Poster Session 6
Jianzong Wu · Chao Tang · Jingbo Wang · Yanhong Zeng · Xiangtai Li · Yunhai Tong
|
ExHall D Poster #240 | |
Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models
Poster Session 3
Hao Ren · Yiming Zeng · Zetong Bi · Zhaoliang Wan · Junlong Huang · Hui Cheng
|
ExHall D Poster #140 | |
Unlocking Generalization Power in LiDAR Point Cloud Registration
Poster Session 5
Zhenxuan Zeng · Qiao Wu · Xiyu Zhang · Lin Yuanbo Wu · Pei An · Jiaqi Yang · Ji Wang · Peng Wang
|
ExHall D Poster #114 | |
DeepLA-Net: Very Deep Local Aggregation Networks for Point Cloud Analysis
Poster Session 1
Ziyin Zeng · Mingyue Dong · Jian Zhou · Huan Qiu · Zhen Dong · Man Luo · Bijun Li
|
ExHall D Poster #108 | |
Motion Prompting: Controlling Video Generation with Motion Trajectories
Poster Session 1
Daniel Geng · Charles Herrmann · Junhwa Hur · Forrester Cole · Serena Zhang · Tobias Pfaff · Tatiana Lopez-Guevara · Yusuf Aytar · Michael Rubinstein · Chen Sun · Oliver Wang · Andrew Owens · Deqing Sun
|
ExHall D Poster #173 | |
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Poster Session 3
Kunpeng Qiu · Zhiqiang Gao · Zhiying Zhou · MINGJIE SUN · Yongxin Guo
|
ExHall D Poster #481 | |
DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting
Poster Session 6
Liao Shen · Tianqi Liu · Huiqiang Sun · Jiaqi Li · Zhiguo Cao · Wei Li · Chen Change Loy
|
ExHall D Poster #26 | |
CH3Depth: Efficient and Flexible Depth Foundation Model with Flow Matching
Jiaqi Li · Yiran Wang · Jinghong Zheng · Junrui Zhang · Liao Shen · Tianqi Liu · Zhiguo Cao
|
ExHall D Poster #178 | |
TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion
Poster Session 3
Yiran Wang · Jiaqi Li · Chaoyi Hong · Ruibo Li · Liusheng Sun · Xiao Song · Zhe Wang · Zhiguo Cao · Guosheng Lin
|
ExHall D Poster #110 | |
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
Poster Session 1
Xiao Guo · Xiufeng Song · Yue Zhang · Xiaohong Liu · Xiaoming Liu
|
ExHall D Poster #381 | |
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Poster Session 5
Hanhui Wang · Yihua Zhang · Ruizheng Bai · Yue Zhao · Sijia Liu · Zhengzhong Tu
|
ExHall D Poster #267 | |
Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation
Xingguang Zhang · Nicholas M Chimitt · Xijun Wang · Yu Yuan · Stanley H. Chan
|
ExHall D Poster #184 | |
Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis
Yu Yuan · Xijun Wang · Yichen Sheng · Prateek Chennuri · Xingguang Zhang · Stanley H. Chan
|
ExHall D Poster #244 | |
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Models
Poster Session 4
Zhaochong An · Guolei Sun · Yun Liu · Runjia Li · Junlin Han · Ender Konukoglu · Serge Belongie
|
ExHall D Poster #113 | |
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models
Poster Session 5
Jinjin Zhang · qiuyu Huang · Junjie Liu · Xiefan Guo · Di Huang
|
ExHall D Poster #230 | |
Rethinking Spiking Self-Attention Mechanism: Implementing α-XNOR Similarity Calculation in Spiking Transformers
Poster Session 2
Yichen Xiao · Shuai Wang · Dehao Zhang · Wenjie Wei · Yimeng Shan · Xiaoli Liu · Yulin Jiang · Malu Zhang
|
ExHall D Poster #310 | |
GigaHands: A Massive Annotated Dataset of Bimanual Hand Activities
Rao Fu · Dingxi Zhang · Alex Jiang · Wanjia Fu · Austin Funk · Daniel Ritchie · Srinath Sridhar
|
ExHall D Poster #159 | |
Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction
Poster Session 5
Dong Li · Wenqi Zhong · Wei Yu · Yingwei Pan · Dingwen Zhang · Ting Yao · Junwei Han · Tao Mei
|
ExHall D Poster #151 | |
Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration
Poster Session 4
Haipeng Fang · Sheng Tang · Juan Cao · Enshuo Zhang · Fan Tang · Tong-yee Lee
|
ExHall D Poster #217 | |
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders
Poster Session 4
Rui Chen · Jianfeng Zhang · Yixun Liang · Guan Luo · Weiyu Li · Jiarui Liu · Xiu Li · Xiaoxiao Long · Jiashi Feng · Ping Tan
|
ExHall D Poster #38 | |
Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting
Poster Session 5
Maochen Yang · Zekun Li · Jian Zhang · Lei Qi · Yinghuan Shi
|
ExHall D Poster #326 | |
OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction
Poster Session 2
Gehui Li · Bin Chen · Chen Zhao · Lei Zhang · Jian Zhang
|
ExHall D Poster #202 | |
Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering
Poster Session 3
Yuanlin Wang · Yiyang Zhang · Ruiqin Xiong · Jing Zhao · Jian Zhang · Xiaopeng Fan · Tiejun Huang
|
ExHall D Poster #72 | |
Adversarial Diffusion Compression for Real-World Image Super-Resolution
Poster Session 6
Bin Chen · Gehui Li · Rongyuan Wu · Xindong Zhang · Jie Chen · Jian Zhang · Lei Zhang
|
ExHall D Poster #195 | |
Dataset Distillation with Neural Characteristic Function: A Minmax Perspective
Shaobo Wang · Yicun Yang · Zhiyuan Liu · Chenghao Sun · Xuming Hu · Conghui He · Linfeng Zhang
|
ExHall D Poster #433 | |
ProReflow: Progressive Reflow with Decomposed Velocity
Poster Session 6
Lei Ke · Haohang Xu · Xuefei Ning · Yu Li · Jiajun Li · Haoling Li · Yuxuan Lin · Dongsheng Jiang · Yujiu Yang · Linfeng Zhang
|
ExHall D Poster #177 | |
Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition
Poster Session 6
ZHANG LINTONG · Kang Yin · Seong-Whan Lee
|
ExHall D Poster #373 | |
Let's Verify and Reinforce Image Generation Step by Step
Poster Session 6
Renrui Zhang · Chengzhuo Tong · Zhizheng Zhao · Ziyu Guo · Haoquan Zhang · Manyuan Zhang · Jiaming Liu · Peng Gao · Hongsheng Li
|
ExHall D Poster #238 | |
Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis
Poster Session 5
Hua Yu · Weiming Liu · Gui Xu · Yaqing Hou · Yew-Soon Ong · Qiang Zhang
|
ExHall D Poster #158 | |
Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model
Poster Session 2
Shengjun Zhang · Jinzhao Li · Xin Fei · Hao Liu · Yueqi Duan
|
ExHall D Poster #62 | |
Temporal Separation with Entropy Regularization for Knowledge Distillation in Spiking Neural Networks
Poster Session 2
Kairong Yu · Chengting Yu · Tianqing Zhang · Xiaochen Zhao · Shu Yang · Hongwei Wang · Qiang Zhang · Qi Xu
|
ExHall D Poster #328 | |
Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts
Poster Session 2
Qizhou Chen · Chengyu Wang · Dakan Wang · Taolin Zhang · Wangyue Li · Xiaofeng He
|
ExHall D Poster #389 | |
ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
Poster Session 1
Junzhe Chen · Tianshu Zhang · Shiyu Huang · Yuwei Niu · Linfeng Zhang · Lijie Wen · Xuming Hu
|
ExHall D Poster #386 | |
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR
Poster Session 5
Xugong Qin · peng zhang · Jun Jie Ou Yang · Gangyan Zeng · Yubo Li · Yuanyuan Wang · Wanqian Zhang · Pengwen Dai
|
ExHall D Poster #367 | |
PICD: Versatile Perceptual Image Compression with Diffusion Rendering
Poster Session 6
Tongda Xu · Jiahao Li · Bin Li · Yan Wang · Ya-Qin Zhang · Yan Lu
|
ExHall D Poster #217 | |
TopNet: Transformer-Efficient Occupancy Prediction Network for Octree-Structured Point Cloud Geometry Compression
Poster Session 6
Xinjie Wang · Yifan Zhang · Ting Liu · Xinpu Liu · Ke Xu · Jianwei Wan · Yulan Guo · Hanyun Wang
|
ExHall D Poster #110 | |
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Poster Session 2
Yifei Zhang · Chang Liu · Jin Wei · Xiaomeng Yang · Yu ZHOU · Can Ma · Xiangyang Ji
|
ExHall D Poster #376 | |
Learning Person-Specific Animatable Face Models from In-the-Wild Images via a Shared Base Model
Poster Session 2
Yuxiang Mao · Zhenfeng Fan · Zhijie Zhang · Zhiheng Zhang · Shihong Xia
|
ExHall D Poster #15 | |
CoSER: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation
Bonan Li · Zicheng Zhang · Xingyi Yang · Xinchao Wang
|
ExHall D Poster #260 | |
CamPoint: Boosting Point Cloud Segmentation with Virtual Camera
Poster Session 3
Jianhui Zhang · Luo Yizhi · Zicheng Zhang · Xuecheng Nie · Bonan Li
|
ExHall D Poster #114 | |
Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection
Zihao Zhang · Aming Wu · Yahong Han
|
ExHall D Poster #342 | |
SerialGen: Personalized Image Generation by First Standardization Then Personalization
Poster Session 1
Cong Xie · Han Zou · Ruiqi Yu · Yan Zhang · Zhan Zhenpeng
|
ExHall D Poster #257 | |
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
Poster Session 3
Xin Wen · Bingchen Zhao · Yilun Chen · Jiangmiao Pang · Xiaojuan Qi
|
ExHall D Poster #144 | |
Towards Universal Dataset Distillation via Task-Driven Diffusion
Poster Session 3
Ding Qi · Jian Li · Junyao Gao · Shuguang Dou · Ying Tai · Jianlong Hu · Bo Zhao · Yabiao Wang · Chengjie Wang · Cai Rong Zhao
|
ExHall D Poster #262 | |
TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting
Poster Session 1
Bojun Xiong · Jialun Liu · JiaKui Hu · Chenming Wu · Jinbo Wu · Xing Liu · Chen Zhao · Errui Ding · Zhouhui Lian
|
ExHall D Poster #36 | |
TexGarment: Consistent Garment UV Texture Generation via Efficient 3D Structure-Guided Diffusion Transformer
Poster Session 6
Jialun Liu · Jinbo Wu · Xiaobo Gao · JiaKui Hu · Bojun Xiong · Xing Liu · Chen Zhao · Hongbin Pei · Haocheng Feng · Yingying Li · Errui Ding · Jingdong Wang
|
ExHall D Poster #39 | |
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
Poster Session 3
Xuan Shen · Weize Ma · Jing Liu · Changdi Yang · Rui Ding · Quanyi Wang · Henghui Ding · Wei Niu · Yanzhi Wang · Pu Zhao · Jun Lin · Jiuxiang Gu
|
ExHall D Poster #75 | |
GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration
Poster Session 4
Yuchen Sun · Shanhui Zhao · Tao Yu · Hao Wen · Samith Va · Mengwei Xu · Yuanchun Li · Chongyang Zhang
|
ExHall D Poster #350 | |
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis
Poster Session 3
Mengtian Li · Jinshu Chen · Wanquan Feng · Bingchuan Li · Fei Dai · Songtao Zhao · Qian HE
|
ExHall D Poster #235 | |
Learning from Neighbors: Category Extrapolation for Long-Tail Learning
Poster Session 6
Shizhen Zhao · Xin Wen · Jiahui Liu · Chuofan Ma · Chunfeng Yuan · Xiaojuan Qi
|
ExHall D Poster #415 | |
iSegMan: Interactive Segment-and-Manipulate 3D Gaussians
Poster Session 1
Yian Zhao · Wanshi Xu · Ruochong Zheng · Pengchong Qiao · Chang Liu · Jie Chen
|
ExHall D Poster #46 | |
MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing
Poster Session 2
Feifei Shao · Ping Liu · Zhao Wang · Yawei Luo · Hongwei Wang · Jun Xiao
|
ExHall D Poster #119 | |
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
Poster Session 2
Xin Zhang · Xue Yang · Yuxuan Li · Jian Yang · Ming-Ming Cheng · Xiang Li
|
ExHall D Poster #196 | |
Visual Persona: Foundation Model for Full-Body Human Customization
Poster Session 4
Jisu Nam · Soowon Son · Zhan Xu · Jing Shi · Difan Liu · Feng Liu · Seungryong Kim · Yang Zhou
|
ExHall D Poster #272 | |
Move-in-2D: 2D-Conditioned Human Motion Generation
Poster Session 5
Hsin-Ping Huang · Yang Zhou · Jui-Hsien Wang · Difan Liu · Feng Liu · Ming-Hsuan Yang · Zhan Xu
|
ExHall D Poster #162 | |
UHD-processer: Unified UHD Image Restoration with Progressive Frequency Learning and Degradation-aware Prompts
Poster Session 5
Yidi Liu · Dong Li · Xueyang Fu · Xin Lu · Jie Huang · Zheng-Jun Zha
|
ExHall D Poster #196 | |
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
Poster Session 4
Yawen Shao · Wei Zhai · Yuhang Yang · Hongchen Luo · Yang Cao · Zheng-Jun Zha
|
ExHall D Poster #147 | |
Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties
Poster Session 2
wenqiao Li · BoZhong Zheng · Xiaohao Xu · Jinye Gan · Fading Lu · Xiang Li · Na Ni · Zheng Tian · Xiaonan Huang · Shenghua Gao · Yingna Wu
|
ExHall D Poster #439 | |
Instruction-based Image Manipulation by Watching How Things Move
Mingdeng Cao · Xuaner Zhang · Yinqiang Zheng · Zhihao Xia
|
ExHall D Poster #243 | |
OSDFace: One-Step Diffusion Model for Face Restoration
Poster Session 3
Jingkai Wang · Jue Gong · Lin Zhang · Zheng Chen · Xing Liu · Hong Gu · Yutong Liu · Yulun Zhang · Xiaokang Yang
|
ExHall D Poster #189 | |
Splatter-360: Generalizable 360 Gaussian Splatting for Wide-baseline Panoramic Images
Poster Session 5
Zheng Chen · Chenming Wu · Zhelun Shen · Chen Zhao · Weicai Ye · Haocheng Feng · Errui Ding · Song-Hai Zhang
|
ExHall D Poster #51 | |
GauSTAR: Gaussian Surface Tracking and Reconstruction
Poster Session 4
Chengwei Zheng · Lixin Xue · Juan Jose Zarate · Jie Song
|
ExHall D Poster #67 | |
Panorama Generation From NFoV Image Done Right
Dian Zheng · Cheng Zhang · Xiao-Ming Wu · Cao Li · Chengfei Lv · Jian-Fang Hu · Wei-Shi Zheng
|
ExHall D Poster #53 | |
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
Poster Session 1
Zhen Lv · Yangqi Long · Congzhentao Huang · Cao Li · Chengfei Lv · Hao Ren · Dian Zheng
|
ExHall D Poster #60 | |
Robust Message Embedding via Attention Flow-Based Steganography
Poster Session 3
Huayuan Ye · Shenzhuo Zhang · Shiqi Jiang · Jing Liao · Shuhang Gu · Dejun Zheng · Changbo Wang · Chenhui Li
|
ExHall D Poster #209 | |
Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera
Zhengdi Yu · Stefanos Zafeiriou · Tolga Birdal
|
ExHall D Poster #148 | |
Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors
Poster Session 4
Zhengfei Kuang · Tianyuan Zhang · Kai Zhang · Hao Tan · Sai Bi · Yiwei Hu · Zexiang Xu · Milos Hasan · Gordon Wetzstein · Fujun Luan
|
ExHall D Poster #177 | |
ADU: Adaptive Detection of Unknown Categories in Black-Box Domain Adaptation
Poster Session 6
Yushan Lai · Guowen Li · Haoyuan Liang · Juepeng Zheng · Zhiyu Ye
|
ExHall D Poster #425 | |
LiVOS: Light Video Object Segmentation with Gated Linear Matching
Poster Session 2
Qin Liu · Jianfeng Wang · Zhengyuan Yang · Linjie Li · Kevin Lin · Marc Niethammer · Lijuan Wang
|
ExHall D Poster #315 | |
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Poster Session 4
Kevin Qinghong Lin · Linjie Li · Difei Gao · Zhengyuan Yang · Shiwei Wu · Zechen Bai · Stan Weixian Lei · Lijuan Wang · Mike Zheng Shou
|
ExHall D Poster #352 | |
LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate
Haoyan Gong · Zhenrong Zhang · Yuzheng Feng · Anh Nguyen · Hongbin Liu
|
ExHall D Poster #193 | |
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver
Poster Session 2
Cong Wei · Haoxian Tan · Yujie Zhong · Yong Liu · Jie Hu · Dengjie Li · Zheng Zhao · Yujiu Yang
|
ExHall D Poster #341 | |
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
Poster Session 1
Chaojun Ni · Guosheng Zhao · Xiaofeng Wang · Zheng Zhu · Wenkang Qin · Guan Huang · Chen Liu · Yuyin Chen · Yida Wang · Xueyang Zhang · Yifei Zhan · Kun Zhan · Peng Jia · XianPeng Lang · Xingang Wang · Wenjun Mei
|
ExHall D Poster #130 | |
Interactive Medical Image Analysis with Concept-based Similarity Reasoning
Poster Session 6
Ta Duc Huy · Sen Kim Tran · Phan Nguyen · Nguyen Hoang Tran · Tran Bao Sam · Anton van den Hengel · Zhibin Liao · Johan Verjans · Minh-Son To · Vu Minh Hieu Phan
|
ExHall D Poster #445 | |
TAGA: Self-supervised Learning for Template-free Animatable Gaussian Articulated Model
Poster Session 5
Zhichao Zhai · Guikun Chen · Wenguan Wang · Dong Zheng · Jun Xiao
|
ExHall D Poster #11 | |
Rashomon Sets for Prototypical-Part Networks: Editing Interpretable Models in Real-Time
Poster Session 1
Jon Donnelly · Zhicheng Guo · Alina Jade Barnett · Hayden McTavish · Chaofan Chen · Cynthia Rudin
|
ExHall D Poster #418 | |
Layered Image Vectorization via Semantic Simplification
Poster Session 2
Zhenyu Wang · Jianxi Huang · Zhida Sun · Yuanhao Gong · Daniel Cohen-Or · Min Lu
|
ExHall D Poster #226 | |
RigGS: Rigging of 3D Gaussians for Modeling Articulated Objects in Videos
Poster Session 2
Yuxin Yao · Zhi Deng · Junhui Hou
|
ExHall D Poster #14 | |
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Poster Session 5
Shihao Wang · Zhiding Yu · Xiaohui Jiang · Shiyi Lan · Min Shi · Nadine Chang · Jan Kautz · Ying Li · Jose M. Alvarez
|
ExHall D Poster #132 | |
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
Poster Session 6
Kai Wang · Zekai Li · Zhi-Qi Cheng · Samir Khaki · Ahmad Sajedi · Ramakrishna Vedantam · Konstantinos N. Plataniotis · Alexander G. Hauptmann · Yang You
|
ExHall D Poster #412 | |
CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model
Poster Session 4
Ziyu Yao · Xuxin Cheng · Zhiqi Huang · Lei Li
|
ExHall D Poster #319 | |
Words or Vision: Do Vision-Language Models Have Blind Faith in Text?
Poster Session 1
Ailin Deng · Tri Cao · Zhirui Chen · Bryan Hooi
|
ExHall D Poster #352 | |
AutoPresent: Designing Structured Visuals from Scratch
Poster Session 1
Jiaxin Ge · Zora Zhiruo Wang · Xuhui Zhou · Yi-Hao Peng · Sanjay Subramanian · Qinyue Tan · Maarten Sap · Alane Suhr · Daniel Fried · Graham Neubig · Trevor Darrell
|
ExHall D Poster #262 | |
ID-Patch: Robust ID Association for Group Photo Personalization
Poster Session 1
Yimeng Zhang · Tiancheng Zhi · Jing Liu · Shen Sang · Liming Jiang · Qing Yan · Sijia Liu · Linjie Luo
|
ExHall D Poster #270 | |
COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection
Poster Session 6
Jinqi Xiao · Shen Sang · Tiancheng Zhi · Jing Liu · Qing Yan · Linjie Luo · Bo Yuan
|
ExHall D Poster #379 | |
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
Poster Session 3
Hongjia Zhai · Hai Li · Zhenzhe Li · Xiaokun Pan · Yijia He · Guofeng Zhang
|
ExHall D Poster #332 | |
Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition
Zheda Mai · Ping Zhang · Cheng-Hao Tu · Hong-You Chen · Quang-Huy Nguyen · Li Zhang · Wei-Lun Chao
|
ExHall D Poster #401 | |
Hierarchical Adaptive Filtering Network for Text Image Specular Highlight Removal
Poster Session 1
Zhi Jiang · Jingbo Hu · Ling Zhang · Gang Fu · Chunxia Xiao
|
ExHall D Poster #211 | |
FlexUOD: The Answer to Real-world Unsupervised Image Outlier Detection
Poster Session 3
Zhonghang Liu · Kun Zhou · Changshuo Wang · Daniel Lin · Jiangbo Lu
|
ExHall D Poster #434 | |
Point Clouds Meets Physics: Dynamic Acoustic Field Fitting Network for Point Cloud Understanding
Poster Session 5
Changshuo Wang · Shuting He · Xiang Fang · Jiawei Han · Zhonghang Liu · Xin Ning · Weijun Li · Prayag Tiwari
|
ExHall D Poster #108 | |
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
Poster Session 6
Yiyu Zhuang · Jiaxi Lv · Hao Wen · Qing Shuai · Ailing Zeng · Hao Zhu · Shifeng Chen · Yujiu Yang · Xun Cao · Wei Liu
|
ExHall D Poster #10 | |
Volume Tells: Dual Cycle-Consistent Diffusion for 3D Fluorescence Microscopy De-noising and Super-Resolution
ZELIN LI · Chenwei Wang · Zhaoke Huang · Centre for Intelligent Multidimensional Data Analysis · Hong Kong Baptist University · Hong Kong Baptist University · Hong Kong Baptist University
|
ExHall D Poster #23 | |
DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness
Yiming Zhong · Qi Jiang · Jingyi Yu · Yuexin Ma
|
ExHall D Poster #145 | |
MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks
Poster Session 1
Yifei Liu · Zhihang Zhong · Yifan Zhan · Sheng Xu · Xiao Sun
|
ExHall D Poster #48 | |
Unboxed: Geometrically and Temporally Consistent Video Outpainting
Poster Session 2
Zhongrui Yu · Martina Megaro-Boldini · Robert Sumner · Abdelaziz Djelouah
|
ExHall D Poster #186 | |
Less is More: Efficient Model Merging with Binary Task Switch
Biqing Qi · Fangyuan Li · Zhen Wang · Junqi Gao · Dong Li · Peng Ye · Bowen Zhou
|
ExHall D Poster #442 | |
FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity
Poster Session 3
Jinxi Li · Ziyang Song · Siyuan Zhou · Bo Yang
|
ExHall D Poster #170 | |
KVQ: Boosting Video Quality Assessment via Saliency-guided Local Perception
Poster Session 1
Yunpeng Qu · Kun Yuan · Qizhi Xie · Ming Sun · Chao Zhou · Jian Wang
|
ExHall D Poster #186 | |
PIDSR: Complementary Polarized Image Demosaicing and Super-Resolution
Poster Session 4
Shuangfan Zhou · Chu Zhou · Youwei Lyu · Heng Guo · Zhanyu Ma · Boxin Shi · Imari Sato
|
ExHall D Poster #21 | |
Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining
Poster Session 6
Shangquan Sun · Wenqi Ren · Juxiang Zhou · Shu Wang · Jianhou Gan · Xiaochun Cao
|
ExHall D Poster #188 | |
Towards Open-Vocabulary Audio-Visual Event Localization
Poster Session 2
Jinxing Zhou · Dan Guo · Ruohao Guo · Yuxin Mao · Jingjing Hu · Yiran Zhong · Xiaojun Chang · Meng Wang
|
ExHall D Poster #286 | |
Audio-Visual Instance Segmentation
Poster Session 3
Ruohao Guo · Xianghua Ying · Yaru Chen · Dantong Niu · Guangyao Li · Liao Qu · Yanyu Qi · Jinxing Zhou · Bowei Xing · Wenzhen Yue · Ji Shi · Qixun Wang · Peiliang Zhang · Buwen Liang
|
ExHall D Poster #277 | |
Improving the Transferability of Adversarial Attacks on Face Recognition with Diverse Parameters Augmentation
Poster Session 1
Fengfan Zhou · Bangjie Yin · Hefei Ling · Qianyu Zhou · Wenxuan Wang
|
ExHall D Poster #319 | |
MP-GUI: Modality Perception with MLLMs for GUI Understanding
Poster Session 6
Ziwei Wang · Weizhi Chen · Leyang Yang · Sheng Zhou · Shengchu Zhao · Hanbei Zhan · Jiongchao Jin · Liangcheng Li · Zirui Shao · Jiajun Bu
|
ExHall D Poster #342 | |
A Polarization-Aided Transformer for Image Deblurring via Motion Vector Decomposition
Duosheng Chen · Shihao Zhou · Jinshan Pan · Jinglei Shi · lishen qu · Jufeng Yang
|
ExHall D Poster #180 | |
Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
Poster Session 4
Chong Bao · Xiyu Zhang · Zehao Yu · Jiale Shi · Guofeng Zhang · Songyou Peng · Zhaopeng Cui
|
ExHall D Poster #51 | |
ACAttack: Adaptive Cross Attacking RGB-T Tracker via Multi-Modal Response Decoupling
Poster Session 5
Xinyu Xiang · Qinglong Yan · HAO ZHANG · Jiayi Ma
|
ExHall D Poster #100 | |
IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification
Poster Session 6
Yuhao Wang · Yongfeng Lv · Pingping Zhang · Huchuan Lu
|
ExHall D Poster #341 | |
CL-LoRA: Continual Low-Rank Adaptation for Rehearsal-Free Class-Incremental Learning
Poster Session 6
Jiangpeng He · Zhihao Duan · Fengqing Zhu
|
ExHall D Poster #420 | |
MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation
Jinnan Chen · Lingting Zhu · Zeyu HU · Shengju Qian · Yugang Chen · Xin Wang · Gim Hee Lee
|
ExHall D Poster #41 | |
ProAPO: Progressively Automatic Prompt Optimization for Visual Classification
Poster Session 5
Xiangyan Qu · Gaopeng Gou · Jiamin Zhuang · Jing Yu · Kun Song · Qihao Wang · Yili Li · Gang Xiong
|
ExHall D Poster #392 | |
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval
Poster Session 5
Yuanmin Tang · Jing Yu · Keke Gai · Jiamin Zhuang · Gang Xiong · Gaopeng Gou · Qi Wu
|
ExHall D Poster #359 | |
AdaDARE-gamma: Balancing Stability and Plasticity in Multi-modal LLMs through Efficient Adaptation
Poster Session 4
Jingyi Xie · Jintao Yang · Zhunchen Luo · Yunbo Cao · Qiang Gao · Mengyuan Zhang · Wenpeng Hu
|
ExHall D Poster #377 | |
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
Poster Session 1
Zhenxing Zhang · Yaxiong Wang · Lechao Cheng · Zhun Zhong · Dan Guo · Meng Wang
|
ExHall D Poster #365 | |
Feature Spectrum Learning for Remote Sensing Change Detection
Poster Session 3
Qi Zang · Dong Zhao · Shuang Wang · Dou Quan · Licheng Jiao · Zhun Zhong
|
ExHall D Poster #191 | |
FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation
Poster Session 3
Dong Zhao · Jinlong Li · Shuang Wang · Mengyao Wu · Qi Zang · Nicu Sebe · Zhun Zhong
|
ExHall D Poster #421 | |
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Poster Session 6
Seung Hyun Lee · Jijun jiang · Yiran Xu · Zhuofang Li · Junjie Ke · Yinxiao Li · Junfeng He · Steven Hickson · Katie Datsenko · Sangpil Kim · Ming-Hsuan Yang · Irfan Essa · Feng Yang
|
ExHall D Poster #369 | |
HumanMM: Global Human Motion Recovery from Multi-shot Videos
Poster Session 1
Yuhong Zhang · Guanlin Wu · Ling-Hao Chen · Zhuokai Zhao · Jing Lin · Xiaoke Jiang · Jiamin WU · Zhuoheng Li · Hao Frank Yang · Haoqian Wang · Lei Zhang
|
ExHall D Poster #167 | |
Easy-editable Image Vectorization with Multi-layer Multi-scale Distributed Visual Feature Embedding
Poster Session 5
Ye Chen · Zhangli Hu · Zhongyin Zhao · Yupeng Zhu · Yue Shi · Yuxuan Xiong · Bingbing Ni
|
ExHall D Poster #219 | |
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
Poster Session 6
Yixuan Zhu · Haolin Wang · Shilin Ma · Wenliang Zhao · Yansong Tang · Lei Chen · Jie Zhou
|
ExHall D Poster #216 | |
EntityErasure: Erasing Entity Cleanly via Amodal Entity Segmentation and Completion
Poster Session 6
Yixing Zhu · Qing Zhang · Yitong Wang · Yongwei Nie · Wei-Shi Zheng
|
ExHall D Poster #201 | |
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
Poster Session 3
Jianhao Zheng · Zihan Zhu · Valentin Bieri · Marc Pollefeys · Songyou Peng · Iro Armeni
|
ExHall D Poster #76 | |
EntropyMark: Towards More Harmless Backdoor Watermark via Entropy-based Constraint for Open-source Dataset Copyright Protection
Poster Session 6
Ming Sun · Rui Wang · Zixuan Zhu · Lihua Jing · Yuanfang Guo
|
ExHall D Poster #435 | |
Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding
Poster Session 3
Yan Wang · Baoxiong Jia · Ziyu Zhu · Siyuan Huang
|
ExHall D Poster #333 | |
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
Poster Session 5
Jiangyong Huang · Baoxiong Jia · Yan Wang · Ziyu Zhu · Xiongkun Linghu · Qing Li · Song-Chun Zhu · Siyuan Huang
|
ExHall D Poster #339 | |
VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction
Poster Session 2
Ziyue Zhu · Shenlong Wang · Jin Xie · Jiang-Jiang Liu · Jingdong Wang · Jian Yang
|
ExHall D Poster #133 | |
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting
Poster Session 3
Shaofei Cai · Zihao Wang · Kewei Lian · Zhancun Mu · Xiaojian Ma · Anji Liu · Yitao Liang
|
ExHall D Poster #142 | |
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
Poster Session 5
Zigeng Chen · Xinyin Ma · Gongfan Fang · Xinchao Wang
|
ExHall D Poster #218 | |
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better
Zihang Lai · Andrea Vedaldi
|
ExHall D Poster #167 | |
FedSPA: Generalizable Federated Graph Learning under Homophily Heterogeneity
Poster Session 3
Zihan Tan · Guancheng Wan · Wenke Huang · Guibin Zhang · He Li · Carl Yang · Mang Ye
|
ExHall D Poster #461 | |
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Chaoyou Fu · Yuhan Dai · Yongdong Luo · Lei Li · Shuhuai Ren · Renrui Zhang · Zihan Wang · Chenyu Zhou · Yunhang Shen · Mengdan Zhang · Peixian Chen · Yanwei Li · Shaohui Lin · Sirui Zhao · Ke Li · Tong Xu · Xiawu Zheng · Enhong Chen · Caifeng Shan · Ran He · Xing Sun
|
ExHall D Poster #295 | |
Frequency-Biased Synergistic Design for Image Compression and Compensation
Poster Session 3
Jiaming Liu · Qi Zheng · Zihao Liu · Yilian Zhong · Peiye Liu · Tao Liu · Shusong Xu · Yanheng Lu · Sicheng Li · Dimin Niu · Yibo Fan
|
ExHall D Poster #207 | |
Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry
Poster Session 2
Rui Wang · Shaocheng Jin · Ziheng Chen · Xiaoqing Luo · Xiaojun Wu
|
ExHall D Poster #279 | |
Rethinking Reconstruction and Denoising in the Dark: New Perspective, General Architecture and Beyond
Poster Session 1
Long Ma · Tengyu Ma · Ziye Li · Yuetong Wang · Jinyuan Liu · Chengpei Xu · Risheng Liu
|
ExHall D Poster #203 | |
R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner
Poster Session 4
Ziyi Bai · Hanxuan Li · Bin Fu · Chuyan Xiong · Ruiping Wang · Xilin Chen
|
ExHall D Poster #348 | |
Advancing Adversarial Robustness in GNeRFs: The IL2-NeRF Attack
Poster Session 4
Nicole Meng · Caleb Manicke · Ronak Sahu · Caiwen Ding · Yingjie Lao
|
ExHall D Poster #52 | |
Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement
Poster Session 3
Xinjie Li · Ziyi Chen · Xinlu Yu · Iek-Heng Chu · Peng Chang · Jing Xiao
|
ExHall D Poster #69 | |
Seeing the Abstract: Translating the Abstract Language for Vision Language Models
Poster Session 2
Davide Talon · Federico Girella · Ziyue Liu · Marco Cristani · Yiming Wang
|
ExHall D Poster #370 | |
Mind the Time: Temporally-Controlled Multi-Event Video Generation
Poster Session 5
Ziyi Wu · Aliaksandr Siarohin · Willi Menapace · Ivan Skorokhodov · Yuwei Fang · Varnith Chordia · Igor Gilitschenski · Sergey Tulyakov
|
ExHall D Poster #284 | |
VISTA3D: A Unified Segmentation Foundation Model For 3D Medical Imaging
Poster Session 4
Yufan He · Pengfei Guo · Yucheng Tang · Andriy Myronenko · Vishwesh Nath · Ziyue Xu · Dong Yang · Can Zhao · Benjamin D. Simon · Mason Belue · Stephanie Anne Harmon · Baris Turkbey · Daguang Xu · Wenqi Li
|
ExHall D Poster #481 | |
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Vishwesh Nath · Wenqi Li · Dong Yang · Andriy Myronenko · Yao Lu · Zhijian Liu · Danny Yin · Yucheng Tang · Pengfei Guo · Ziyue Xu · Can Zhao · Yufan He · Greg Heinrich · Mingxin Zheng · Benjamin D. Simon · Stephanie Anne Harmon · Michael Zephyr · Marc Edgar · Stephen R. Aylward · Pavlo Molchanov · Yan Mee LAW · Baris Turkbey · Holger R. Roth · Daguang Xu
|
ExHall D Poster #396 | |
AvatarArtist: Open-Domain 4D Avatarization
Poster Session 3
Hongyu Liu · Xuan Wang · Ziyu Wan · Yue Ma · Jingye Chen · Yanbo Fan · Yujun Shen · Yibing Song · Qifeng Chen
|
ExHall D Poster #10 | |
Plug-and-Play Versatile Compressed Video Enhancement
Poster Session 4
Huimin Zeng · Jiacheng Li · Zhiwei Xiong
|
ExHall D Poster #187 | |
Evaluating Vision-Language Models as Evaluators in Path Planning
Poster Session 2
Mohamed Aghzal · Xiang Yue · Erion Plaku · Ziyu Yao
|
ExHall D Poster #145 | |
DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation
Poster Session 5
Ziyu Zhao · Xiaoguang Li · Lingjia Shi · Nasrin Imanpour · Song Wang
|
ExHall D Poster #411 | |
When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach
Poster Session 1
Vaibhav Rathore · Shubhranil B · Saikat Dutta · Sarthak Mehrotra · Zsolt Kira · Biplab Banerjee
|
ExHall D Poster #453 | |
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons
Poster Session 3
Andrew Szot · Bogdan Mazoure · Omar Attia · Aleksei Timofeev · Harsh Agrawal · R Devon Hjelm · Zhe Gan · Zsolt Kira · Alexander Toshev
|
ExHall D Poster #329 | |
OSV: One Step is Enough for High-Quality Image to Video Generation
Poster Session 3
Xiaofeng Mao · Zhengkai Jiang · Fu-Yun Wang · Jiangning Zhang · Hao Chen · Mingmin Chi · Yabiao Wang · Wenhan Luo
|
ExHall D Poster #185 | |
Adapting Dense Matching for Homography Estimation with Grid-based Acceleration
Poster Session 2
Kaining Zhang · Yuxin Deng · Jiayi Ma · Paolo Favaro
|
ExHall D Poster #84 | |
Assessing and Learning Alignment of Unimodal Vision and Language Models
Le Zhang · Qian Yang · Aishwarya Agrawal
|
ExHall D Poster #379 | |
MangaNinja: Line Art Colorization with Precise Reference Following
Zhiheng Liu · Ka Leong Cheng · Xi Chen · Jie Xiao · Hao Ouyang · Kai Zhu · Yu Liu · Yujun Shen · Qifeng Chen · Ping Luo
|
ExHall D Poster #21 | |
Optimal Transport-Guided Source-Free Adaptation for Face Anti-Spoofing
Poster Session 5
Zhuowei Li · Tianchen Zhao · Xiang Xu · Zheng Zhang · Zhihua Li · Xuanbai Chen · Qin ZHANG · Alessandro Bergamo · Anil Kumar Jain · Yifan Xing
|
ExHall D Poster #318 | |
Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans?
Poster Session 2
Renshuai Tao · Haoyu Wang · Yuzhe Guo · Hairong Chen · Li Zhang · Xianglong Liu · Yunchao Wei · Yao Zhao
|
ExHall D Poster #473 | |
MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis
Poster Session 6
Tianyu Wang · Jianming Zhang · Haitian Zheng · Zhihong Ding · Scott Cohen · Zhe Lin · Wei Xiong · Chi-Wing Fu · Luis Figueroa · Soo Ye Kim
|
ExHall D Poster #199 | |
ObjectMover: Generative Object Movement with Video Prior
Poster Session 4
Xin Yu · Tianyu Wang · Soo Ye Kim · Paul Guerrero · Xi Chen · Qing Liu · Zhe Lin · Xiaojuan Qi
|
ExHall D Poster #179 | |
Generative Image Layer Decomposition with Visual Effects
Poster Session 2
Jinrui Yang · Qing Liu · Yijun Li · Soo Ye Kim · Daniil Pakhomov · Mengwei Ren · Jianming Zhang · Zhe Lin · Cihang Xie · Yuyin Zhou
|
ExHall D Poster #217 | |
Apollo: An Exploration of Video Understanding in Large Multimodal Models
Poster Session 4
Orr Zohar · Xiaohan Wang · Yann Dubois · Nikhil Mehta · Tong Xiao · Philippe Hansen-Estruch · Licheng Yu · Xiaofang Wang · Felix Juefei-Xu · Ning Zhang · Serena Yeung · Xide Xia
|
ExHall D Poster #296 | |
Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Poster Session 5
Zeyi Huang · Yuyang Ji · Xiaofang Wang · Nikhil Mehta · Tong Xiao · Donghyun Lee · Sigmund VanValkenburgh · Shengxin Zha · Bolin Lai · Licheng Yu · Ning Zhang · Yong Jae Lee · Miao Liu
|
ExHall D Poster #301 | |
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
Poster Session 2
Antoni Bigata Casademunt · Michał Stypułkowski · Rodrigo Mira · Stella Bounareli · Konstantinos Vougioukas · Zoe Landgraf · Nikita Drobyshev · Maciej Zieba · Stavros Petridis · Maja Pantic
|
ExHall D Poster #3 | |
IRIS: Inverse Rendering of Indoor Scenes from Low Dynamic Range Images
Poster Session 1
Chih-Hao Lin · Jia-Bin Huang · Zhengqin Li · Zhao Dong · Christian Richardt · Michael Zollhoefer · Tuotuo Li · Johannes Kopf · Shenlong Wang · Changil Kim
|
ExHall D Poster #28 | |
Volumetric Surfaces: Representing Fuzzy Geometries with Layered Meshes
Poster Session 5
Stefano Esposito · Anpei Chen · Christian Reiser · Samuel Rota Bulò · Lorenzo Porzi · Katja Schwarz · Christian Richardt · Michael Zollhoefer · Peter Kontschieder · Andreas Geiger
|
ExHall D Poster #31 | |
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning
Poster Session 2
Di Zhang · Jingdi Lei · Junxian Li · Xunzhi Wang · Yujie Liu · Zonglin Yang · Jiatong LI · Weida Wang · Suorong Yang · Jianbo Wu · Peng Ye · Wanli Ouyang · Dongzhan Zhou
|
ExHall D Poster #352 | |
Your Scale Factors are My Weapon: Targeted Bit-Flip Attacks on Vision Transformers via Scale Factor Manipulation
Poster Session 4
Jialai Wang · Yuxiao Wu · Weiye Xu · Yating Huang · Chao Zhang · Zongpeng Li · Mingwei Xu · Zhenkai Liang
|
ExHall D Poster #410 | |
Unified Reconstruction of Static and Dynamic Scenes from Events
Qiyao Gao · Peiqi Duan · Hanyue Lou · Minggui Teng · Ziqi Cai · Xu Chen · Boxin Shi
|
ExHall D Poster #166 | |
CaricatureBooth: Data-Free Interactive Caricature Generation in a Photo Booth
Poster Session 3
Zhiyu Qu · Yunqi Miao · Zhensong Zhang · Jifei Song · Jiankang Deng · Yi-Zhe Song
|
ExHall D Poster #15 | |
ImViD: Immersive Volumetric Videos for Enhanced VR Engagement
Poster Session 4
Zhengxian Yang · Shi Pan · Shengqi Wang · Haoxiang Wang · Li Lin · Guanjun Li · Zhengqi Wen · Borong Lin · Jianhua Tao · Tao Yu
|
ExHall D Poster #69 | |
Brain-Inspired Spiking Neural Networks for Energy-Efficient Object Detection
Poster Session 1
Ziqi Li · Tao Gao · Yisheng An · Ting Chen · Jing Zhang · Yuanbo Wen · Mengkun Liu · Qianxi Zhang
|
ExHall D Poster #322 | |
FFR: Frequency Feature Rectification for Weakly Supervised Semantic Segmentation
Poster Session 6
Ziqian Yang · Xinqiao Zhao · Xiaolei Wang · Quan Zhang · Jimin Xiao
|
ExHall D Poster #394 | |
Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition
Poster Session 5
Chengxiang Huang · Yake Wei · Zequn Yang · Di Hu
|
ExHall D Poster #463 | |
Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification
Poster Session 2
Zequn Zeng · Yudi Su · Jianqiao Sun · Tiansheng Wen · Hao Zhang · Zhengjue Wang · Bo Chen · Hongwei Liu · Jiawei Ma
|
ExHall D Poster #395 | |
Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models
Poster Session 6
Yan Xie · Zequn Zeng · Hao Zhang · Yucheng Ding · Yi Wang · Zhengjue Wang · Bo Chen · Hongwei Liu
|
ExHall D Poster #388 | |
Flexible Group Count Enables Hassle-Free Structured Pruning
Poster Session 1
Jiamu Zhang · Shaochen (Henry) Zhong · Andrew Ye · Zirui Liu · Sebastian Zhao · Kaixiong Zhou · Li Li · Soo-Hyun Choi · Rui Chen · Xia Hu · Shuai Xu · Vipin Chaudhary
|
ExHall D Poster #444 | |
Disco4D: Disentangled 4D Human Generation and Animation from a Single Image
Poster Session 6
Hui En Pang · Shuai Liu · Zhongang Cai · Lei Yang · Tianwei Zhang · Ziwei Liu
|
ExHall D Poster #14 | |
EgoLM: Multi-Modal Language Model of Egocentric Motions
Poster Session 2
Fangzhou Hong · Vladimir Guzov · Hyo Jin Kim · Yuting Ye · Richard Newcombe · Ziwei Liu · Lingni Ma
|
ExHall D Poster #69 | |
HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
Poster Session 1
Hermann Kumbong · Xian Liu · Tsung-Yi Lin · Ming-Yu Liu · Xihui Liu · Ziwei Liu · Daniel Y Fu · Christopher Re · David W. Romero
|
ExHall D Poster #227 | |
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Poster Session 2
Yuhao Dong · Zuyan Liu · Hai-Long Sun · Jingkang Yang · Winston Hu · Yongming Rao · Ziwei Liu
|
ExHall D Poster #353 | |
WildAvatar: Learning In-the-wild 3D Avatars from the Web
Poster Session 4
Zihao Huang · Shoukang Hu · Guangcong Wang · Tianqi Liu · Yuhang Zang · Zhiguo Cao · Wei Li · Ziwei Liu
|
ExHall D Poster #10 | |
S2D-LFE: Sparse-to-Dense Light Field Event Generation
Poster Session 3
Yutong Liu · Wenming Weng · Yueyi Zhang · Zhiwei Xiong
|
ExHall D Poster #53 | |
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters
Poster Session 6
Jianping Jiang · Weiye Xiao · Zhengyu Lin · Huaizhong Zhang · Tianxiang Ren · Yang Gao · Zhiqian Lin · Zhongang Cai · Lei Yang · Ziwei Liu
|
ExHall D Poster #71 | |
Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion
Poster Session 6
Zhiqiang Yan · Zhengxue Wang · Kun Wang · Jun Li · Jian Yang
|
ExHall D Poster #76 | |
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
Poster Session 3
Hui Zhang · Tingwei Gao · Jie Shao · Zuxuan Wu
|
ExHall D Poster #214 | |
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
Poster Session 1
Zihao Zhang · Haoran Chen · Haoyu Zhao · Guansong Lu · Yanwei Fu · Hang Xu · Zuxuan Wu
|
ExHall D Poster #182 | |
StableAnimator: High-Quality Identity-Preserving Human Image Animation
Poster Session 5
Shuyuan Tu · Zhen Xing · Xintong Han · Zhi-Qi Cheng · Qi Dai · Chong Luo · Zuxuan Wu
|
ExHall D Poster #5 | |
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Poster Session 4
Yicheng Chen · Xiangtai Li · Yining Li · Yanhong Zeng · Jianzong Wu · Xiangyu Zhao · Kai Chen
|
ExHall D Poster #395 | |
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Poster Session 2
Rang Meng · Xingyu Zhang · Yuming Li · Chenguang Ma
|
ExHall D Poster #4 | |
3D Prior Is All You Need: Cross-Task Few-shot 2D Gaze Estimation
Poster Session 5
Yihua Cheng · Hengfei Wang · Zhongqun Zhang · Yang Yue · Boeun Kim · Feng Lu · Hyung Jin Chang
|
ExHall D Poster #275 | |
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models
Poster Session 4
Yongting Zhang · Lu Chen · Guodong Zheng · Yifeng Gao · Rui Zheng · Jinlan Fu · Zhenfei Yin · Senjie Jin · Yu Qiao · Xuanjing Huang · Feng Zhao · Tao Gui · Jing Shao
|
ExHall D Poster #387 | |
Dual Semantic Guidance for Open Vocabulary Semantic Segmentation
Poster Session 4
ZhengYang Wang · Tingliang Feng · Fan Lyu · Fanhua Shang · Wei Feng · Liang Wan
|
ExHall D Poster #420 | |
Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting
Jinbo Yan · Rui Peng · Zhiyan Wang · Luyang Tang · Jiayu Yang · Jie Liang · Jiahao Wu · Ronggang Wang
|
ExHall D Poster #65 | |
SocialMOIF: Multi-Order Intention Fusion for Pedestrian Trajectory Prediction
Poster Session 5
Kai Chen · Xiaodong Zhao · Yujie Huang · Guoyu Fang · Xiao Song · Ruiping Wang · Ziyuan Wang
|
ExHall D Poster #134 | |
DIFIX3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
Poster Session 6
Jay Zhangjie Wu · Alex Zhang · Haithem Turki · Xuanchi Ren · Jun Gao · Mike Zheng Shou · Sanja Fidler · Žan Gojčič · Huan Ling
|
ExHall D Poster #57 | |
Reproducible Vision-Language Models Meet Concepts Out of Pre-Training
Poster Session 3
Ziliang Chen · Xin Huang · Xiaoxuan Fan · Keze Wang · Yuyu Zhou · Quanlong Guan · Liang Lin
|
ExHall D Poster #388 | |
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling
Poster Session 2
Jian Yang · Dacheng Yin · Yizhou Zhou · Fengyun Rao · Wei Zhai · Yang Cao · Zheng-Jun Zha
|
ExHall D Poster #249 | |
Towards Smart Point-and-Shoot Photography
Poster Session 6
Jiawan Li · Fei Zhou · Zhipeng Zhong · Jiongzhi Lin · Guoping Qiu
|
ExHall D Poster #198 | |
Generative Video Propagation
Poster Session 4
Shaoteng Liu · Tianyu Wang · Jui-Hsien Wang · Qing Liu · Zhifei Zhang · Joon-Young Lee · Yijun Li · Bei Yu · Zhe Lin · Soo Ye Kim · Jiaya Jia
|
ExHall D Poster #182 | |
TransPixeler: Advancing Text-to-Video Generation with Transparency
Poster Session 4
Luozhou Wang · Yijun Li · ZhiFei Chen · Jui-Hsien Wang · Zhifei Zhang · He Zhang · Zhe Lin · Ying-Cong Chen
|
ExHall D Poster #232 | |
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Poster Session 5
Hang Hua · Qing Liu · Lingzhi Zhang · Jing Shi · Soo Ye Kim · Zhifei Zhang · Yilin Wang · Jianming Zhang · Zhe Lin · Jiebo Luo
|
ExHall D Poster #357 | |
TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting
Poster Session 2
Liangbin Xie · Daniil Pakhomov · Zhonghao Wang · Zongze Wu · Ziyan Chen · Yuqian Zhou · Haitian Zheng · Zhifei Zhang · Zhe Lin · Jiantao Zhou · Chao Dong
|
ExHall D Poster #214 | |
Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs
Yingji Zhong · Zhihao Li · Dave Zhenyu Chen · Lanqing Hong · Dan Xu
|
ExHall D Poster #66 | |
SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity
Ke Ma · Jiaqi Tang · Bin Guo · Fan Dang · Sicong Liu · Zhui Zhu · Lei Wu · Cheng Fang · Ying-Cong Chen · Zhiwen Yu · Yunhao Liu
|
ExHall D Poster #418 | |
The Scene Language: Representing Scenes with Programs, Words, and Embeddings
Yunzhi Zhang · Zizhang Li · Matt Zhou · Shangzhe Wu · Jiajun Wu
|
ExHall D Poster #344 | |
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Poster Session 1
Wei Cheng · Juncheng Mu · Xianfang Zeng · Xin Chen · Anqi Pang · Chi Zhang · Zhibin Wang · Bin Fu · Gang Yu · Ziwei Liu · Liang Pan
|
ExHall D Poster #39 | |
SocialGesture: Delving into Multi-person Gesture Understanding
Poster Session 4
Xu Cao · Pranav Virupaksha · Wenqi Jia · Bolin Lai · Fiona Ryan · Sangmin Lee · James Rehg
|
ExHall D Poster #353 | |
Self-Supervised Cross-View Correspondence with Predictive Cycle Consistency
Alan Baade · Changan Chen
|
ExHall D Poster #89 |