Toggle Poster Visibility
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 1
MAMMA: Markerless Accurate Multi-person Motion Acquisition
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 2
Natural Human Motion Recovery by Aligning High-Order Temporal Dynamics from Monocular Videos
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 3
PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 4
SAM 3D Body: Robust Full-Body Human Mesh Recovery
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 5
SAM 3D: 3Dfy Anything in Images
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 6
SPARK: Sim-ready Part-level Articulated Reconstruction with VLM Knowledge
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 7
3DReflecNet: A Large-Scale Dataset for 3D Reconstruction of Reflective, Transparent, and Low-Texture Objects
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 8
GLINT: Modeling Scene-Scale Transparency via Gaussian Radiance Transport
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 9
Neural Field-Based 3D Surface Reconstruction of Microstructures from Multi-Detector Signals in Scanning Electron Microscopy
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 10
PhyGaP: Physically-Grounded Gaussians with Polarization Cues
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 11
PPISP: Physically-Plausible Compensation and Control of Photometric Variations in Radiance Field Reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 12
SeeGroup: Multi-Layer Depth Estimation of Transparent Surfaces via Self-Determined Grouping
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 13
Energy-GS: Image Energy-guided Pose Alignment Gaussian Splatting with redesigned pose gradient flow
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 14
MeshSplatting: Differentiable Rendering with Opaque Meshes
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 15
Proxy-GS: Unified Occlusion Priors for Training and Inference in Structured 3D Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 16
RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 17
Selfi: Self-improving Reconstruction Engine via 3D Geometric Feature Alignment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 18
Z-Order Transformer for Feed-Forward Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 19
4D Primitive-Mâché: Glueing Primitives for Persistent 4D Scene Reconstruction
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 20
Efficiently Reconstructing Dynamic Scenes One D4RT at a Time
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 21
FUSER: Feed-Forward Multiview 3D Registration Transformer and SE(3)^N Diffusion Refinement
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 22
Residual Primitive Fitting of 3D Shapes with SuperFrusta
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 23
SmokeSVD: Smoke Reconstruction from A Single View via Progressive Novel View Synthesis and Refinement with Diffusion Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 24
SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 25
Affostruction: 3D Affordance Grounding with Generative Reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 26
MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 27
Unified Primitive Proxies for Structured Shape Completion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 28
ART: Articulated Reconstruction Transformer
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 29
SCE-SLAM: Scale-Consistent Monocular SLAM via Scene Coordinate Embeddings
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 30
S2D: Sparse to Dense Lifting for 3D Reconstruction with Minimal Inputs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 31
Pip-Stereo: Progressive Iterations Pruner for Iterative Optimization based Stereo Matching
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 32
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 33
E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 34
QVGGT: Post-Training Quantized Visual Geometry Grounded Transformer
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 35
SRGCD: Stability-Driven Region Growth Framework for 3D Change Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 36
D-Prism: Differentiable Primitives for Structured Dynamic Modeling
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 37
STAC: Plug-and-Play Spatio-Temporal Aware Cache Compression for Streaming 3D Reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 38
Stabilizing Streaming Video Geometry via Dynamic Feature Normalization
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 39
LaS-Comp: Zero-shot 3D Completion with Latent–Spatial Consistency
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 41
EfficientMonoHair: Fast Strand-Level Reconstruction from Monocular Video via Multi-View Direction Fusion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 42
OSPO: Object-Centric Self-Improving Preference Optimization for Text-to-Image Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 43
MoReGen: Multi-Agent Motion-Reasoning Engine for Code-based Text-to-Video Synthesis
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 44
StyleTextGen: Style-Conditioned Multilingual Scene Text Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 45
CRAFT-LoRA: Content-Style Personalization via Rank-Constrained Adaptation and Training-Free Fusion
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 46
OneHOI: Unifying Human-Object Interaction Generation and Editing
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 47
GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 48
Self-Paced and Self-Corrective Masked Prediction for Movie Trailer Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 49
TV2TV: A Unified Framework for Interleaved Language and Video Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 50
Narrative Weaver: Towards Controllable Long-Range Visual Consistency with Multi-Modal Conditioning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 51
Ref4D-VideoBench: Four-Dimensional Reference-Based Evaluation of Text-to-Video Generative Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 52
PureCC: Pure Learning for Text-to-Image Concept Customization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 53
Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 54
Yume1.5: A Text-Controlled Interactive World Generation Model
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 55
PosterReward: Unlocking Accurate Evaluation for High-Quality Graphic Design Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 56
Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 57
SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 58
PROMPTMINER: Black-Box Prompt Stealing against Text-to-Image Generative Models via Reinforcement Learning and VLM-Guided Optimization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 59
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 60
Self-Evaluation Unlocks Any-Step Text-to-Image Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 61
Say Cheese! Detail-Preserving Portrait Collection Generation via Natural Language Edits
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 62
LVLM-Aided Alignment of Task-Specific Vision Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 63
DeepAlign: Mitigating Modality Conflict through Modality-Specific Alignment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 64
PG-VTON: Single-Pass Training-Free Virtual Try-On via Patch-Guided Reference Alignment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 65
Linguistic Priors for Visual Decoupling: Towards Symmetric Vision-Brain Alignment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 66
Scaling Spatial Intelligence with Multimodal Foundation Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 67
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 68
SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 69
AVATAR: Reinforcement Learning to See, Hear, and Reason Over Video
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 70
CogniVerse: Revolutionizing Multi-Modal Retrieval-Augmented Generation with Cognitive Reflection and Geometric Reasoning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 71
FOZO: Forward-Only Zeroth-Order Prompt Optimization for Test-Time Adaptation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 72
Language Does Matter for Cross-Domain Few-Shot Visual Feature Enhancement
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 73
Back to Source: Open-Set Continual Test-Time Adaptation via Domain Compensation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 74
Bridging Domain Expertise and Generalization for Performance Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 75
Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 76
Bridging Domains through Subspace-Aware Model Merging
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 77
DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 78
Scaling Dense Event-Stream Pretraining from Visual Foundation Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 79
Event Stream Filtering via Probability Flux Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 80
AIMDepth: Asymmetric Image-Event Mamba for Monocular Depth Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 81
Time-Specialized Event-Image Alignment for Blur-to-Video Decomposition
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 82
eRetinexGS: Retinex Modeling for Low-Light Scene Enhancement via Event Streams and 3D Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 83
Unsupervised 3d Motion Estimation Using Event Camera
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 85
ModularAgent: A Task-Aware Modular Framework for Joint Optimization of Multimodal Large Language Models and World Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 86
AstraNav-Memory: Contexts Compression for Long Memory
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 87
Test-Time Perturbation Learning with Delayed Feedback for Vision-Language-Action Models
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 88
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 89
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 90
ActiveVLA: Injecting Active Perception into Vision-Language-Action Models for Precise 3D Robotic Manipulation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 91
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 92
BridgeEQA: Virtual Embodied Agents for Real Bridge Inspections
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 93
SyncMos: Scalable Motion Synchronisation for Multi-Agent Scene Interaction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 94
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 95
Omni-Attribute: Open-vocabulary Attribute Encoder for Visual Concept Personalization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 96
IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 97
InstantRetouch: Efficient and High-Fidelity Instruction-Guided Image Retouching with Bilateral Space
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 98
MICON-Bench: Benchmarking and Enhancing Multi-Image Context Image Generation in Unified Multimodal Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 99
The Devil is in Attention Sharing: Improving Complex Non-rigid Image Editing Faithfulness via Attention Synergy
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 100
ShreddingNet: Coarse-to-Fine Restoration for Multi-Source Shredded Manuscripts
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 101
Image Guides Images: Consistent Video Amodal Completion with Rectified In-Context Exemplar Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 102
Radiance Meshes for Volumetric Reconstruction
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 104
CoRoGS: Contextual Gaussian Splatting for Robust Large-Deviation View Synthesis
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 105
ChronoGS: Disentangling Invariants and Changes in Multi-Period Scenes
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 106
Real-Time Dynamic Scene Rendering with Controlled Compressibility and Contact Awareness
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 107
Splatent: Splatting Diffusion Latents for Novel View Synthesis
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 109
Dynamic-Static Decomposition for Novel View Synthesis of Dynamic Scenes with Spiking Neurons
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 110
DiffSoup: Direct Differentiable Rasterization of Triangle Soup for Extreme Radiance Field Simplification
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 111
Gyro-based Deep Video Deblurring
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 112
Residual Diffusion Bridge Model for Image Restoration
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 113
MMDIR: Multimodal Instruction-Driven Framework for Mixed-Degradation Document Image Restoration
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 114
Rectifying Latent Space for Generative Single-Image Reflection Removal
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 115
Towards Generalized Multimodal Homography Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 116
Edit-aware RAW reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 117
Face2Scene: Using Facial Degradation as an Oracle for Diffusion-Based Scene Restoration
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 118
HG-Lane: High-Fidelity Generation of Lane Scenes under Adverse Weather and Lighting Conditions without Re-annotation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 120
MR. Illuminate: Zero-Shot Low-Light Image Enhancement with Diffusion Prior
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 121
FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 122
SPEGC: Continual Test-Time Adaptation via Semantic-Prompt-Enhanced Graph Clustering for Medical Image Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 123
BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 125
CROWn: A Unified Framework for Anti‑Aliased Downsampling and Phase‑Calibrated Fusion in 3D Medical Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 126
Rethinking Box Supervision: Bias-Free Weakly Supervised Medical Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 127
Semi-supervised Echocardiography Video Segmentation via Anchor Semantic Awareness and Continuous Pseudo-label Reforging
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 130
Breaking Multimodal LLM Safety via Video-Driven Prompting
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 131
When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 132
RecoverMark: Robust Watermarking for Localization and Recovery of Manipulated Faces
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 133
A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 134
FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 135
PureProof: Diffusion-Resistant Black-box Targeted Attack on Large Vision-Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 136
UniDef: Universal Defense Against Unauthorized Image Manipulation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 137
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 138
MERLIN: Building Low-SNR Robust Multimodal LLMs for Electromagnetic Signals
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 139
Rethinking Cross-Modal Anchor Alignment for Mitigating Error Accumulation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 141
Omni-MMSI: Toward Identity-attributed Social Interaction Understanding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 142
Inconsistency-aware Multimodal Schrödinger Bridge for Deepfake Localization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 143
MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 144
Seeing Through Touch: Tactile-Driven Visual Localization of Material Regions
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 145
Seeing What Matters: A Training-Free Self-Guided Framework for Multimodal Detail Perception and Reasoning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 146
Illuminating Visual Identity in Universal Multimodal Embeddings
[
Slides]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 147
Anti-Degradation Lifelong Multi-View Clustering
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 148
The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 149
Efficient and High-Fidelity Omni Modality Retrieval
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 150
Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 151
Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 152
HAVE-Bench: Hierarchical Audio-Visual Evaluation from Perception to Interaction
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 153
Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 154
THE MORE, THE MERRIER: CONTRASTIVE FUSION FOR HIGHER-ORDER MULTIMODAL ALIGNMENT
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 155
CineSRD: Leveraging Visual, Acoustic, and Linguistic Cues for Open-World Visual Media Speaker Diarization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 157
UST-Hand: An Uncertainty-aware Spatiotemporal Point Cloud Interaction Network for 3D Self-supervised Hand Pose Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 158
ForeHOI: Feed-forward 3D Object Reconstruction from Daily Hand-Object Interaction Videos
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 159
Hoi! - A Multimodal Dataset for Force-Grounded, Cross-View Articulated Manipulation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 160
Enhancing Hands in 3D Whole-Body Pose Estimation with Conditional Hands Modulator
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 161
TouchDream: 3D Object Completion through Imagined Touch
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 162
ForceVLA2: Unleashing Hybrid Force-Position Control with Force Awareness for Contact-Rich Manipulation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 163
TokenHand: Discrete Token Representation for Efficient Hand Mesh Reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 164
Artiverse: A Diverse and Physically Grounded Dataset for Articulated Objects
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 165
MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 166
LogCD: Local-to-global Consistency Distillation for Few-step Image Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 167
EditCtrl: Disentangled Local and Global Control for Real-Time Generative Video Editing
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 168
Anchoring and Rescaling Attention for Semantically Coherent Inbetweening
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 169
FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 170
LightMover: Generative Light Movement with Color and Intensity Controls
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 171
Parallel Jacobi Decoding for Fast Autoregressive Image Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 172
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 173
CREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 174
EchoVDiff: Cardiac-Cycle Echocardiography Video Generation from Arbitrary Frame
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 175
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 177
Frequency-Aware Flow Matching for High-Quality Image Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 178
STARFlow-V: End-to-End Video Generative Modeling with Autoregressive Normalizing Flows
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 179
MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 180
Improving Controllable Generation: Faster Training and Better Performance via x0-Supervision
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 182
OrionEdit: Bridging Reference and Source Images for Generalized Cross-Image Editing
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 183
PositionIC: Unified Position and Identity Consistency for Image Customization
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 184
P-Flow: Prompting Visual Effects Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 185
Clair Obscur: an Illumination-Aware Method for Real-World Image Vectorization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 186
SURF: Signature-Retained Fast Video Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 187
The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 188
Lynx: Towards High-Fidelity Personalized Video Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 189
VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 190
ClusterMark: Towards Robust Watermarking for Autoregressive Image Generators with Visual Token Clustering
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 191
Stable Mean Flow: Lyapunov-Inspired One-Step Flow Matching
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 192
OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 193
First Frame Is the Place to Go for Video Content Customization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 194
Scaling Zero-Shot Reference-to-Video Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 195
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 196
VDOT: Efficient Unified Video Creation via Optimal Transport Distillation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 197
Real-Time Generation of Streamable Talking Portrait Video with Reference-Guided Deep Compression VAEs
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 198
RunawayEvil: Jailbreaking the Image-to-Video Generative Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 199
MultiAnimate: Pose-Guided Image Animation Made Extensible
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 200
Translating Signals to Languages for sEMG-Based Activity Recognition
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 201
Open the Motion Door: Atomic Motion Decomposition and Recomposition for Open-Vocabulary Motion Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 203
MotionHiFlow: Text-to-Motion via Hierarchical Flow Matching
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 204
LaMoGen: Language to Motion Generation Through LLM-Guided Symbolic Inference
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 206
GVIS: Generative Vector Image Steganography
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 207
MaxMark: High-Capacity Diffusion-Native Watermarking via Robust and Invertible Latent Embedding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 208
GeoRK2: Geometry-Guided Runge–Kutta Integration for Diffusion Transformer Acceleration
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 209
Test-time Sparsity for Extreme Fast Action Diffusion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 210
Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 211
A Self-Conditioned Representation Guided Diffusion Model for Realistic Text-to-LiDAR Scene Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 212
When Local Rules Create Global Order: Self-Organized Representation Learning for Latent Diffusion Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 213
ViStoryBench: Comprehensive Benchmark Suite for Story Visualization
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 214
R4-CGQA: Retrieval-based Vision Language Models for Computer Graphics Image Quality Assessment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 215
A³: Towards Advertising Aesthetic Assessment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 216
GraphVLM: Benchmarking Vision Language Models for Multimodal Graph Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 217
Phrase-Grounding-Aware Supervised Fine-Tuning for Chart Recognition via Side-Masked Attention
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 219
CLIP Is Shortsighted: Paying Attention Beyond the First Sentence
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 220
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 221
UZ3DVG: Unaided Zero-Shot 3D Visual Grounding with Generated Language Conditions
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 222
LangField4D: Learning Identity-Adaptive and Spatio-Temporal Continuous 4D Language Fields for Dynamic Scenes
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 223
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 224
CLIPoint3D: Language-Grounded Few-Shot Unsupervised 3D Point Cloud Domain Adaptation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 225
GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 227
Geometry-Guided 3D Visual Token Pruning for Video-Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 228
Context-Nav: Context-Driven Exploration and Viewpoint-Aware 3D Spatial Reasoning for Instance Navigation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 229
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 230
PanoEnv: Exploring 3D Spatial Intelligence in Panoramic Environments with Reinforcement Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 231
Hilbert-Geo: Solving Solid Geometric Problems by Neural-Symbolic Reasoning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 232
Direction-aware 3D Large Multimodal Models
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 233
CLAY: Conditional Visual Similarity Modulation in Vision-Language Embedding Space
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 234
Tackling Alignment Ambiguity in Person Retrieval through Conversational Attribute Mining
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 235
Beyond Global Similarity: Multi-Conditional Retrieval for Fine-Grained Cross-Modal Understanding
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 236
Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 237
What Is the Optimal Ranking Score Between Precision and Recall? We Can Always Find It and It Is Rarely F1
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 238
Robust Remote Sensing Image–Text Retrieval with Noisy Correspondence
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 239
PinPoint: Evaluation of Composed Image Retrieval with Explicit Negatives, Multi-Image Queries, and Paraphrase Testing
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 240
Single-step Diffusion-based Video Coding with Semantic-Temporal Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 241
Memory Matters: Boosting Training-Free Zero-Shot Temporal Action Localization with a Learnable Lookup Table
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 242
TVHighlights: LLM-Guided Human-Free Collaborative Training for Video Highlight Detection in Movies and TV Dramas
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 243
Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 244
Reinforcing Structured Chain-of-Thought for Video Understanding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 245
FlexiVideo: Variation-Aware Temporal Dynamics Modeling for Efficient Video Understanding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 246
MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 247
Learning Effective Sign Features without Text for Gloss-free Sign Language Translation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 248
META: Meta Evolution of Tool Trajectory Adaptation for Long-Video Understanding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 249
GT-SVJ: Generative-Transformer-Based Self-Supervised Video Judge For Efficient Video Reward Modeling
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 250
Local Motion Matters: A Deconstruct–Recompose Paradigm for Reinforcement Learning Pre-training from Videos
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 251
Align Once to Explain: Feature Alignment for Scalable B-cosification of Foundational Vision Transformers
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 252
Rounded or Streamlined Head? Bridging Concept Bottleneck Models and Attribute-Described Object Parts
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 253
CIGMA: Causal Information-Gain Mechanistic Attribution of Attention Heads in Vision Transformers
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 254
Rethinking Concept Bottleneck Models: From Pitfalls to Solutions
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 255
Make it SING: Analyzing Semantic Invariants in Classifiers
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 256
Back to the Feature: Explaining Video Classifiers with Video Counterfactual Explanations
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 257
LEADER: Learning Reliable Local-to-Global Correspondences for LiDAR Relocalization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 258
UniCorrn: Unified Correspondence Transformer Across 2D and 3D
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 259
Probabilistic Discrepancy Learning for Roadside LiDAR Scene Completion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 260
TACO: Task-Aware Contrastive Learning for Joint LiDAR Localization and 3D Object Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 261
Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 262
Learning Coordinate-based Convolutional Kernels for Continuous SE(3) Equivariant and Efficient Point Cloud Analysis
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 263
R3-PCQA: Ray-Reprojection-Reinforcement for No-Reference 3D Point Cloud Quality Assessment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 264
Geometric-Aware Hypergraph Reasoning for Novel Class Discovery in Point Cloud Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 265
PointCSP: Cross-Sample Semantic Propagation and Stability Preservation in Self-Supervised Point Cloud Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 266
U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 267
TerraSeg: Self-Supervised Ground Segmentation for Any LiDAR
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 268
Where Does Vision Meet Language? Understanding and Refining Visual Fusion in MLLMs via Contrastive Attention
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 269
UniRefiner: Teaching Pre-trained ViTs to Self-Dispose Dross via Contrastive Register
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 270
SigLino: Efficient Multi-Teacher Distillation for Agglomerative Vision Foundation Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 271
Heuristic-inspired Reasoning Priors Facilitate Data-Efficient Referring Object Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 272
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 273
AVION: Aerial Vision–Language Instruction from Offline Teacher to Prompt-Tuned Network
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 274
CrossVL: Complexity-Aware Feature Routing and Paired Curriculum for Cross-View Vision-Language Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 275
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 276
Role-SynthCLIP: A Role-Play Driven Diverse Synthetic Data Approach
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 277
BiMotion: B-spline Motion for Text-guided Dynamic 3D Character Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 278
PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 279
CADFS: A Big CAD Program Dataset and Framework for Computer-Aided Design with Large Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 280
MapRoute:Precise-Concept Erasing Mappers via Semantic Routing
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 281
PhotoFramer: Multi-modal Image Composition Instruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 282
Can We Build Scene Graphs, Not Classify Them? FlowSG: Progressive Image-Conditioned Scene Graph Generation with Flow Matching
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 283
DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 285
Frequency-domain Manipulation for Face Obfuscation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 286
Towards Reasoning-Preserving Unlearning in Multimodal Large Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 287
Erasing Thousands of Concepts: Towards Scalable and Practical Concept Erasure for Text-to-Image Diffusion Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 288
POUR: A Provably Optimal Method for Unlearning Representation via Neural Collapse
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 289
Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 290
Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 291
SPDMark: Selective Parameter Displacement for Robust Video Watermarking
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 292
Enhancing Visual Representation with Textual Semantics: Textual Semantics-Powered Prototypes for Heterogeneous Federated Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 293
FedHarmony: Harmonizing Heterogeneous Label Correlations in Federated Multi-Label Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 294
FedSST: Rethinking Fair Federated Graph Learning under Structural Shift
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 295
GDFA: Geometry-Driven Federated Unlearning with Directional Task Vector Alignment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 296
FedARA: Resource-adaptive Low-rank Personalized Federated Learning via Anchor-driven Representation Alignment on Heterogeneous Edge Devices
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 297
InterRVOS: Interaction-Aware Referring Video Object Segmentation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 298
RE-VLM: Event-Augmented Vision-Language Model for Scene Understanding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 299
RegFormer: Transferable Relational Grounding for Efficient Weakly-Supervised Human-Object Interaction Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 300
Learning to Refuse: Refusal-Aware Reinforcement Fine-Tuning for Hard-Irrelevant Queries in Video Temporal Grounding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 301
GroundVTS: Visual Token Sampling in Multimodal Large Language Models for Video Temporal Grounding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 302
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 303
Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 304
MeToM: Metadata-Guided Token Merging for Efficient Video LLMs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 305
Token Reduction via Local and Global Contexts Optimization for Efficient Video Large Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 306
VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 307
Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 308
Attention-aware Inference Optimizations for Large Vision-Language Models with Memory-efficient Decoding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 309
CoIn: Coverage and Informativeness-Guided Token Reduction for Efficient Large Multimodal Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 310
TAMER: A Tri-Modal Contrastive Alignment and Multi-Scale Embedding Refinement Framework for Zero-Shot ECG Diagnosis
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 311
Your Dissimilarities Define You: Complementary Learning Exploiting Class Diversities
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 312
CGU-Bayes: Causal Graph Uncertainty-Guided Bayesian Inference for Domain Generalization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 313
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 314
Towards Stable Self-Supervised Object Representations in Unconstrained Egocentric Video
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 315
LRDUN: A Low-Rank Deep Unfolding Network for Efficient Spectral Compressive Imaging
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 316
Neural Collapse in Test-Time Adaptation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 317
CLEX: Complementary Label Exchange Learning for Noisy Facial Expression Recognition
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 318
TruckDrive: Long-Range Autonomous Highway Driving Dataset
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 319
Neuro-Cognitive Reward Modeling for Human-Centered Autonomous Vehicle Control
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 320
E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 321
The Blind Spot of Adaptation: Quantifying and Mitigating Forgetting in Fine-tuned Driving Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 322
Den-TP: A Density-Balanced Data Curation and Evaluation Framework for Trajectory Prediction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 323
Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 324
GaussianDWM: 3D Gaussian Driving World Model for Unified Scene Understanding and Multi-Modal Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 325
Mind the Hitch: Dynamic Calibration and Articulated Perception for Autonomous Trucks
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 327
Beyond Rule-Based Agents: Active Markov Games for Realistic Multi-Agent Interaction in Autonomous Driving
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 328
Test-Time Multi-Prompt Adaptation for Open-Vocabulary Remote Sensing Image Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 329
ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 330
CrackSSM: Reviving SSMs for Crack Segmentation via Dynamic Scanning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 331
BiPA: Bilevel Prompt Adaptation for Underwater Instance Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 332
RS-SSM: Refining Forgotten Specifics in State Space Model for Video Semantic Segmentation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 333
Scene-Centric Unsupervised Video Panoptic Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 334
Bootstrapping Video Semantic Segmentation Model via Distillation-assisted Test-Time Adaptation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 335
GeoFree-CoSeg: Unsupervised Point Cloud-Image Cross-Modal Co-Segmentation Without Geometric Alignment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 336
Parameter-efficient Continual Learning for Enhancing Plasticity without Forgetting under Limited Model Capacity
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 337
Dual-Estimator: Decoupling Global and Local Semantic Shift for Drift Compensation in Class-Incremental Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 338
Continual Distillation of Teachers from Different Domains
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 339
Multimodal Continual Instruction Tuning with Dynamic Gradient Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 340
Learning from Itself: Mining Internal Knowledge from Vision Language Models for Continual Learning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 341
AdaPrior: Bayesian-Inspired Adaptive Prior Correction for Long-Tailed Continual Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 342
An Optimal Transport-driven Approach for Cultivating Latent Space in Online Incremental Learning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 343
HAD: Heterogeneity-Aware Distillation for Lifelong Heterogeneous Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 344
U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 345
StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 346
FlashLips: 100-FPS Mask-Free Latent Lip-Sync using Reconstruction Instead of Diffusion or GANs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 347
WildCap: Facial Albedo Capture in the Wild via Hybrid Inverse Rendering
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 348
EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 349
DyaDiT: A Multi-Modal Diffusion Transformer for Socially Favorable Dyadic Gesture Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 350
TRM-VLA: Temporal-Aware Chain-of-Thought Reasoning and Memorization for Vision-Language-Action Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 351
VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 352
NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 353
HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 354
CycleBEV: Regularizing View Transformation Networks via View Cycle Consistency for Bird’s-Eye-View Semantic Segmentation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 355
STAvatar: Soft Binding and Temporal Density Control for Monocular 3D Head Avatars Reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 356
CrowdGaussian: Reconstructing High-Fidelity 3D Gaussians for Human Crowd from a Single Image
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 357
OMG-Avatar: One-shot Multi-LOD Gaussian Head Avatar
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 358
Globally Optimal Pose from Orthographic Silhouettes
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 359
AvatarPointillist: AutoRegressive 4D Gaussian Avatarization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 360
COPO: Causal-Oriented Policy Optimization for Hallucinations of MLLMs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 361
Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 362
AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLM
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 363
HulluEdit: Single-Pass Evidence-Consistent Subspace Editing for Mitigating Hallucinations in Large Vision-Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 364
SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 365
One Token, Two Fates: A Unified Framework via Vision Token Manipulation Against MLLMs Hallucination
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 366
EgoX: Egocentric Video Generation from a Single Exocentric Video
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 367
SymphoMotion: Joint Control of Camera Motion and Object Dynamics for Coherent Video Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 368
Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 369
SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 371
Scaling4D: Pushing the Frontier of Video Novel View Synthesis through Large-Scale Monocular Videos
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 372
PHANTOM: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 373
WorldReel: 4D Video Generation with Consistent Geometry and Motion Modeling
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 374
Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 375
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 376
D2FANet: Enhancing Video Object Detection with Dual-Domain Feature Aggregation Network
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 377
HierUQ: Hierarchical Uncertainty Quantification with Adaptive Granularity Reconciliation for Degraded Image Classification
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 378
ID-Sim: An Identity-Focused Similarity Metric
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 379
Hier-COS: Making Deep Features Hierarchy-aware via Composition of Orthogonal Subspaces
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 380
Towards Cross-Modal Preservation, Consistency and Alignment for Privacy-Preserving Visible-Infrared Person Re-Identification
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 382
COPE: Consistent Occlusion and Prompt Enhancement Network for Occluded Person Re-identification
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 383
Assignment-Driven Hash Learning in a Hyper-Semantic Space for On-the-Fly Category Discovery
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 384
DyFCLT: Dynamic Frequency-Decoupled Cross-Modal Learning Transformer for Multimodal Tiny Object Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 385
EW-DETR: Evolving World Object Detection via Incremental Low-Rank DEtection TRansformer
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 386
Building a Precise Video Language with Human–AI Oversight
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 387
CoCoVideo: The High-Quality Commercial-Model-Based Contrastive Benchmark for AI-Generated Video Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 388
Towards Sparse Video Understanding and Reasoning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 389
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 390
MuKV: Multi-Grained KV Cache Compression for Long Streaming Video Question-Answering
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 391
ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 392
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 393
What Are You Doing? A Closer Look at Controllable Human Video Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 394
Score2Instruct: Scaling Up Video Quality-Centric Instructions via Automated Dimension Scoring
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 395
CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 396
Towards Holistic Modeling for Video Frame Interpolation with Auto-regressive Diffusion Transformers
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 397
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 398
Towards High-resolution and Disentangled Reference-based Sketch Colorization
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 399
MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 400
Layer-wise Instance Binding for Regional and Occlusion Control in Text-to-Image Diffusion Transformers
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 401
Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 402
COT-FM: Cluster-wise Optimal Transport Flow Matching
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 403
Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion Transformers
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 404
Guiding a Diffusion Transformer with the Internal Dynamics of Itself
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 405
CoopDiff: A Diffusion-Guided Approach for Cooperation under Corruptions
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 406
RARE: Learn to RAnk and REtrieve for Monocular 3D Object Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 407
COG: Confidence-aware Optimal Geometric Correspondence for Unsupervised Single-reference Novel Object Pose Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 408
Learnability-Driven Submodular Optimization for Active Roadside 3D Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 409
Look Before You Fuse: 2D-Guided Cross-Modal Alignment for Robust 3D Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 410
Long-SCOPE: Fully Sparse Long-Range Cooperative 3D Perception
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 411
Dynamics-Aware Preference Optimization for Vision-Language Models
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 413
Learning What Helps: Task-Aligned Context Selection for Vision Tasks
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 415
NeuroRule: Bridging Vision and Logic with Differentiable Rule Induction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 416
Beyond Graph Model: Reliable VLM Fine-Tuning via Random Graph Adapter
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 417
Ego: Embedding-Guided Personalization of Vision-Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 418
JoPPO: Hierarchical Photography Assessment via Contrastive Joint Conditional Probabilistic Reinforcement Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 419
AeroAgent: A Vision–Physics–Decision Framework for Aerodynamic Vehicle Design
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 420
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 421
Prune Wisely, Reconstruct Sharply: Compact 3D Gaussian Splatting via Adaptive Pruning and Difference-of-Gaussian Primitives
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 422
MSCD-GS: Motion-Separated Cooperative Deblurring Dynamic Reconstruction via Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 423
P2GS: Physical Prior-guided Gaussian Splatting for Photometrically Consistent Urban Reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 424
iSplat: Iterative Learning for Fine-Grained Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 425
Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 426
MAPo: Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 427
FreeArtGS: Articulated Gaussian Splatting Under Free-moving Scenario
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 428
HeroGS: Hierarchical Guidance for Robust 3D Gaussian Splatting under Sparse Views
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 429
SharpTimeGS: Sharp and Stable Dynamic Gaussian Splatting via Lifespan Modulation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 430
Physically Inspired Gaussian Splatting for HDR Novel View Synthesis
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 431
PhysIR-Splat: Physically Consistent Thermal Infrared Radiative Transfer in 3D Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 432
4C4D: 4 Camera 4D Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 433
SplatSuRe: Selective Super-Resolution for Multi-view Consistent 3D Gaussian Splatting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 434
GaussianZoom: Progressive Zoom-in Generative 3D Gaussian Splatting with Geometric and Semantic Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 435
MotionScale: Reconstructing Appearance, Geometry, and Motion of Dynamic Scenes with Scalable 4D Gaussian Splatting
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 436
PRIMU: Uncertainty Estimation for Novel Views in Gaussian Splatting from Primitive-Based Representations of Error and Coverage
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 437
TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 438
Disco-GS: Gaussian Splatting in Dynamic Color Lighting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 439
ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 440
GuardTrace-VL: Detecting Unsafe Multimodel Reasoning via Iterative Safety Supervision
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 441
AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 442
See It, Say It, Sorted: An Iterative Training-Free Framework for Visually-Grounded Multimodal Reasoning in LVLMs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 443
Will Multimodal Models Be Dazzled by Multi-Image Visual Puzzles?
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 444
GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 445
Visual Grounding for Object Questions
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 446
CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal Reasoning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 447
What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 448
Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 449
Stable and Efficient Single-Rollout RL for Multimodal Reasoning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 450
Revisiting the Necessity of Lengthy Chain-of-Thought in Vision-centric Reasoning Generalization
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 451
Monet: Reasoning in Latent Visual Space Beyond Image and Language
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 452
STAR-R1: Multi-View Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 453
From Where Things Are to What They Are For: Benchmarking Spatial–Functional Intelligence in Multimodal LLMs
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 454
Deeper Thought, Weaker Aim: Understanding and Mitigating Perceptual Impairment during Reasoning in Multimodal Large Language Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 455
S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 456
OneSparse: A Unified Framework for Sparse Activation Layers in Vision Models
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 457
What Matters in Practical Learned Image Compression
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 458
BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 460
LazyVAR: Accelerating Visual Autoregressive Models via Scale-wise Token Pruning and Parallel Group Decoding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 461
Spk2VidNet: A Hierarchical Recurrent Architecture for High-Fidelity Video Reconstruction from Long Spike-Camera Streams
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 462
Adaptive Learned Image Compression with Graph Neural Networks
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 463
SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 464
VVS: Accelerating Speculative Decoding for Visual Autoregressive Generation via Partial Verification Skipping
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 465
HypeVPR: Exploring Hyperbolic Space for Perspective to Equirectangular Visual Place Recognition
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 467
CoLoR: The Devil is in Scene Coordinate Regression for Large-Scale Visual Localization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 468
Affine Perspective-Three-Point Problem
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 469
Sky2Ground: A Benchmark for Site Modeling under Varying Altitude
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 470
SemanticVLA: Towards Semantic Reasoning over Action Memorization via Synergistic Explicit Trace and Latent Action Planning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 471
WebGym: Scaling Training Environments for Long-Horizon Visual Web Agents with Realistic Tasks
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 472
Beyond Perceptual Shortcuts: Causal-Inspired Debiasing Optimization for Generalizable Video Reasoning in Lightweight MLLMs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 473
APPO: Attention-guided Perception Policy Optimization for Video Reasoning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 474
RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 475
EVA: Efficient Reinforcement Learning for End-to-End Video Agent
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 476
Visual Document Understanding and Reasoning: A Multi-Agent Collaboration Framework with Agent-Wise Adaptive Test-Time Scaling
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 477
GazeOnce360: Fisheye-Based 360° Multi-Person Gaze Estimation with Global–Local Feature Fusion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 478
Bridging Human Evaluation to Infrared and Visible Image Fusion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 479
Beyond Strict Pairing: Arbitrarily Paired Training for High-Performance Infrared and Visible Image Fusion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 480
Semantic-Adaptive Diffusion for Dynamic Spatiotemporal Fusion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 481
Bayesian Decomposition and Semantic Completion for Few-shot Semantic Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 482
From Few-way to Many-way: Rethinking Few-shot Fine-grained Image Classification
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 484
Selective, Regularized, and Calibrated: Harnessing Vision Foundation Models for Cross-Domain Few-Shot Semantic Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 485
FlowComposer: Composable Flows for Compositional Zero-Shot Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 486
ManifoldGD: Training-Free Hierarchical Manifold Guidance for Diffusion-Based Dataset Distillation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 487
DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 488
UniRain: Unified Image Deraining with RAG-based Dataset Distillation and Multi-objective Reweighted Optimization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 489
Leveraging Multispectral Sensors for Color Correction in Mobile Cameras
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 490
Differentiable Adaptive 4D Structured Illumination for Joint Capture of Shape and Reflectance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 491
Optical Diffraction-based Convolution for Semiconductor Lithography
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 492
GSNR: Graph Smooth Null-Space Representation for Inverse Problems
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 493
MatE: Material Extraction from Single-Image via Geometric Prior
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 494
αMatte4K & µMatting: Dataset and Model for Ultra-Micro Precision Alpha Video Matting
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 495
Revisiting Optimal Coding for I-ToF under Practical Sensor Constraints
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 496
Dynamic Black-hole Emission Tomography with Physics-informed Neural Fields
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 497
Exploring Spatiotemporal Feature Propagation for Video-Level Compressive Spectral Reconstruction: Dataset, Model and Benchmark
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 499
SAR2Net: Learning Spatially Anchored Representations for Retrieval-Guided Cross-Stain Alignment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 500
Advancing Cancer Prognosis with Hierarchical Fusion of Genomic, Proteomic and Pathology Imaging Data from a Systems Biology Perspective
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 501
PromptStereo: Zero-Shot Stereo Matching via Structure and Motion Prompts
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 502
Any Resolution Any Geometry: From Multi-View To Multi-Patch
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 503
Paparazzo: Active Mapping of Moving 3D Objects
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 504
DepthFocus: Controllable Depth Estimation for See-Through Scenes
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 505
OVI-MAP: Open-Vocabulary Instance-Semantic Mapping
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 506
PTC-Depth: Pose-Refined Monocular Depth Estimation with Temporal Consistency
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 507
SceneScribe-1M: A Large-Scale Video Dataset with Comprehensive Geometric and Semantic Annotations
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 508
Omni-3DEdit: Generalized Versatile 3D Editing in One-Pass
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 510
Variational Graph-based Normal Integration
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 511
Vinedresser3D: Towards Agentic Text-guided 3D Editing
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 512
MV2UV: Generating High-quality UV Texture Maps with Multiview Prompts
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 513
Learning Hierarchical Hyperbolic Mixture Model for Part-aware 3D Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 514
MeshRipple: Structured Autoregressive Generation of Artist-Meshes
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 515
FACE: A Face-based Autoregressive Representation for High-Fidelity and Efficient Mesh Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 516
Easy3E: Feed-Forward 3D Asset Editing via Rectified Voxel Flow
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 517
CUPID: Generative 3D Reconstruction via Joint Object and Pose Modeling
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 518
3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 519
DRM: Diffusion-based Reward Model With Step-wise Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 520
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 521
VA-π: Variational Policy Alignment for Pixel-Aware Autoregressive Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 522
SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 523
AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 524
Style-GRPO: Semantic-Aware Preference Optimization for Image Style Transfer Guided by Reward Modeling
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 525
LAMP: Language-Assisted Motion Planning for Controllable Video Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 526
Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 527
Spectral Scalpel: Amplifying Adjacent Action Discrepancy via Frequency-Selective Filtering for Skeleton-Based Action Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 528
DETACH : Decomposed Spatio-Temporal Alignment for Exocentric Video and Ambient Sensors with Staged Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 529
Learning a Unified Latent Action Space from Videos with Action-centric Cycle Consistency
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 530
VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 531
BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 533
Spherical Leech Quantization for Visual Tokenization and Generation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 534
MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 535
GR-Gauge: Cost-efficient Training Configuration By Gauging the Gradient Redundancy
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 536
E^2-SCI: Elastic Edge–Cloud Speculative Decoding via Credit Inertia
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 537
HyperNAS: Enhancing Architecture Representation for NAS Predictor via Hypernetwork
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 538
NeuroFlow: Toward Unified Visual Encoding and Decoding from Neural Activity
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 539
Spectral Conformal Risk Control: Distribution-Free Tail Guarantees via Bayesian Quadrature
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 540
Edge-RecViT: Efficient Vision Transformer via Semantic-Refined Dynamic Recursion
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 541
ERMoE: Eigen-Reparameterized Mixture-of-Experts for Stable Routing and Interpretable Specialization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 542
GUI-SAGE: Enhancing GUI Automation with Self-Explanatory Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 543
GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 544
HiconAgent: History Context-aware Policy Optimization for GUI Agents
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 545
PET-DINO: Unifying Visual Cues into Grounding DINO with Prompt-Enriched Training
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 546
SDDF: Specificity-Driven Dynamic Focusing for Open-Vocabulary Camouflaged Object Detection
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 549
Prompt-Free Universal Region Proposal Network
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 551
PaNDaS: Learnable Shape Interpolation Modeling with Localized Control
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 552
Hilbert Curve-Based Attention Enabling Topology-Preserving Image Tensor Representation for Semantic Segmentation Network
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 553
Towards High-Quality Image Segmentation: Improving Topology Accuracy by Penalizing Neighbor Pixels
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 554
SAGE: Style-Adaptive Generalization for Privacy-Constrained Semantic Segmentation Across Domains
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 555
Better than Average: Spatially-Aware Aggregation of Segmentation Uncertainty Improves Downstream Performance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 556
Universal 3D Shape Matching via Coarse-to-Fine Language Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 557
Direct Segmentation without Logits Optimization for Training-Free Open-Vocabulary Semantic Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 558
CDICS: Delving Into Fine-Grained Attribute for In-Context Segmentation via Compositional Prompts and Phased Decoupling
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 559
Discriminative Perception via Anchored Description for Reasoning Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 560
SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 561
Cross-Scale Pansharpening via ScaleFormer and the PanScale Benchmark
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 562
CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 563
Multigrain-aware Semantic Prototype Scanning and Tri-Token Prompt Learning Embraced High-Order RWKV for Pan-Sharpening
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 564
ACPV-Net: All-Class Polygonal Vectorization for Seamless Vector Map Generation from Aerial Imagery
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 565
Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 566
Rotation Invariant and Symmetry Aware Pixel Difference Network for Remote Sensing Object Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 567
F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 568
RoadGIE: Towards A Global-Scale Aerial Benchmark for Generalizable Interactive Road Extraction
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 569
PGA: Prior-free Generative Attack for Practical No-box Scenario
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 570
Lipschitz Optimization for Formal Verification of Homographies
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 571
Batman: Benign Knowledge Alignment Through Malicious Null Space in Federated Backdoor Attack
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 572
Out of Sight, Out of Track: Adversarial Attacks on Propagation-based Multi-Object Trackers via Query State Manipulation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 573
Eliminate Distance Differences Induced by Backdoor Attacks: Layer-Selective Training and Clipping to Mask Backdoor Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 574
Mitigating Error Amplification in Fast Adversarial Training
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 575
Physical Adversarial Clothing Evades Visible-Thermal Detectors via Non-Overlapping RGB-T Pattern
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 576
What Your Features Reveal: Data-Efficient Black-Box Feature Inversion Attack for Split DNNs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 577
Exposing Functional Fusion: A New Class of Strategic Backdoor in Dynamic Prompt Architectures
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 578
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 579
Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 580
FM-Steer: Enhance Generalist Policies with Value-Guided Cascaded Denoising
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 581
Bootstrap Dynamic-Aware 3D Visual Representation for Scalable Robot Learning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 582
Visual Sim-to-Real at Scale for Humanoid Loco-Manipulation
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 583
Contact-Aware Neural Dynamics
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 584
AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 585
UAST: Unified Active Search and Tracking for Arbitrary Targets with UAVs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 586
SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 587
Visual-RRT: Finding Paths toward Visual-Goals via Differentiable Rendering
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 588
Cross-Hand Latent Representation for Vision-Language-Action Models
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 589
Beyond Success: Refining Elegant Robot Manipulation from Mixed-Quality Data via Just-in-Time Intervention
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 590
Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 591
GeoPredict: Leveraging Predictive Kinematics and 3D Gaussian Geometry for Precise VLA Manipulation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 592
From Manuals to Actions: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 594
Rethinking Occlusion Modeling for UAV Tracking
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 595
Adaptive Capacity Autoregressive Visual Tracking
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 596
Spatio-Temporal Conditional Denoising Transformer for Modality-Missing RGBT Tracking
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 597
Breaking Smooth-Motion Assumptions: A UAV Benchmark for Multi-Object Tracking in Complex and Adverse Conditions
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 598
TrackMAE: Video Representation Learning via Track Mask and Predict
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 599
Dual-branch Distilled Transformer for Efficient Asymmetric UAV Tracking
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 600
Multi-view Crowd Tracking Transformer with View-Ground Interactions Under Large Real-World Scenes
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 601
Scaling Self-Supervised and Cross-Modal Pretraining for Volumetric CT Transformers
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 602
MuViT: Multi-Resolution Vision Transformers for Learning Across Scales in Microscopy
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 603
SemVideo: Reconstructs What You Watch from Brain Activity via Hierarchical Semantic Guidance
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 604
Multimodal Causality-Driven Representation Learning for Generalizable Medical Image Segmentation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 605
Simple Agents Outperform Experts in Biomedical Imaging Workflow Optimization
[
Slides]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 606
TopoSlide: Topologically-Informed Histopathology Whole Slide Image Representation Learning
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 607
Beyond the Static-World: Lifelong Learning for All-in-One Medical Image Restoration
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 608
Hyperbolic Relational Prompts for Intersectional Fairness in Medical VLMs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 609
RNED: Rotary Number Encoding and Decoding for Quantitative Medical VLM Analysis
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 610
MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 611
Learning Generalizable 3D Medical Image Representations from Mask-Guided Self-Supervision
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 612
BiOTPrompt: Bidirectional Optimal Transport Guided Prompting for Disease Evolution-aware Radiology Report Generation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 613
Learning to See Through a Baby’s Eyes: Early Visual Diets Enable Robust Visual Intelligence in Humans and Machines
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 614
UDAPose: Unsupervised Domain Adaptation for Low-Light Human Pose Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 615
Enhancing Accuracy of Uncertainty Estimation in Appearance-based Gaze Tracking with Probabilistic Evaluation and Calibration
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 616
SCAPO: Self-Supervised Category-Level Articulated Pose Estimation from a Single 3D Observation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 617
Composite-Attribute Person Re-Identification via Pose-Guided Disentanglement
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 618
Representing 3D Faces with Learnable B-Spline Volumes
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 619
RHINO: Reconstructing Human Interactions with Novel Objects from Monocular Videos
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 620
HumanBA: Human-Aware Bundle Adjustment via Global Human-Camera Decoupling
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 621
HamiPose: Hamiltonian Optimization for Unsupervised Domain Adaptive Pose Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 622
KASALv2: Fully Automatic 3D Rotational Symmetry Classification and Axis Localization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 623
AnyLift: Scaling Motion Reconstruction from Internet Videos via 2D Diffusion
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 624
Active Inference for Micro-Gesture Recognition: EFE-Guided Temporal Sampling and Adaptive Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 625
ArtPro: Self-Supervised Articulated Object Reconstruction with Adaptive Integration of Mobility Proposals
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 626
Similarity-Consistent Likelihood Diffusion enables Hidden Person Detection from Wall Reflections
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 627
VLM-Guided Group Preference Alignment for Diffusion-based Human Mesh Recovery
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 628
Occluded Human Body Capture with Frequency Domain Denoising Prior
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 629
ResiHMR: Residual-Limb Aware Single-Image 3D Human Mesh Recovery for Individuals with Limb Loss
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 630
OnlineHMR: Video-based Online World-Grounded Human Mesh Recovery
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 631
MimiCAT: Mimic with Correspondence-Aware Cascade-Transformer for Category-Free 3D Pose Transfer
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 632
Exploring Adaptive Masked Reconstruction for Self-Supervised Skeleton-Based Action Recognition
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 633
DFD-HR: Generalizable Deepfake Detection via Hierarchical Routing Learning
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 634
MGDHand: Multi-Granularity Prior-to-Inertial Distillation Framework for Sequential 3D Hand Pose Estimation from Sparse IMUs
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 635
CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 636
E-3DPSM: A State Machine for Event-based Egocentric 3D Human Pose Estimation
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 637
Bézier Degradation Modeling for LiDAR-based Human Motion Capture
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 638
UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 639
Illumination-Consistent Human-Scene Reconstruction from Monocular Video
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 640
Attribution as Retrieval: Model-Agnostic AI-Generated Image Attribution
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 641
Agent4FaceForgery: Multi-Agent LLM Framework for Realistic Face Forgery Detection
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 642
Enabling Supervised Learning of Generative Signatures for Generalized Synthetic Image Detection
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 643
DiffusionFF: A Diffusion-based Framework for Joint Face Forgery Detection and Fine-Grained Artifact Localization
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 644
All in One: Unifying Deepfake Detection, Tampering Localization, and Source Tracing with a Robust Landmark-Identity Watermark
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 645
Towards an Incremental Unified Multimodal Anomaly Detection: Augmenting Multimodal Denoising From an Information Bottleneck Perspective
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 646
AG-VAS: Anchor-Guided Zero-Shot Visual Anomaly Segmentation with Large Multimodal Models
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 647
Dual-Prototype-Guided Multi-task Learning for Unsupervised Anomaly Detection and Classification
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 648
The Road Less Seen: Segment Exploration for Weakly Supervised Video Anomaly Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 649
Omni-AD: A Large-scale and Versatile Benchmark for Industrial Anomaly Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 650
Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 651
Complementary Prototype Mapping for Efficient Multimodal Anomaly Detection
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 652
LiDAS: Lighting-driven Dynamic Active Sensing for Nighttime Perception
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 653
Gau-Occ: Geometry-Completed Gaussians for Multi-Modal 3D Occupancy Prediction
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 654
OpenVO: Open-World Visual Odometry with Temporal Dynamics Awareness
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 655
An Instance-Centric Panoptic Occupancy Prediction Benchmark for Autonomous Driving
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 656
OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera
[
Poster]
Poster
Fri Jun 05 03:00 PM -- 05:00 PM (PDT) @ ExHall A & F 657
ProOOD: Prototype-Guided Out-of-Distribution 3D Occupancy Prediction
[
Poster]
Successful Page Load