Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 1
Breaking Semantic Boundaries: Distribution-Guided Semantic Exploration for Creative Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 2
Guiding a Diffusion Model by Swapping Its Tokens
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 3
PixelDiT: Pixel Diffusion Transformers for Image Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 5
SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 6
Streaming Diffusion Model for Fast Infrared and Visible Video Fusion
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 7
ComPose: A Unified Completion-Pose Framework for Robust Category-Level Object Pose Estimation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 8
CoSMo3D: Open-World Promptable 3D Semantic Segmentation through LLM-Guided Canonical Spatial Modeling
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 9
GeoViS: Geospatially Rewarded Visual Search for Remote Sensing Visual Grounding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 10
RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 11
S^2AM3D: Scale-controllable Part Segmentation of 3D Point Clouds
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 12
Scalable Multi-View Subspace Clustering with Tensorized Anchor Guidance
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 13
3D-LATTE: Latent Space 3D Editing from Textual Instructions
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 14
AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned Flows
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 15
ChordEdit: One-Step Low-Energy Transport for Image Editing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 16
Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 17
Native and Compact Structured Latents for 3D Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 18
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 19
Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 20
FINER: MLLMs Hallucinate under Fine-grained Negative Queries
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 21
MDCS-MoAME: Multi-directional Composite Scanning with Mixture of Attention and Mamba Experts for Cancer Survival Prediction
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 22
PAS: A Training-Free Stabilizer for Temporal Encoding in Video LLMs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 23
PAVAS: Physics-Aware Video-to-Audio Synthesis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 24
ProPhy: Progressive Physical Alignment for Dynamic World Simulation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 25
V-DPM: 4D Video Reconstruction with Dynamic Point Maps
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 26
Registration-Free Learnable Multi-View Capture of Faces in Dense Semantic Correspondence
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 27
Mesh4D: 4D Mesh Reconstruction and Tracking from Monocular Video
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 28
SPE-MVS: Spatial Position Encoding Enhanced Multi-View Stereo with Monocular Depth Priors
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 29
Block-Sparse Global Attention for Efficient Multi-View Geometry Transformers
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 30
SMVRT: Implicit Human 3D Modeling Using Sparse Multi-View Volumetric Reconstruction with Transformer Fusion
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 31
LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 32
Any4D: Unified Feed-Forward Metric 4D Reconstruction
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 33
Co-Me: Confidence Guided Token Merging for Visual Geometric Transformers
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 34
Point4Cast: Streaming Dynamic Scene Reconstruction and Forecasting
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 35
AMB3R: Accurate Feed-forward Metric-scale 3D Reconstruction with Backend
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 36
AlignPose: Generalizable 6D Pose Estimation via Multi-view Feature-metric Alignment
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 37
Parallelised Differentiable Straightest Geodesics for 3D Meshes
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 38
Geometry-Aligned and Anomaly-Aware Reconstruction for 3D Anomaly Detection
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 39
DVGT: Driving Visual Geometry Transformer
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 41
MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 42
Foundation Encoders Are All You Need for Preference-Aware Personalization
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 43
Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 44
ThinkGen: Generalized Thinking for Visual Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 45
CoLoGen: Progressive Learning of Concept–Localization Duality for Unified Image Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 46
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 47
When Safety Collides: Resolving Multi-Category Harmful Conflicts in Text-to-Image Diffusion via Adaptive Safety Guidance
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 48
PSR: Scaling Multi-Subject Personalized Image Generation with Pairwise Subject-Consistency Rewards
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 49
HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 50
Multimodal Semantic Bias Mitigation for Diverse Text-To-3D Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 51
Visual Personalization Turing Test
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 52
Composing Concepts from Images and Videos via Concept-prompt Binding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 53
Less is More: Data-Efficient Adaptation for Controllable Text-to-Video Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 54
Semantic Derivative Flow: Graph-Guided Diffusion for Controllable Instance Interactions
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 55
Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 56
Hierarchical Enhancement of Semantic Priors for Disentangled Text-Driven Motion Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 57
Simpleposter: A Simple Baseline For Product Poster Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 58
Prompt Yourself: Awakening Textual Semantics in 1D Visual Tokenizers
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 59
SkyReels-Text: Fine-Grained Font-Controllable Text Editing for Poster Design
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 60
Image Generation from Contextually-Contradictory Prompts
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 61
PromptEnhancer: Taming Your Rewriter for Text-to-Image Generation via Fine-Grained Reward
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 62
Aligning Text, Images and 3D Structure Token-by-Token
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 63
RefTon: Reference person shot assist virtual Try-on
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 64
GaussianVision: Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 65
Copy-Transform-Paste: Zero-Shot Object-Object Alignment Guided by Vision-Language and Geometric Constraints
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 66
Gravitation-Driven Semantic Alignment for Text Video Retrieval
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 67
MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 68
M^3KG-RAG: Multi-hop Multimodal Knowledge Graph-enhanced Retrieval-Augmented Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 69
Evolutionary Multimodal Reasoning via Hierarchical Semantic Representation for Intent Recognition
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 70
ReFAct: Empowering Multimodal Web Agents with Visual and Context Focusing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 72
MR-RAG: Multimodal Relevance-Aware Retrieval-Augmented Generation for Medical Visual Question Answering
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 73
Decoupling Stability and Plasticity for Multi-Modal Test-Time Adaptation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 74
CUE: Concept-Aware Multi-Label Expansion to Mitigate Concept Confusion in Long-Tailed Learning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 75
Energy Waveify and Redistribution for Test-Time Adaptation: A Control System Perspective
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 76
CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Object Detection
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 77
CoFiDA-M: Concept-Aware Feature Modulation for Cross-Domain Adaptation with Image-Only Inference
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 78
Towards Multimodal Domain Generalization with Few Labels
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 79
Reclaiming Lost Text Layers for Source-Free Cross-Domain Few-Shot Learning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 80
Event6D: Event-based Novel Object 6D Pose Tracking
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 81
EV-CGNet: Co-visible Focused 3D-guided 2D Event Keypoint Detection Network
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 82
AE2VID: Event-based Video Reconstruction via Aperture Modulation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 83
From Contrast to Consistency: Rethinking Event-based Continuous-Time Optical Flow Estimation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 84
Spike-driven Discrete Aggregation for Event-based Object Detection
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 85
x^2-Fusion: Cross-Modality and Cross-Dimension Flow Estimation in Event Edge Space
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 86
FloVerse: Floor Plan-Guided Multi-Modal Navigation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 87
TrajRAG: Retrieving Geometric-Semantic Experience for Zero-Shot Object Navigation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 88
History to Future: Evolving Agent with Experience and Thought for Zero-shot Vision-and-Language Navigation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 89
DreamSAC: Learning Hamiltonian World Models via Symmetry Exploration
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 90
Beyond Scanpaths: Graph-Based Gaze Simulation in Dynamic Scenes
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 91
CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 92
Rethinking Visual Rearrangement from A Diffusion Perspective
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 93
APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 94
Bridging the 2D-3D Gap: A Hierarchical Semantic-Geometric Map for Vision Language Navigation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 95
InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 96
When Robots Should Say "I Don’t Know": Benchmarking Abstention in Embodied Question Answering
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 97
RoboAgent: Chaining Basic Capabilities for Embodied Task Planning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 98
Towards Training-free Scene Text Editing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 99
VINS-120K: Ultra High-Resolution Image Editing with A Large-Scale Dataset
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 100
ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 101
Charge: A Comprehensive Novel View Synthesis Benchmark and Dataset to Bind Them All
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 102
Region-Wise Correspondence Prediction between Manga Line Art Images
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 103
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 104
I2I-Bench: A Comprehensive Benchmark Suite for Image-to-Image Editing Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 105
TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 106
Hermite Radial Basis Function for Surface Reconstruction via Differentiable Rendering
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 107
RF4D: Neural Radar Fields for Novel View Synthesis in Outdoor Dynamic Scenes
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 108
Voxify3D: Pixel Art Meets Volumetric Rendering
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 109
Node-RF: Learning Generalized Continuous Space-Time Scene Dynamics with Neural ODE-based NeRFs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 110
FluidGaussian: Propagating Simulation-Based Uncertainty Toward Functionally-Intelligent 3D Reconstruction
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 111
GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 112
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 113
Turbo-GS: Accelerating 3D Gaussian Fitting for High-Resolution Radiance Fields
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 114
BiProLoRA: Bilevel Prompt LoRA for Real Scene Recovery
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 115
Degradation-Consistent Test-Time Adaptation for All-in-One Image Restoration
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 116
CanonCGT: Reference-Based Color Grading via Canonical Pivot Representation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 117
2-Shots in the Dark: Low-Light Denoising with Minimal Data Acquisition
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 118
Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 119
It Takes Two: A Duet of Periodicity and Directionality for Burst Flicker Removal
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 120
Scan Clusters, Not Pixels: A Cluster-Centric Paradigm for Efficient Ultra-high-definition Image Restoration
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 121
Seeing Beyond 8bits: Subjective and Objective Quality Assessment of HDR-UGC Videos
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 122
Dynamic Exposure Burst Image Restoration
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 123
FAPE-IR: Frequency-Aware Planning and Execution Framework for All-in-One Image Restoration
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 124
ColorFLUX: A Structure-Color Decoupling Framework for Old Photo Colorization
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 125
VEMamba: Efficient Isotropic Reconstruction of Volume Electron Microscopy with Axial-Lateral Consistent Mamba
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 126
Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 127
EMGauss: Continuous Slice-to-3D Reconstruction via Dynamic Gaussian Modeling in Volume Electron Microscopy
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 128
Underground Plant Exploration: Non-Destructive 3D Root Assessment with GPR Based on Point Graph Neural Network
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 129
Uni-Encoder Meets Multi-Encoders: Representation Before Fusion for Brain Tumor Segmentation with Missing Modalities
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 130
MicroFM: Physics-guided Flow Matching for Isotropic Microscopy Reconstruction
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 131
Dynamic Stream Network for Combinatorial Explosion Problem in Deformable Medical Image Registration
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 133
Towards Robust Vision Transformers: Path Dependency Analysis and a Simple Two-Stage Adversarial Training
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 135
When CLIP Sees More, It Fights Back Harder: Multi-View Guided Adaptive Counterattacks for Test-Time Adversarial Robustness
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 136
Hidden Dangers of Compositional Generation: Diagnosing Semantic Safety Failures in Text-to-Image Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 137
VisiLock: Authorizing Instruction-based Image editing with Dual Score Distillation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 138
JANUS: A Lightweight Framework for Jailbreaking Text-to-Image Models via Distribution Optimization
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 139
GenBreak: Red Teaming Text-to-Image Generation Using Large Language Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 140
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 141
Generate, Analyze, and Refine: Training-Free Sound Source Localization via MLLM Meta-Reasoning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 142
MMCP-GEN: A Modality-Extensible Diffusion Language Model for Conditional Protein Sequence Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 143
Few-shot Acoustic Synthesis with Multimodal Flow Matching
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 144
CLIP-like Model as a Foundational Density Ratio Estimator
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 145
Learning What Matters: Prioritized Concept Learning via Relative Error-driven Sample Selection
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 146
EgoAVU: Egocentric Audio-Visual Understanding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 147
Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 148
Multimodal Protein Language Models for Enzyme Kinetic Parameters: From Substrate Recognition to Conformational Adaptation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 149
Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 150
Adaptive Confidence Regularization for Multimodal Failure Detection
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 151
Factorize, Reconstruct, Enhance: A Unified Framework for Multimodal Sentiment Analysis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 152
PhenoYieldNet: Learning Crop-Aware Phenological Responses for Multi-Crop Yield Prediction
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 153
Conflict-Aware Adaptive Cross-Reconstruction for Multimodal Sentiment Analysis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 154
EduDiag: A Benchmark for Educational Diagnostic Reasoning with Error Tracing and Correction on Large Multimodal Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 156
Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 157
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 158
Cross-Modal Guided Visual Synthesis for Data-Efficient Multimodal Depression Recognition
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 159
AffordGrasp: Cross-Modal Diffusion for Affordance-Aware Grasp Synthesis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 160
PAM: A Pose–Appearance–Motion Engine for Sim-to-Real HOI Video Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 161
AffordGen: Generating Diverse Demonstrations for Generalizable Object Manipulation with Affordance Correspondence
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 162
HandWorld: Hand-Centric Unified Video Action Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 163
HVG-3D: Bridging Real and Simulation Domains for 3D-Conditional Hand-Object Interaction Video Synthesis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 164
ArtHOI: Taming Foundation Models for Monocular 4D Reconstruction of Hand-Articulated-Object Interactions
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 165
LAM: Language Articulated Object Modelers
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 166
Haptic Neural Fields: Bringing Tactile Interactions to 3D Rendered Scenes
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 167
Open-world Hand-Object Interaction Video Generation Based on Structure and Contact-aware Representation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 168
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 169
From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 170
Temporal Equilibrium MeanFlow: Bridging the Scale Gap for One-Step Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 171
PROMO: Promptable Outfitting for Efficient High-Fidelity Virtual Try-On
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 172
Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 173
UniSER: A Foundation Model for Unified Soft Effects Removal
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 174
EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 175
Inference-time Physics Alignment of Video Generative Models with Latent World Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 176
SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 177
Plenoptic Video Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 178
PyramidalWan: On Making Pretrained Video Model Pyramidal for Efficient Inference
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 179
AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 180
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 181
Flowception: Temporally Expansive Flow Matching for Video Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 182
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 183
Linear Image Generation by Synthesizing Exposure Brackets
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 184
Low-Resolution Editing is All You Need for High-Resolution Editing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 185
UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 186
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 187
VENI: Variational Encoder for Natural Illumination
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 188
SketchAssist: A Practical Assistant for Semantic Edits and Precise Local Redrawing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 189
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 190
MoCha: End-to-End Video Character Replacement without Structural Guidance
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 191
Negative Binomial Variational Autoencoders for Overdispersed Latent Modeling
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 193
VOSR: A Vision-Only Generative Model for Image Super-Resolution
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 194
Dual Graph Regularized Deep Unfolding Network for Guided Depth Map Super-resolution
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 195
DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 197
Gradient Knows Best: Mixed-Precision Quantization via Gradient-Guided Bit Allocation for Super-Resolution
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 198
Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark Dataset
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 199
Next-Scale Autoregressive Models for Text-to-Motion Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 200
Push-and-Step: From RL-Based Balance Recovery to Physical Simulation of Dense Crowds
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 201
Iterative Closed-Loop Motion Synthesis for Scaling the Capabilities of Humanoid Control
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 202
RoMo: A Large-Scale, Richly Organized Dataset and Semantic Taxonomy for Human Motion Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 203
FrankenMotion: Part-level Human Motion Generation and Composition
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 204
HSI-GPT2: A Dual-Granularity Large Motion Reasoning Model with Diffusion Refinement for Human–Scene Interaction
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 205
SceMoS: Scene-Aware 3D Human Motion Synthesis by Planning with Geometry-Grounded Tokens
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 206
Progressive Guessing to Fixed Point: Rethinking Human Motion Prediction with Deep Equilibrium Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 207
Archon: A Unified Multimodal Model for Holistic Digital Human Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 208
ReMoGen: Real-time Human Interaction-to-Reaction Generation via Modular Learning from Diverse Data
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 209
Towards Motion Turing Test: Evaluating Human-Likeness in Humanoid Robots
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 210
PatchScene: Patch-based Voxel Diffusion Model for Large-Scale Scene Completion
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 211
Prototype-Guided Concept Erasure in Diffusion Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 212
Any2Any 3D Diffusion Models with Knowledge Transfer: A Radiotherapy Planning Study
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 213
CARD: Correlation Aware Restoration with Diffusion
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 214
DMAligner: Enhancing Image Alignment via Diffusion Model Based View Synthesis
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 215
DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 216
Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 219
MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition–Perception–Reasoning Guided Text-Image Machine Translation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 220
M3DocDep: Multi-modal, Multi-page, Multi-document Dependency Chunking with Large Vision-Language Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 221
Towards Policy-Adaptive Image Guardrail: Benchmark and Method
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 222
Flat-Pack Bench: Evaluating Spatio-Temporal Understanding in Large Vision-Language Models through Furniture Assembly
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 223
TextFM: Robust Semi-dense Feature Matching with Language Guidance
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 224
What’s Wrong with Synthetic Data for Scene Text Recognition? A Strong Synthetic Engine with Diverse Simulations and Self-Evolution
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 225
Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 226
SJD-PAC: Accelerating Speculative Jacobi Decoding via Proactive Drafting and Adaptive Continuation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 227
Point Cloud as a Foreign Language for Multi-modal Large Language Model
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 228
Grounded 3D-Aware Spatial Vision-Language Modeling
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 229
SpatialTree: How Spatial Intelligence Branches Out in MLLMs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 230
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 231
Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 232
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 233
BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 235
Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 236
REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 237
From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 238
MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 239
SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 240
ReMatch: Boosting Representation through Matching for Multimodal Retrieval
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 241
RI-Mamba: Rotation-Invariant Mamba for Robust Text-to-Shape Retrieval
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 242
Revisiting F-measure Optimization in Multi-Label Classification: A Sampling-based Approach
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 243
Thinking Beyond Labels: Vocabulary-Free Fine-Grained Recognition using Reasoning-Augmented LMMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 244
WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 245
Modeling the Visual Ambiguity of Human Sketches
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 246
SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 247
ConeSep: Cone-based Robust Noise-Unlearning Compositional Network for Composed Image Retrieval
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 248
V^2-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 249
WeaveTime: Streaming from Earlier Frames into Emergent Memory in VideoLLMs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 250
Streaming Video Crime Anticipation with Spatio-Temporal Causal Reasoning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 251
Efficient Frame Selection for Long Video Understanding via Reinforcement Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 252
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 253
InternVideo-Next: Towards World-Understanding Video Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 254
Condensed Test-Time Adaptation of VLMs for Action Recognition
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 255
Test-time Ego-Exo-centric Adaptation for Action Anticipation via Multi-Label Prototype Growing and Dual-Clue Consistency
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 256
A Stitch in Time: Learning Procedural Workflow via Self-Supervised Plackett–Luce Ranking
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 257
SurgCoT: Advancing Spatiotemporal Reasoning in Surgical Videos through a Chain-of-Thought Benchmark
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 258
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 259
Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 260
Explaining Object Detectors via Collective Contribution of Pixels
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 261
Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 262
H-Sets: Hessian-Guided Discovery of Set-Level Feature Interactions in Image Classifiers
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 263
Evaluating Generative Models via One-Dimensional Code Distributions
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 264
TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 265
BuildAnyPoint: 3D Building Structured Abstraction from Diverse Point Clouds
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 266
LiDAR-to-4DRadar Diffusion Bridge via Cross-Modal Alignment and Translation in Latent Space
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 267
Edges Compete for Trust: Group Relative Edge Optimization for Building Reconstruction from Point Clouds
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 268
Unsupervised Monocular 3D Keypoint Discovery from Multi-View Diffusion Priors
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 269
QD-PCQA: Quality-Aware Domain Adaptation for Point Cloud Quality Assessment
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 270
L3DR: 3D-aware LiDAR Diffusion and Rectification
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 271
Ghost-FWL: A Large-Scale Full-Waveform LiDAR Dataset for Ghost Detection and Removal
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 272
Ghosts in the Point Clouds: De-glaring LiDAR in the Transient Domain
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 273
MS^2Gait: A Multi-Scale Spatio-Temporal Fusion Network for LiDAR-based Gait Recognition
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 275
Learning to Identify Out-of-Distribution Objects for 3D LiDAR Anomaly Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 276
Dual-Level Confidence based Implicit Self-Refinement for Medical Visual Question Answering
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 277
FedMPT: Federated Multi-Label Prompt Tuning of Vision-Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 278
Rethinking Model Selection in VLM Through the Lens of Gromov-Wasserstein Distance
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 279
NTK-Guided Implicit Neural Teaching
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 281
Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 282
Harmonious Parameter Adaptation in Continual Visual Instruction Tuning for Safety-Aligned MLLMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 283
StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 284
Same or Not? Enhancing Visual Perception in Vision-Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 285
Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 286
AssemblyBench: Physics-Aware Assembly of Complex Industrial Objects
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 287
Animator-Centric Skeleton Generation on Objects with Fine-Grained Details
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 288
Synthesizing Visual Concepts as Vision-Language Programs
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 289
Self-Consistency for LLM-Based Motion Trajectory Generation and Verification
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 290
Semantic Scale Space: A Framework for Controllable Image Abstraction
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 291
Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 292
DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 293
SIF: Semantically In-Distribution Fingerprints for Large Vision-Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 294
Designing to Forget: Deep Semi-parametric Models for Unlearning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 295
Meta-FC: Meta-Learning with Feature Consistency for Robust and Generalizable Watermarking
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 296
PrivSynth: Alternating and Control-Based Optimization for Privacy and Utility in Synthetic Data
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 297
Neighbor-Aware Localized Concept Erasure in Text-to-Image Diffusion Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 298
EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 299
Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 300
A Polynomial Chaos Framework for Causal Discovery in Nonlinear Uncertain Systems
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 302
From Selection to Scheduling: Federated Geometry-Aware Correction Makes Exemplar Replay Work Better under Continual Dynamic Heterogeneity
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 303
Fine-Tuning Impairs the Balancedness of Foundation Models in Long-tailed Personalized Federated Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 304
Few-for-Many Personalized Federated Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 305
ProxyFL: A Proxy-Guided Framework for Federated Semi-Supervised Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 306
Domain Sensitive Federated Learning with Fisher-Informed Pruning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 307
SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 308
Bridging Facial Understanding and Animation via Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 309
AR²-4FV: Anchored Referring and Re-identification for Long-Term Grounding in Fixed-View Videos
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 310
CVA: Context-aware Video-text Alignment for Video Temporal Grounding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 311
OmniGround: A Comprehensive Spatio-Temporal Grounding Benchmark for Real-World Complex Scenarios
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 312
ST4R-Splat: Spatio-Temporal Referring Segmentation in 4D Gaussian Splatting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 314
Rejection Mixing: Fast Semantic Propagation of Mask Tokens for Efficient DLLM Inference
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 315
Towards Unified Human Perception and Machine Understanding: Token Flow Guided Compression Framework
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 316
A More Word-like Image Tokenization for MLLMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 317
DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 318
Unified Spatiotemporal Token Compression for Video-LLMs at Ultra-Low Retention
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 319
One Layer’s Trash is Another Layer’s Treasure: Adaptive Layer-wise Visual Token Selection in LVLMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 320
OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 321
Tunable Soft Equivariance with Guarantees
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 322
Semi-Supervised Conformal Prediction With Unlabeled Nonconformity Score
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 323
Cluster-aware Anchor Learning for Multi-View Clustering
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 324
Revisiting Sparsity Constraint Under High-Rank Property in Partial Multi-Label Learning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 325
Weight Space Representation Learning via Neural Field Adaptation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 326
Recurrent Video Masked Autoencoders
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 327
Revisiting Unknowns: Towards Effective and Efficient Open-Set Active Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 328
Seeing Through the Shift: Causality-Inspired Robust Generalized Category Discovery
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 329
From Exploration to Exploitation: A Two-Stage Entropy RLVR Approach for Noise-Tolerant MLLM Training
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 330
Spatial Retrieval Augmented Autonomous Driving
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 331
Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 332
ColaVLA: Leveraging Cognitive Latent Reasoning for Hierarchical Parallel Trajectory Planning in Autonomous Driving
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 333
CARD: A Multi-Modal Automotive Dataset for Dense 3D Reconstruction in Challenging Road Topography
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 334
MindDriver: Introducing Progressive Multimodal Reasoning for Autonomous Driving
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 335
WPT: World-to-Policy Transfer via Online World Model Distillation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 336
ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 337
Recover to Predict: Progressive Retrospective Learning for Variable-Length Trajectory Prediction
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 338
URScenes: A Multi-scenario Dataset for Unstructured Road Environments
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 339
MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Driving
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 340
SAMosaic3D: Modular Scene Assembly for Real-Time 3D Segment Anything
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 342
MV3DIS: Multi-View Mask Matching via 3D Guides for Zero-Shot 3D Instance Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 343
PEARL: Geometry Aligns Semantics for Training-Free Open-Vocabulary Semantic Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 344
RAVEN: Radar Adaptive Vision Encoders for Efficient Chirp-wise Object Detection and Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 345
SAMIX: Reinforcing SAM2 with Semantic Adapter and Reference Selecting Policy for Mix-Supervised Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 346
MARSS: Radar Semantic Segmentation via Modular Attention and State Space Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 347
MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 348
Exemplar-Free Class Incremental Learning via Preserving Class-Discriminative Structure
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 349
Critical Patch-Aware Sparse Prompting with Decoupled Training for Continual Learning on the Edge
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 350
PACT: Phase-Like Transition Constraints in Adapter-Based Continual Learning of Vision-Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 351
Representation-Steered Incremental Adapter-Tuning for Class-Incremental Learning with Pre-Trained Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 352
Re-evaluating Continual VQA: Toward Fair and Robust Evaluation for Multimodal Continual Learning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 354
Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 355
Beyond Myopic Alignment: Lookahead Optimization for Online Class-Incremental Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 356
EmoDiffTalk: Emotion-aware Diffusion for Editable 3D Gaussian Talking Head
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 357
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 358
D^3FER: Dual Channel and Dual Branch Network for Robust Facial Expression Recognition under Dual Challenges
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 359
HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 360
ExpPortrait: Expressive Portrait Generation via Personalized Representation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 361
PersonaLive! Expressive Portrait Image Animation for Live Streaming
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 362
ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 363
OptiMVMap: Offline Vectorized Map Construction via Optimal Multi-vehicle Perspectives
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 364
CogDriver: Integrating Cognitive Inertia for Temporally Coherent Planning in Autonomous Driving
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 365
TopoHR: Hierarchical Centerline Representation for Cyclic Topology Reasoning in Driving Scenes with Point-to-Instance Relations
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 366
AURA: Multi-modal Shared Autonomy for Urban Navigation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 367
Zero-Shot Reconstruction of Animatable 3D Avatars with Cloth Dynamics from a Single Image
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 368
FlexAvatar: Learning Complete 3D Head Avatars with Partial Supervision
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 369
Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 370
UIKA: Fast Universal Head Avatar from Pose-Free Images
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 371
FlexAvatar: Flexible Large Reconstruction Model for Animatable Gaussian Head Avatars with Detailed Deformation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 372
First Logit Boosting: Visual Grounding Method to Mitigate Object Hallucination in Large Vision-Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 373
Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 374
Envision, Attend, Then Respond: Counterfactual Hallucination Mitigation in Large Vision-Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 375
PAS: Prelim Attention Score for Detecting Object Hallucinations in Large Vision-Language Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 376
MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 377
Fine-Grained Multi Image Object Hallucination Benchmark
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 378
Generative Video Motion Editing with 3D Point Tracks
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 379
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 380
Learning to Generate Highly Dynamic Videos using Synthetic Motion Data
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 381
Stereo World Model: Camera-Guided Stereo Video Generation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 382
CG-Floor: Centroid-Guided Diffusion for Large-Scale Floorplan Generation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 383
MAD: Motion Appearance Decoupling for efficient Driving World Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 384
VDFE: Difference-Aware 3D Scene Editing with Non-Intrusive Video Diffusion Priors for Multi-View Consistency and Efficiency
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 385
Endless World: Real-Time 3D-Aware Long Video Generation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 386
SpatialDiff: 3D-Aware Object Movement via Implicit Spatial Modeling
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 387
Towards Realistic and Consistent Orbital Video Generation via 3D Foundation Priors
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 388
YOLO-ULM: Ultra-Lightweight Models for Real-Time Object Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 389
CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 390
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 391
VLM4RSDet: Collaborative Optimization with Vision-Language Model for Enhancing Remote Sensing Object Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 392
WiTTA-Bench: Benchmarking Test-Time Adaptation for WiFi Sensing
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 393
MFEN: Multi-Frequency Expert Network for Visible-Infrared Person Re-ID
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 394
Object-Generalized Re-Identification: A Step Towards Universal Instance Perception
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 395
When Transformers Meet Mamba: A Hybrid Transformer-Mamba Network for Video Object Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 396
Prompt-Anchored Vision–Text Distillation for Lifelong Person Re-identification
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 397
HyperGait: Unleashing the Power of Parsing for Gait Recognition in the Wild via Hypergraph
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 398
Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 399
Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 400
Beyond Caption-Based Queries in Video Moment Retrieval
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 401
Neural-Centric Video Processing Pipeline for Unified Multi-Task Inference
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 402
VideoRealBench: A Chain-of-Thought Realism Evaluation Benchmark for Generated Human-Centric Videos
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 403
VAST: Video Ability-Stratified Taxonomy for Data-Efficient Video Reasoning
[
Slides]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 404
An Empirical Study on How Video-LLMs Answer Video Questions
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 405
FPSBench: A Benchmark for Video Understanding at High Frame Rates
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 406
UniComp: Rethinking Video Compression Through Informational Uniqueness
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 407
NaTex: Seamless Texture Generation as Latent Color Diffusion
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 408
Your Latent Mask is Wrong: Pixel-Equivalent Latent Compositing for Diffusion Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 409
Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 410
Attribute-Preserving Pseudo-Labeling for Diffusion-Based Face Swapping
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 411
Delta Rectified Flow Sampling for Text-to-Image Editing
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 412
Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 413
SpotEdit: Selective Region Editing in Diffusion Transformers
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 414
All-in-One Slider for Attribute Manipulation in Diffusion Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 415
DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 416
From Sketch to Fresco: Efficient Diffusion Transformer with Progressive Resolution
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 417
CATNet: Collaborative Alignment and Transformation Network for Cooperative Perception
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 418
Scene Reconstruction as Mapping Priors for 3D Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 419
CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 420
Unleashing the Power of Chain-of-Prediction for Monocular 3D Object Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 421
R4Det: 4D Radar-Camera Fusion for High-Performance 3D Object Detection
[
Slides]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 422
Revisiting Token Compression for Accelerating ViT-based Sparse Multi-View 3D Object Detectors
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 423
Few-Shot Incremental 3D Object Detection in Dynamic Indoor Environments
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 424
Learning from Synthetic Data via Provenance-Based Input Gradient Guidance
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 425
Seeing Clearly, Reasoning Confidently: Plug-and-Play Remedies for Vision Language Model Blindness
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 427
R2G: A Multi-View Circuit Graph Benchmark Suite from RTL to GDSII
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 428
VQ-VA World: Towards High-Quality Visual Question-Visual Answering
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 430
Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 431
See Further, Think Deeper: Advancing VLM's Reasoning Ability with Low-level Visual Cues and Reflection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 432
PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 433
μVLM: A Vision Language Model for μNPUs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 434
Gaussian Mapping for Evolving Scenes
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 435
Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 436
AnchorSplat: Feed-Forward 3D Gaussian Splatting With 3D Geometric Priors
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 437
SGAD-SLAM: Splatting Gaussians at Adjusted Depth for Better Radiance Fields in RGBD SLAM
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 438
Faster-GS: Analyzing and Improving Gaussian Splatting Optimization
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 439
Layered 4D-Rotor Gaussian Splatting: A Compressed Representation for Long Dynamic Scenes
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 440
GaussianGrow: Geometry-aware Gaussian Growing from 3D Point Clouds with Text Guidance
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 441
PhysGS: Bayesian-Inferred Gaussian Splatting for Physical Property Estimation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 442
3D Gaussian Splatting at Arbitrary Resolutions with Compact Proxy Anchors
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 443
Stochastic Ray Tracing for the Reconstruction of 3D Gaussian Splatting
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 444
AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 445
GaussianPile: A Unified Sparse Gaussian Splatting Framework for Slice-based Volumetric Reconstruction
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 446
More Natural, More Real: Object-aware Gaussian Splatting for 3D Visual Decoding from Human Brain
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 447
Eulerian Gaussian Splatting using Hashed Probability Pyramids
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 448
Confidence-Guided Multi-Scale Aggregation for Sparse-View High-Resolution 3D Gaussian Splatting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 449
ULF-Loc: Unbiased Landmark Feature for Robust Visual Localization with 3D Gaussian Splatting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 450
Robust3DGSW: Toward Robust Watermarking for Quantization-Aware 3D Gaussian Splatting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 451
ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 452
L^2DGS: Low-Light Dynamic Gaussian Splatting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 453
Probabilistic Concept Graph Reasoning for Multimodal Misinformation Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 454
POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 455
SegCompass: Exploring Interpretable Alignment with Sparse Autoencoders for Enhanced Reasoning Segmentation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 456
CRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 457
DeepScan: A Training-Free Framework for Visually Grounded Reasoning in Large Vision-Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 459
HUMORCHAIN: Theory-Guided Multi-Stage Reasoning for Interpretable Multimodal Humor Generation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 460
CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 461
Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 462
Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 463
Mario: Multimodal Graph Reasoning with Large Language Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 464
Boosting Reasoning in Large Multimodal Models via Activation Replay
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 465
Rationale-Enhanced Decoding for Multi-modal Chain-of-Thought
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 466
Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 467
ROSE: Rotate Your Large Language Model to See
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 468
OpenMMReasoner: Pushing the Frontiers in Multimodal Reasoning with an Open and General Recipe
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 470
Sparsity as a Key: Unlocking New Insights from Latent Structures for Out-of-Distribution Detection
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 471
SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 472
Suppressing Non-Semantic Noise in Masked Image Modeling Representations
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 474
DeDelayed: Deleting Remote Inference Delay via On-Device Correction
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 476
Gaussian Splatting-based Low-Rank Tensor Representation for Multi-Dimensional Image Recovery
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 477
Precise Object and Effect Removal with Adaptive Target-Aware Attention
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 478
Decompose, Mix, Adapt: A Unified Framework for Parameter-Efficient Neural Network Recombination and Compression
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 479
FreqSIC: Frequency-aware Stereo Image Compression with Bi-directional Checkerboard Context Model
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 480
SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 481
Fusion of Depth and Semantics for Probabilistic Floorplan Localization
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 482
A2GC: Asymmetric Aggregation with Geometric Constraints for Locally Aggregated Descriptors
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 483
Geo2: Geometry-Guided Cross-view Geo-Localization and Image Synthesis
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 485
Resolving Evidence Sparsity: Agentic Context Engineering for Long-Document Understanding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 486
Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 487
ORCA: Orchestrated Reasoning with Collaborative Agents for Document Visual Question Answering
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 488
MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 489
A Multi-Agent Perception-Action Alliance for Efficient Long Video Reasoning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 490
Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 491
LensWalk: Agentic Video Understanding by Planning How You See in Videos
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 492
DPGF-Net: Dual-Prior Guided Fusion Network for Joint Assessment of Perceptual Quality and Semantic Consistency in AI-Generated Images
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 493
RegionFuse: Region-Adaptive Pixel Distribution Learning for Infrared and Visible Image Fusion
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 494
Missing No More: Dictionary-Guided Cross-Modal Image Fusion under Missing Infrared
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 495
VideoFusion: A Spatio-Temporal Collaborative Network for Multi-modal Video Fusion
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 496
TAPE: Task-Adaptive Prototype Evolution in Audio-Language Models for Fully Few-shot Class-incremental Audio Classification
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 497
Remedying Target-Domain Astigmatism for Cross-Domain Few-Shot Object Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 499
Hyperbolic Defect Feature Synthesis for Few-Shot Defect Classification
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 500
Training-Only Heterogeneous Image-Patch-Text Graph Supervision for Advancing Few-Shot Learning Adapters
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 501
Learning to Learn Weight Generation via Local Consistency Diffusion
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 502
Balanced Dataset Distillation via Modeling Multiple Visual Pattern Distribution
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 503
Grid Distillation: Compositional Image Distillation via Structured Generative Grids
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 504
Dataset Distillation by Influence Matching
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 505
StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 506
Seeing Through Blur: Tackling Defocus in Spike-Based Imaging
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 507
Distilling Quasi-Conformal Mapping: A Generalizable and Efficient Solution for Wide-Angle Correction
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 508
Lighting in Motion: Spatiotemporal HDR Lighting Estimation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 509
LightRR: A Lightweight Network for Single Image Reflection Removal
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 510
HFR and HDR Video from Multi-Attenuated Spikes Using a Rapidly Rotating SpokeND Filter
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 511
Coded-E2LF: Coded Aperture Light Field Imaging from Events
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 512
TokenLight: Precise Lighting Control in Images using Attribute Tokens
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 513
Kaleidoscopic Scintillation Event Imaging
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 514
gQIR: Generative Quanta Image Reconstruction
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 515
Solving Minimal Problems Without Matrix Inversion Using FFT-Based Interpolation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 516
Predicting Spatial Transcriptomics from Histology Images via High-Order Multi-Cell Interaction Modeling
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 517
From Spots to Pixels: Dense Spatial Gene Expression Prediction from Histology Images
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 519
LightSplat: Fast and Memory-Efficient Open-Vocabulary 3D Scene Understanding in Five Seconds
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 520
Guardians of the Hair: Rescuing Soft Boundaries in Depth, Stereo, and Novel Views
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 521
Zero-Shot Depth Completion with Vision-Language Model
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 522
FE2E: From Editor to Dense Geometry Estimator
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 523
Ego-1K – A Large-Scale Multiview Video Dataset for Egocentric Vision
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 524
Edit-As-Act: Goal-Regressive Planning for Open-Vocabulary 3D Indoor Scene Editing
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 525
VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 526
NI-Tex: Non-isometric Image-based Garment Texture Generation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 527
Velox: Learning Representations of 4D Geometry and Appearance
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 528
UniPixie: Unified and Probabilistic 3D Physics Learning via Flow Matching
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 529
UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 530
Points-to-3D: Structure-Aware 3D Generation with Point Cloud Priors
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 531
PartDiffuser: Part-wise 3D Mesh Generation via Discrete Diffusion
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 532
LoST: Level of Semantics Tokenization for 3D Shapes
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 533
Lafite: A Generative Latent Field for 3D Native Texturing
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 534
Image-Guided Geometric Stylization of 3D Meshes
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 535
LATTICE: Democratize High-Fidelity 3D Generation at Scale
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 536
Dehallu3D: Hallucination-Mitigated 3D Generation from a Single Image via Cyclic View Consistency Refinement
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 537
MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 538
TacSIm: A Dataset and Benchmark for Football Tactical Style Imitation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 539
DynamicsBoost: Dynamic Plausible Video Generation via Annotation-Free Continuation Preference Optimization
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 540
Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 541
Fine-Grained GRPO for Precise Preference Alignment in Flow Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 542
Lighting-grounded Video Generation with Renderer-based Agent Reasoning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 543
RewardFlow: Generate Images by Optimizing What You Reward
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 544
Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 545
Self-Corrected Image Generation with Explainable Latent Rewards
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 546
Polyphony: Diffusion-based Dual-Hand Action Segmentation with Alternating Vision Transformer and Semantic Conditioning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 547
Reading Your Actions: Learning Generalizable Action Representations via Pre-training AEMG
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 548
MA-Bench: Towards Fine-grained Micro-Action Understanding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 551
DarkShake-DVS: Event-based Human Action Recognition under Low-light and Shaking Camera Conditions
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 552
Protect to Adapt: Subspace-Constrained Adaptation with Ranked Negative Prompt Feedback for Few-Shot Action Recognition
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 553
SkeletonContext: Skeleton-side Context Prompt Learning for Zero-Shot Skeleton-based Action Recognition
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 554
InTrain: Intrinsic Trainability for Zero-Cost Neural Architecture Search
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 555
S^2FT: Parameter-Efficient Fine-Tuning in Sparse Spectrum Domain
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 556
Rethinking SNN Online Training and Deployment: Gradient-Coherent Learning via Hybrid-Driven LIF Model
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 558
Towards Efficient Medical Reasoning with Minimal Fine-Tuning Data
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 559
AdaBet: Gradient-free Layer Selection for Efficient Training of Deep Neural Networks
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 560
TAS-LoRA: Transformer Architecture Search with Mixture-of-LoRA Experts
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 561
QuCNet: Quantum Deep Learning Driven Multi-Circuit Network for Remote Sensing Image Classification
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 562
Learning to Solve PDEs on Neural Shape Representations
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 563
Frequency Switching Mechanism for Parameter-Efficient Multi-Task Learning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 564
Reconstructing Spiking Neural Networks Using a Single Neuron with Autapses
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 566
GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 567
FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 568
Streamlined Open-Vocabulary Human-Object Interaction Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 569
Decompose and Transfer: CoT-Prompting Enhanced Alignment for Open-Vocabulary Temporal Action Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 571
Boosting Quantitive and Spatial Awareness for Zero-Shot Object Counting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 572
Parameter-Efficient Semantic Augmentation for Enhancing Open-Vocabulary Object Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 573
WeDetect: Fast Open-Vocabulary Object Detection as Retrieval
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 574
Open-Vocabulary Domain Generalization in Urban-Scene Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 575
OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 576
Annotation-Efficient Coreset Selection for Context-dependent Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 577
ALLNet: Multi-task Dense Prediction for Degraded Images
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 578
Geometry-Aware Cross-Modal Graph Alignment for Referring Segmentation in 3D Gaussian Splatting
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 580
GenMask: Adapting DiT for Segmentation via Direct Mask Generation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 581
Frequency-Aware Affinity for Weakly Supervised Semantic Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 582
Learning and Aligning Click-Aware Shape Prior for Interactive Amodal Instance Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 583
Beyond Reassembly: Fractured Object Recovery with Missing Parts
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 584
Best Segmentation Buddies for Image-Shape Correspondence
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 585
RMAE-ProGRess: Advancing Semantic Segmentation in Unstructured Environments
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 587
Orthogonal Spatial-Aware Multi-View Anchor Graph Clustering for Incomplete Remote Sensing Data
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 589
SkySense-VITA: Towards Universal In-context Segmentation of Multi-modal Remote Sensing Imagery
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 590
ProM3E: Probabilistic Masked MultiModal Embedding Model for Ecology
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 591
GeoCoT: Towards Reliable Remote Sensing Reasoning with Manifold Perspective
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 593
NeighborMAE: Exploiting Spatial Dependencies between Neighboring Earth Observation Images in Masked Autoencoders Pretraining
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 594
GeoDiT: A Diffusion-based Vision-Language Model for Geospatial Understanding
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 595
Balanced Hierarchical Contrastive Learning with Decoupled Queries for Fine-grained Object Detection in Remote Sensing Images
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 597
Improving Adversarial Transferability with Local Perturbation Augmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 598
Echoes of Ownership: Adversarial-Guided Dual Injection for Copyright Protection in MLLMs
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 599
Stealing Split Learning Bottom Models by Recovering Embedding Geometry
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 600
PoInit-of-View: Poisoning Initialization of Views Transfers Across Multiple 3D Reconstruction Systems
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 601
No Way To Steal My Face: Proactive Defense Against Identity-Preserving Personalized Generation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 602
Towards Reliable Evaluation of Adversarial Robustness for Spiking Neural Networks
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 603
Where, What, Why: Toward Explainable 3D-GS Watermarking
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 604
Robust Spiking Neural Networks by Temporal Mutual Information
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 605
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 606
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 607
AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 608
Obstruction Reasoning for Robotic Grasping
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 609
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 610
CycleManip: Enabling Cycle-based Manipulation via Effective History Perception and Understanding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 611
SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 612
Adaptive Action Chunking at Inference-time for Vision-Language-Action Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 613
Localizing, Structuring, and Rendering: Bridging 3D and 2D Vision-Language-Action Models for Robotic Manipulation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 614
NIL: No-data Imitation Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 615
Humanoid Generative Pre-Training for Zero-Shot Motion Tracking
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 616
EnergyAction: Unimanual to Bimanual Composition with Energy-Based Models
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 617
CUBic: Coordinated Unified Bimanual Perception and Control Framework
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 618
RehearseVLA: Simulated Post-Training for VLAs with Physically-Consistent World Model
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 619
GraspGen-X: Cross-Embodiment 6-DOF Diffusion-based Grasping
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 620
UETrack: A Unified and Efficient Framework for Single Object Tracking
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 621
ProgTrack: A Multi-Object Tracking Algorithm with Progressive Matching Strategy
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 622
Efficient Video Object Segmentation and Tracking with Recurrent Dynamic Submodel
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 623
Learning to Track Instance from Single Nature Language Description
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 624
MV-TAP: Tracking Any Point in Multi-View Videos
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 625
Adaptive Depth Lightweight RGB-T Tracking with Holistic Token Routing
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 626
Content-Adaptive Hierarchical Hyperprior for Neural Video Coding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 627
UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 628
Similarity-as-Evidence: Calibrating Overconfident VLMs for Interpretable and Label-Efficient Medical Active Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 629
From Infusion to Assimilation Distillation for Medical Image Segmentation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 631
Unlocking Positive Transfer in Incrementally Learning Surgical Instruments: A Self-reflection Hierarchical Prompt Framework
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 632
Keep It Frozen: Domain-Routed Conditional Residual Modulation for Multi-Domain Vision Transformers
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 634
MedLoc-R1: Performance-Aware Curriculum Reward Scheduling for GRPO-Based Medical Visual Grounding
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 635
Turning Pre-Trained Vision Transformers into End-to-End Histopathology Whole Slide Image Models for Survival Prediction
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 636
A Supervised Multi-task Framework for Joint cryo-ET Restoration Enabled by Generative Physical Simulation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 637
KAMP: Knowledge-Anchored Multimodal Pretraining Framework for Medical Image Representation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 638
CARE: A Molecular-Guided Foundation Model with Adaptive Region Modeling for Whole Slide Image Analysis
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 639
Contrastive Cross-Bag Augmentation for Multiple Instance Learning-based Whole Slide Image Classification
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 640
OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 641
Learning complete and explainable visual representations from itemized text supervision
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 642
EgoPoseFormer v2: Accurate Egocentric Human Motion Estimation for AR/VR
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 643
MetricHMSR: Metric Human Mesh and Scene Recovery from Monocular Images
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 644
Differentially Private 2D Human Pose Estimation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 645
TROPHIES: Temporal Reconstruction of Places, Humans, and Cameras from Multi-view Videos
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 646
PoseD-Flow: Versatile and Guided Flow Matching Model of Human Pose
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 648
HUMAPS-4D: A Multimodal Dataset for HUman Motion Analysis with Physiological and Semantic informations
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 649
PHASE-Net: Physics-Grounded Harmonic Attention System for Efficient Remote Photoplethysmography Measurement
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 650
LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 651
Expanding mmWave Datasets for Human Pose Estimation with Unlabeled Data and LiDAR Datasets
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 652
Towards Balanced Multi-Modal Learning in 3D Human Pose Estimation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 653
OMGTex: One-stage Multi-style Facial Texture Reconstruction without Geometry Guidance
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 654
Human Interaction-Aware 3D Reconstruction from a Single Image
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 655
Towards Generalizable AI-Generated Image Detection via Image-Adaptive Prompt Learning
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 656
SAGA: Source Attribution of Generative AI Videos
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 657
VMD-FACT: A New Video Dataset and MLLM-based method for Detecting Realistic AI-Generated Video Misinformation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 658
ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 659
A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 660
PPM-CLIP: Probabilistic Prompt Modeling for Generalizable AI-Generated Image Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 661
Learning from Noisy Supervision: A Denoising–Debiasing Framework for Weakly Supervised Video Anomaly Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 662
Anomaly as Non-Conformity via Training-Free Graph Laplacian Energy Minimization
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 663
VisualAD: Language-Free Zero-Shot Anomaly Detection via Vision Transformer
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 664
CHAL: Causal-guided Hierarchical Anomaly-aware Learning for Moving Infrared Small Target Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 665
RAID: Retrieval-Augmented Anomaly Detection
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 666
ADSeeker: A Knowledge-Grounded Reasoning Framework for Industry Anomaly Detection and Reasoning
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 668
QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 669
GSV2X: Geometry-Aware Uncertainty Modeling and Orthogonal Fusion for Robust Roadside Perception
[
Poster]
Poster
Sat Jun 06 10:45 AM -- 12:45 PM (PDT) @ ExHall F 670
Grounded Latents for Entity-Centric 4D Scene Generation
[
Poster]