Poster
InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation
Sirui Xu · Dongting Li · Yucheng Zhang · Xiyan Xu · Qi Long · Ziyin Wang · Yunzhi Lu · Shuchang Dong · Hezi Jiang · Akshat Gupta · Yu-Xiong Wang · Liangyan Gui
While large-scale human motion capture datasets have advanced human motion generation, modeling and generating dynamic 3D human-object interactions (HOIs) remains challenging due to the limitations of existing datasets, which often lack extensive, high-quality text-interaction pairs and exhibit artifacts such as contact penetration, floating, and incorrect hand motions. To address these issues, we introduce InterAct, a large-scale 3D HOI benchmark with key contributions in both dataset and methodology. First, we consolidate 21.81 hours of HOI data from diverse sources, standardizing them and enriching them with detailed textual annotations. Second, we propose a unified optimization framework that enhances data quality by minimizing artifacts and restoring hand motions. Leveraging the insight of contact invariance, we preserve human-object relationships while introducing motion variations, thereby expanding the dataset to 30.70 hours. Third, we introduce six tasks to benchmark existing methods and develop a unified HOI generative model based on multi-task learning that achieves state-of-the-art results. Extensive experiments validate the utility of our dataset as a foundational resource for advancing 3D human-object interaction generation. The dataset will be made publicly available to support further research in the field.
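To make the contact-invariance idea concrete, the toy Python sketch below illustrates one plausible reading of it: if the relative rigid transform between the contacting body part and the object is held fixed at every frame, the human trajectory can be perturbed while the contact relationship is preserved. This is a minimal illustration under assumed conventions (per-frame 4x4 world-frame poses for a hand and an object), not the paper's released code or its actual optimization framework.

```python
# Hypothetical sketch of contact-invariant motion augmentation.
# Assumptions (not from the source): hand_poses and obj_poses are arrays of
# per-frame 4x4 homogeneous world-frame transforms for the contacting body
# part and the object, respectively.
import numpy as np

def perturb(pose: np.ndarray, yaw: float, offset: np.ndarray) -> np.ndarray:
    """Apply a world-frame yaw rotation and translation to a 4x4 pose."""
    c, s = np.cos(yaw), np.sin(yaw)
    delta = np.eye(4)
    delta[:3, :3] = [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]
    delta[:3, 3] = offset
    return delta @ pose

def augment(hand_poses: np.ndarray, obj_poses: np.ndarray,
            yaw: float, offset: np.ndarray):
    """Vary the motion while keeping the per-frame hand-object relative
    transform (a proxy for the contact relationship) exactly fixed."""
    new_hand, new_obj = [], []
    for T_h, T_o in zip(hand_poses, obj_poses):
        T_rel = np.linalg.inv(T_h) @ T_o       # object pose in the hand frame
        T_h_new = perturb(T_h, yaw, offset)    # perturbed human motion
        new_hand.append(T_h_new)
        new_obj.append(T_h_new @ T_rel)        # recompose: contact preserved
    return np.stack(new_hand), np.stack(new_obj)

# Usage: identity trajectories, rotated 10 degrees and shifted 0.5 m in x.
hands = np.tile(np.eye(4), (100, 1, 1))
objs = np.tile(np.eye(4), (100, 1, 1))
objs[:, 0, 3] = 0.3                            # object held 0.3 m from the hand
h2, o2 = augment(hands, objs, np.deg2rad(10.0), np.array([0.5, 0.0, 0.0]))
# The hand-object relative transform is unchanged by the augmentation.
assert np.allclose(np.linalg.inv(h2[0]) @ o2[0],
                   np.linalg.inv(hands[0]) @ objs[0])
```

In this simplified view, only the rigid relative transform is preserved; the paper's framework additionally restores hand motions and minimizes penetration and floating artifacts, which a per-frame rigid recomposition alone cannot do.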