Poster
ImPortrait: Implicit Condition Control for Enhanced Portrait Animation
Zunnan Xu · Zhentao Yu · Zixiang Zhou · Jun Zhou · Xiaoyu Jin · Fa-Ting Hong · Xiaozhong Ji · Junwei Zhu · Chengfei Cai · Shiyu Tang · Qin Lin · Xiu Li · qinglin lu
We introduce ImPortrait, a diffusion-based condition control method that employs implicit representations for highly controllable and lifelike portrait animation. Given a single portrait image as an appearance reference and video clips as driving templates, ImPortrait can animate the character in the reference image by the facial expression and head pose of the driving videos. In our framework, we utilize pre-trained encoders to achieve the decoupling of portrait motion information and identity in videos. To do so, implicit representation is adopted to encode motion information and is employed as control signals in the animation phase. By leveraging the power of stable video diffusion (SVD) as the main building block, we carefully design adapter layers to inject control signals into denoising unet through attention mechanisms. These bring spatial richness of details and temporal consistency. ImPortrait also exhibits strong generalization performance, which can effectively disentangle appearance and motion under different image styles. Our framework outperforms existing methods, demonstrating superior temporal consistency and controllability.
Live content is unavailable. Log in and register to view live content