Distilling Counterfactual Reasoning from Language to Vision: Causal Graph-Guided Post-Training for Video Understanding
Yuefei Chen, Jiang Liu, Xiaodong Lin, Ruixiang Tang
Keywords:
Vision, Language, and Reasoning
Successful Page Load