Fine-Grained Visual Prompt and Region Self-Distillation for Retrieval-Augmented VQA
Yujie Wang, Hu Zhang, Jiye Liang, Zhiqiang Wang, Hongye Tan, Ru Li
Keywords: Vision, Language, and Reasoning