ViewFusion: Structured Spatial Thinking Chains for Multi-View Reasoning
arXiv:2603.06024v1 Announce Type: new Abstract: Multi-view spatial reasoning remains difficult for current vision-language models. Even when multiple viewpoints are available, models often underutilize cross-view relations …
Xingjian Tao, Yiwei Wang, Yujun Cai, Yifan Song, Jing Tang
17 views