World2Mind: Cognition Toolkit for Allocentric Spatial Reasoning in Foundation Models
arXiv:2603.09774v1
Abstract: Achieving robust spatial reasoning remains a fundamental challenge for current Multimodal Foundation Models (MFMs). Existing methods either overfit statistical shortcuts …
Shouwei Ruan, Bin Wang, Zhenyu Wu, Qihui Zhu, Yuxiang Zhang, Hang Su, Yubin Wang