Evaluating Cross-Modal Reasoning Ability and Problem Characteristics with Multimodal Item Response Theory
arXiv:2603.02663v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) have recently emerged as general architectures capable of reasoning over diverse modalities. Benchmarks for MLLMs …
Shunki Uebayashi, Kento Masui, Kyohei Atarashi, Han Bao, Hisashi Kashima, Naoto Inoue, Mayu Otani, Koh Takeuchi
4 views