Advancing Multimodal Judge Models through a Capability-Oriented Benchmark and MCTS-Driven Data Generation
arXiv:2603.00546v1 Announce Type: new Abstract: Using Multimodal Large Language Models (MLLMs) as judges to achieve precise and consistent evaluations has gradually become an emerging paradigm …
Zeyu Chen, Huanjin Yao, Ziwang Zhao, Min Yang
10 views