Diagnosing Causal Reasoning in Vision-Language Models via Structured Relevance Graphs
arXiv:2602.20878v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) achieve strong performance on visual question answering benchmarks, yet often rely on spurious correlations rather than …
Dhita Putri Pratama, Soyeon Caren Han, Yihao Ding
24 views