Agentified Assessment of Logical Reasoning Agents
arXiv:2603.02788v1 Announce Type: new Abstract: We present a framework for evaluating and benchmarking logical reasoning agents when assessment itself must be reproducible, auditable, and robust …
Zhiyu Ni, Yifeng Xiao, Zheng Liang
9 views