Prompt Architecture Determines Reasoning Quality: A Variable Isolation Study on the Car Wash Problem
arXiv:2602.21814v1
Announce Type: new
Abstract: Large language models consistently fail the "car wash problem," a viral reasoning benchmark requiring implicit physical constraint inference. We present …