Academic

Academic

Academic · 1 min

LLM Routing as Reasoning: A MaxSAT View

arXiv:2603.13612v1 Announce Type: new Abstract: Routing a query through an appropriate LLM is challenging, particularly when user preferences are expressed in natural language and model …

Son Nguyen, Xinyuan Liu, Ransalu Senanayake
10 views
Academic · 1 min

EviAgent: Evidence-Driven Agent for Radiology Report Generation

arXiv:2603.13956v1 Announce Type: new Abstract: Automated radiology report generation holds immense potential to alleviate the heavy workload of radiologists. Despite the formidable vision-language capabilities of …

Tuoshi Qi, Shenshen Bu, Yingfei Xiang, Zhiming Dai
6 views
Academic · 1 min

State Algebra for Probabilistic Logic

arXiv:2603.13574v1 Announce Type: new Abstract: This paper presents a Probabilistic State Algebra as an extension of deterministic propositional logic, providing a computational framework for constructing …

Dmitry Lesnik, Tobias Sch\"afer
30 views
Academic · 1 min

vla-eval: A Unified Evaluation Harness for Vision-Language-Action Models

arXiv:2603.13966v1 Announce Type: new Abstract: Vision Language Action VLA models are typically evaluated using per benchmark scripts maintained independently by each model repository, leading to …

Suhwan Choi, Yunsung Lee, Yubeen Park, Chris Dongjoo Kim, Ranjay Krishna, Dieter Fox, Youngjae Yu
7 views
Academic · 1 min

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

arXiv:2603.13594v1 Announce Type: new Abstract: Large language models are shifting from passive information providers to active agents intended for complex workflows. However, their deployment as …

Shiva Krishna Reddy Malay, Shravan Nayak, Jishnu Sethumadhavan Nair, Sagar Davasam, Aman Tiwari, Sathwik Tejaswi Madhusudhan, Sridhar Krishna Nemala, Srinivas Sunkara, Sai Rajeswar
6 views