All Articles

Articles

Academic · 1 min

Learning When to Trust in Contextual Bandits

arXiv:2603.13356v1 Announce Type: new Abstract: Standard approaches to Robust Reinforcement Learning assume that feedback sources are either globally trustworthy or globally adversarial. In this paper, …

Majid Ghasemi, Mark Crowley
4 views
Academic · 1 min

TheraAgent: Multi-Agent Framework with Self-Evolving Memory and Evidence-Calibrated Reasoning for PET Theranostics

arXiv:2603.13676v1 Announce Type: new Abstract: PET theranostics is transforming precision oncology, yet treatment response varies substantially; many patients receiving 177Lu-PSMA radioligand therapy (RLT) for metastatic …

Zhihao Chen, Jiahui Wang, Yizhou Chen, Xiaozhong Ji, Xiaobin Hu, Jimin Hong, Wolfram Andreas Bosbach, Axel Rominger, Ali Afshar-Oromieh, Hongming Shan, Kuangyu Shi
7 views
Academic · 1 min

LLM Routing as Reasoning: A MaxSAT View

arXiv:2603.13612v1 Announce Type: new Abstract: Routing a query through an appropriate LLM is challenging, particularly when user preferences are expressed in natural language and model …

Son Nguyen, Xinyuan Liu, Ransalu Senanayake
12 views