Skip to main content

Category

Academic

Academic · 1 min

Benchmark Leakage Trap: Can We Trust LLM-based Recommendation?

arXiv:2602.13626v1 Announce Type: new Abstract: The expanding integration of Large Language Models (LLMs) into recommender systems poses critical challenges to evaluation reliability. This paper identifies …

Mingqiao Zhang, Qiyao Peng, Yumeng Wang, Chunyuan Liu, Hongtao Liu
5 views
Academic · 1 min

Optimized Certainty Equivalent Risk-Controlling Prediction Sets

arXiv:2602.13660v1 Announce Type: new Abstract: In safety-critical applications such as medical image segmentation, prediction systems must provide reliability guarantees that extend beyond conventional expected loss …

Jiayi Huang, Amirmohammad Farzaneh, Osvaldo Simeone
8 views