MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts
arXiv:2604.06505v1 Announce Type: new Abstract: Large language models (LLMs) are widely explored for reasoning-intensive research tasks, yet resources for testing whether they can infer scientific …
Weiyue Li, Ruizhi Qian, Yi Li, Yongce Li, Yunfan Long, Jiahui Cai, Yan Luo, Mengyu Wang
17 views