Academic

Academic · 1 min

An Agentic Evaluation Framework for AI-Generated Scientific Code in PETSc

arXiv:2603.15976v1. Abstract: While large language models have significantly accelerated scientific code generation, comprehensively evaluating the generated code remains a major challenge. Traditional …

Hong Zhang, Barry Smith, Satish Balay, Le Chen, Murat Keceli, Lois Curfman McInnes, Junchao Zhang
Academic · 1 min

Are Large Language Models Truly Smarter Than Humans?

arXiv:2603.16197v1. Abstract: Public leaderboards increasingly suggest that large language models (LLMs) surpass human experts on benchmarks spanning academic knowledge, law, and programming. …

Eshwar Reddy M, Sourav Karmakar
Academic · 1 min

Robust Language Identification for Romansh Varieties

arXiv:2603.15969v1. Abstract: The Romansh language has several regional varieties, called idioms, which sometimes have limited mutual intelligibility. Despite this linguistic diversity, there …

Charlotte Model, Sina Ahmadi, Jannis Vamvas
Academic · 1 min

Adaptive Theory of Mind for LLM-based Multi-Agent Coordination

arXiv:2603.16264v1. Abstract: Theory of Mind (ToM) refers to the ability to reason about others' mental states, and higher-order ToM involves considering that …

Chunjiang Mu, Ya Zeng, Qiaosheng Zhang, Kun Shao, Chen Chu, Hao Guo, Danyang Jia, Zhen Wang, Shuyue Hu