Skip to main content

All Articles

Articles

Academic · 1 min

Improved Upper Bounds for Slicing the Hypercube

arXiv:2602.16807v1 Announce Type: new Abstract: A collection of hyperplanes $\mathcal{H}$ slices all edges of the $n$-dimensional hypercube $Q_n$ with vertex set $\{-1,1\}^n$ if, for every …

Duncan Soiffer, Nathaniel Itty, Christopher D. Rosin, Blake Bruell, Mason DiCicco, G\'abor N. S\'ark\"ozy, Ryan Offstein, Daniel Reichman
5 views
Academic · 1 min

OpenSage: Self-programming Agent Generation Engine

arXiv:2602.16891v1 Announce Type: new Abstract: Agent development kits (ADKs) provide effective platforms and tooling for constructing agents, and their designs are critical to the constructed …

Hongwei Li, Zhun Wang, Qinrun Dai, Yuzhou Nie, Jinjun Peng, Ruitong Liu, Jingyang Zhang, Kaijie Zhu, Jingxuan He, Lun Wang, Yangruibo Ding, Yueqi Chen, Wenbo Guo, Dawn Song
7 views
Academic · 1 min

AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks

arXiv:2602.16901v1 Announce Type: new Abstract: LLM agents are increasingly deployed in long-horizon, complex environments to solve challenging problems, but this expansion exposes them to long-horizon …

Tanqiu Jiang, Yuhui Wang, Jiacheng Liang, Ting Wang
7 views
Academic · 1 min

SourceBench: Can AI Answers Reference Quality Web Sources?

arXiv:2602.16942v1 Announce Type: new Abstract: Large language models (LLMs) increasingly answer queries by citing web sources, but existing evaluations emphasize answer correctness rather than evidence …

Hexi Jin, Stephen Liu, Yuheng Li, Simran Malik, Yiying Zhang
4 views