PhGPO: Pheromone-Guided Policy Optimization for Long-Horizon Tool Planning
arXiv:2602.13691v1 Announce Type: new Abstract: Recent advancements in Large Language Model (LLM) agents have demonstrated strong capabilities in executing complex tasks through tool use. However, …
Yu Li, Guangfeng Cai, Shengtian Yang, Han Luo, Shuo Han, Xu He, Dong Li, Lei Feng
3 views