Articles

Academic · 1 min

Aligning Language Models from User Interactions

arXiv:2603.12273v1 · Announce Type: cross · Abstract: Multi-turn user interactions are among the most abundant data produced by language models, yet we lack effective methods to learn …

Thomas Kleine Buening, Jonas Hübotter, Barna Pásztor, Idan Shenfeld, Giorgia Ramponi, Andreas Krause
Academic · 1 min

AI Planning Framework for LLM-Based Web Agents

arXiv:2603.12710v1 · Announce Type: new · Abstract: Developing autonomous agents for web-based tasks is a core challenge in AI. While Large Language Model (LLM) agents can interpret …

Orit Shahnovsky, Rotem Dror
Academic · 1 min

Developing and evaluating a chatbot to support maternal health care

arXiv:2603.13168v1 · Announce Type: new · Abstract: The ability to provide trustworthy maternal health information using phone-based chatbots can have a significant impact, particularly in low-resource settings …

Smriti Jha, Vidhi Jain, Jianyu Xu, Grace Liu, Sowmya Ramesh, Jitender Nagpal, Gretchen Chapman, Benjamin Bellows, Siddhartha Goyal, Aarti Singh, Bryan Wilder
Academic · 1 min

Maximum Entropy Exploration Without the Rollouts

arXiv:2603.12325v1 · Announce Type: cross · Abstract: Efficient exploration remains a central challenge in reinforcement learning, serving as a useful pretraining objective for data collection, particularly when …

Jacob Adamczyk, Adam Kamoski, Rahul V. Kulkarni