A Content-Based Framework for Cybersecurity Refusal Decisions in Large Language Models
arXiv:2602.15689v1 Announce Type: new Abstract: Large language models and LLM-based agents are increasingly used for cybersecurity tasks that are inherently dual-use. Existing approaches to refusal, …
Meirav Segal, Noa Linder, Omer Antverg, Gil Gekker, Tomer Fichman, Omri Bodenheimer, Edan Maor, Omer Nevo
6 views