A Content-Based Framework for Cybersecurity Refusal Decisions in Large Language Models
arXiv:2602.15689v1 Announce Type: new Abstract: Large language models and LLM-based agents are increasingly used for cybersecurity tasks that are inherently dual-use. Existing approaches to refusal, spanning academic policy frameworks and commercially deployed systems, often rely on broad topic-based bans or...
Fairness, accountability and transparency: notes on algorithmic decision-making in criminal justice
AbstractOver the last few years, legal scholars, policy-makers, activists and others have generated a vast and rapidly expanding literature concerning the ethical ramifications of using artificial intelligence, machine learning, big data and predictive software in criminal justice contexts. These concerns...
AgentLAB: Benchmarking LLM Agents against Long-Horizon Attacks
arXiv:2602.16901v1 Announce Type: new Abstract: LLM agents are increasingly deployed in long-horizon, complex environments to solve challenging problems, but this expansion exposes them to long-horizon attacks that exploit multi-turn user-agent-environment interactions to achieve objectives infeasible in single-turn settings. To measure...
The anticipated criminal law decisions and arguments for the rest of this term
ScotusCrim is a recurring series by Rory Little focusing on intersections between the Supreme Court and criminal law. Today’s column is my busman’s holiday project: providing nerd-like numbers and information […]The postThe anticipated criminal law decisions and arguments for the...