Peak + Accumulation: A Proxy-Level Scoring Formula for Multi-Turn LLM Attack Detection
arXiv:2602.11247v1 (announce type: cross)
Abstract: Multi-turn prompt injection attacks distribute malicious intent across multiple conversation turns, exploiting the assumption that each turn is evaluated independently. …
J Alex Corll