Knowledge Packs: Zero-Token Knowledge Delivery via KV Cache Injection
arXiv:2604.03270v1 Announce Type: new Abstract: RAG wastes tokens. We propose Knowledge Packs: pre-computed KV caches that deliver the same knowledge at zero token cost. For …
Andrey Pustovit
9 views