Dynamic Delayed Tree Expansion For Improved Multi-Path Speculative Decoding
arXiv:2602.16994v1 Announce Type: new Abstract: Multi-path speculative decoding accelerates lossless sampling from a target model by using a cheaper draft model to generate a draft …
Rahul Thomas, Teo Kitanovski, Micah Goldblum, Arka Pal
5 views