[2606.03819] TreeFlash: Parallel AR-Approximation for Faster Speculative Decoding
AI disclosure
Summary
Abstract page for arXiv paper 2606.03819: TreeFlash: Parallel AR-Approximation for Faster Speculative Decoding