view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding 8 days ago • 43
view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding 8 days ago • 43
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17