view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding Mar 19 • 46
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 503