KV Cache Transform Coding for Compact Storage in LLM Inference Paper • 2511.01815 • Published Nov 3, 2025 • 3
saricles/MiniMax-M2.5-REAP-172B-A10B-NVFP4-GB10 Text Generation • 98B • Updated 25 days ago • 1.04k • 10
saricles/MiniMax-M2.5-REAP-139B-A10B-NVFP4-GB10 Text Generation • 79B • Updated 24 days ago • 649 • 5