KV Cache Transform Coding for Compact Storage in LLM Inference Paper • 2511.01815 • Published Nov 3, 2025 • 2
saricles/MiniMax-M2.5-REAP-172B-A10B-NVFP4-GB10 Text Generation • 98B • Updated 19 days ago • 846 • 8
saricles/MiniMax-M2.5-REAP-139B-A10B-NVFP4-GB10 Text Generation • 79B • Updated 19 days ago • 520 • 5