ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference Paper • 2511.10645 • Published Nov 13, 2025 • 9 • 4
ParoQuant Collection Pairwise Rotation Quantization for Efficient Reasoning LLM Inference • 18 items • Updated 20 days ago • 19
SparseLoRA Collection Accelerating LLM Fine-Tuning with Contextual Sparsity • 4 items • Updated Mar 11 • 3
ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory Paper • 2509.04439 • Published Sep 4, 2025 • 1
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems Paper • 2510.12872 • Published Oct 14, 2025 • 4