TurboBoA: Faster and Exact Attention-aware Quantization without Backpropagation Paper • 2602.04929 • Published Feb 4 • 1
Attention-aware Post-training Quantization without Backpropagation Paper • 2406.13474 • Published Jun 19, 2024 • 1