PreSINQ GGUF
Collection
This collection contains SINQ GGUF models • 4 items • Updated • 3
None defined yet.
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding