Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 21 days ago • 132
GLM-4.5-THIREUS-SPECIAL_SPLIT Collection These model shards are meant to be used with Thireus' GGUF Tool Suite - https://gguf.thireus.com/ • 56 items • Updated 7 days ago • 2
INT8 LLMs for vLLM Collection Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 47 items • Updated 16 days ago • 19