mklasby/self-distill-qwen-qwen3-30b-a3b-theblackcat102-evol-codealpaca-v1 Viewer • Updated 28 days ago • 111k • 73
mklasby/self-distill-qwen-qwen3-30b-a3b-theblackcat102-evol-codealpaca-v1 Viewer • Updated 28 days ago • 111k • 73
mklasby/self-distill__qwen-qwen3-30b-a3b__theblackcat102-evol-codealpaca-v1__greedy__seed-42 Viewer • Updated 29 days ago • 1.02k • 31
mklasby/self-distill__qwen-qwen3-30b-a3b__theblackcat102-evol-codealpaca-v1__greedy__seed-42 Viewer • Updated 29 days ago • 1.02k • 31
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 134