0xSero/GLM-4.6-REAP-218B-A32B-W4A16-AutoRound
Text Generation
Transformers
Safetensors
English
glm4_moe
glm
glm4
MOE
pruning
reap
cerebras
quantized
autoround
4bit
w4a16
conversational
4-bit precision
auto-round
arxiv: 2510.13999
License: mit
sglang inference
#1 by henryhaohao, opened Jan 3
Discussion
henryhaohao (Jan 3):
Does anyone know how to run inference on this model with SGLang?
henryhaohao changed discussion status to closed (Jan 3)