Did you skip gate quantization and what dataset did you use?
is it possible to load in vllm ?
· Sign up or log in to comment