Q1_0_g128 CPU kernel fix + AVX2 SIMD (fork of PrismML-Eng/llama.cpp)

03ba2cd verified 5 days ago

185 Bytes

	---
	base_model:
	- {base_model}
	---
	# {model_name} GGUF

	Recommended way to run this model:

	```sh
	llama-server -hf {namespace}/{model_name}-GGUF
	```

	Then, access http://localhost:8080