Quantization
#1
by hadadrjt - opened
Are there any plans for quantization, such as 2-bit and 4-bit with Ollama? This could reduce resource usage.
Yeah
Are there any plans for quantization, such as 2-bit and 4-bit with Ollama? This could reduce resource usage.
Yeah