# Qwen2.5-7B-Instruct Q4_K_M GGUF

A 4-bit (Q4_K_M) GGUF quantization of Junn17/qwen, produced with Unsloth.

## Usage

```shell
llama-cli --model qwen_model.Q4_K_M.gguf -p "Hello!"
```
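Before pointing `llama-cli` at the file, it can help to sanity-check that the download is a complete GGUF file rather than a truncated or HTML error response. A minimal sketch (not part of this repo) that reads the GGUF header: the format begins with the magic bytes `GGUF`, followed by a little-endian uint32 version and a uint64 tensor count.

```python
import struct

GGUF_MAGIC = b"GGUF"  # every GGUF file starts with these four bytes


def inspect_gguf_header(path):
    """Return (version, tensor_count) from a GGUF file header.

    Raises ValueError if the magic bytes don't match.
    Header layout (GGUF v2+): 4-byte magic, uint32 version,
    uint64 tensor_count, uint64 metadata_kv_count, little-endian.
    """
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != GGUF_MAGIC:
            raise ValueError(f"not a GGUF file: magic={magic!r}")
        version = struct.unpack("<I", f.read(4))[0]
        tensor_count = struct.unpack("<Q", f.read(8))[0]
        return version, tensor_count
```

If the magic check fails, the usual culprit is a download that saved an error page instead of the model weights; re-download and compare file sizes.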
## Model details

- Format: GGUF
- Quantization: 4-bit (Q4_K_M)
- Model size: 8B params
- Architecture: qwen2


## Model tree for Junn17/quantize_qwen

- Base model: Qwen/Qwen2.5-7B
- This model: Q4_K_M quantized derivative