Llama-2-7B-Chat Q4_K_M GGUF

Quantized from Junn17/llama using Unsloth.

Usage

llama-cli --model llama-2-7b-chat.Q4_K_M.gguf -p "Hello!"

GGUF

Model size

7B params

Architecture

llama

Hardware compatibility

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Quantized

(101)

this model