Quantized LLama-based Models (Collection)
Llama-based models quantized to various precisions • 6 items • Updated 16 days ago
jsbaicenter/Llama-3.3-70B-Instruct-FP8-Dynamic Text Generation • 71B • Updated 16 days ago • 1.12k
jsbaicenter/r1-1776-distill-llama-70b-FP8-Dynamic Text Generation • 71B • Updated 16 days ago • 16 • 1
jsbaicenter/Llama-3.2-3b-Instruct-AWQ-4bit-GEMM Text Generation • 3B • Updated May 4, 2025 • 11