Add precision error note
README.md
This repository contains models that have been converted to the GGUF format with various quantizations from an IBM Granite base model.

Please reference the base model's full model card here:
https://huggingface.co/ibm-granite/granite-4.0-1b

### Known Issues

This model often uses the full numerical range of a 32-bit float (`f32`), so variants stored in formats with a smaller numerical range may run into precision errors at inference. The `f16` variant in particular is known to fail on many hardware combinations.
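As a rough illustration of the range issue (a minimal numpy sketch using an arbitrary example value, not one taken from this model's weights): `f16` tops out near 65504, while `bf16` keeps `f32`'s 8-bit exponent and therefore roughly the same dynamic range.

```python
import numpy as np

# An example magnitude that is unremarkable in f32 (or in bf16, which
# shares f32's 8-bit exponent) but exceeds f16's maximum of 65504.
x = np.float32(1e6)

print(np.float32(x))             # stays finite in f32
print(np.float16(x))             # overflows to inf in f16
print(np.finfo(np.float16).max)  # 65504.0, the largest finite f16 value
```

This is why quantizing down to `f16` can silently turn large activations or weights into `inf`, while `bf16` trades mantissa precision for the wider exponent range and avoids the overflow.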
**The recommended full-precision variant is `bf16`**.