Diffusion Language Models
Collection
6 items • Updated
A diffusion-style masked language model fine-tuned in instruct mode using a discrete denoising objective.
Intended for instruction-following style infilling in chat-like prompts.
Example
from refinebert.diffusion_engine import MaskedDiffusionEngine
engine = MaskedDiffusionEngine("philipp-zettl/modernbert-diffusion-instruct")
prompt = "User: What is diffusion?
AI:"
output = engine.generate(prompt, num_new_tokens=30, steps=12, guidance_scale=3.0)
print(output)
Datasets are streamed from Hugging Face and mixed by mode.
| Dataset | Percentage | Purpose |
|---|---|---|
| HuggingFaceH4/ultrachat_200k (train_sft) | 100% | Instruction chat |
| Metric | Value |
|---|---|
| Training loss (latest) | 4.9687 |
| Training loss (mean) | 3.7032 |
| Training step | 50000 / 50000 |
Base model
answerdotai/ModernBERT-base