FineWeb: decanting the web for the finest text data at scale 🍷 • Generate a curated web‑text dataset for LLM training
MT5 release (Collection) • The MT5 release follows the T5 family but is pretrained on multilingual data. The updated UMT5 models are pretrained on an updated corpus. • 10 items • Updated Jul 10, 2025