view article Article How I contributed a new model to the Transformers library using Codex about 8 hours ago • 11
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 5 items • Updated 6 days ago • 18
view article Article Introducing Cohere-transcribe: state-of-the-art speech recognition 4 days ago • 28
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 24 days ago • 117
view article Article Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI 13 days ago • 58
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL +4 Jun 3, 2025 • 101
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 14 days ago • 63
mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9, 2025 • 53