CohereLabs/cohere-transcribe-03-2026 Automatic Speech Recognition • Updated about 23 hours ago • 50.5k • 583
Mistral Small 4 Collection A state-of-the-art open-weight model with a granular Mixture-of-Experts architecture that fuses instruct, reasoning, and agentic skills. • 3 items • Updated 14 days ago • 63
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated about 8 hours ago • 1.36M • 236
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published Feb 5 • 349