MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 29 items • Updated 17 days ago • 73
Nemotron v3 Pre-Training Collection Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 2 days ago • 11
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 26 items • Updated 2 days ago • 95
view article Article Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time Feb 18, 2025 • 35