Running • 3.79k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters
meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 492k • 2.7k
togethercomputer/RedPajama-INCITE-Instruct-3B-v1 Text Generation • Updated May 9, 2023 • 1.91k • 93