Running 3.76k The Ultra-Scale Playbook π 3.76k The ultimate guide to training LLM on large GPU Clusters
mattshumer/Reflection-Llama-3.1-70B Text Generation β’ 71B β’ Updated Sep 24, 2024 β’ 308 β’ 1.71k
MaziyarPanahi/MixTAO-7Bx2-MoE-Instruct-v7.0-GGUF Text Generation β’ 13B β’ Updated Feb 4, 2024 β’ 183 β’ 9