Running 3.74k The Ultra-Scale Playbook 🌌 3.74k The ultimate guide to training LLM on large GPU Clusters
gradientai/Llama-3-8B-Instruct-Gradient-1048k Text Generation • 8B • Updated Oct 29, 2024 • 10.2k • 680