Granite 2.0 Code Models Collection Code models for generation, understanding, and instruction-following tasks. • 22 items • Updated 3 days ago • 202
Granite 3.0 Language Models Collection Language models for enterprise-grade text generation. • 8 items • Updated 3 days ago • 101
Enhancing Training Efficiency Using Packing with Flash Attention Paper • 2407.09105 • Published Jul 12, 2024 • 17
view article Article Saving Memory Using Padding-Free Transformer Layers during Finetuning Jun 11, 2024 • 21