AI & ML interests
Large Language Models, Distributed Training and Inference
Recent Activity
Organizations
Published an article over 1 year ago: Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2


Published an article almost 2 years ago: Saving Memory Using Padding-Free Transformer Layers during Finetuning
Published an article almost 2 years ago: Aurora-M: The First Open Source Biden-Harris Executive Order Red teamed Multilingual Language Model