Arthur Zucker's picture

In a Training Loop 🔄

Arthur Zucker PRO

ArthurZ

huggingface

·

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

adarshxs/deep-gemm

updated a dataset about 20 hours ago

transformers-community/circleci-test-results

updated a dataset 1 day ago

huggingface/documentation-images

View all activity

Organizations

published an article 3 months ago

Article

Mixture of Experts (MoEs) in Transformers

+5

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 159

published an article 5 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq

•

Dec 18, 2025

• 124

published an article 5 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

lysandre, ArthurZ, cyrilvallez, reach-vb

•

Dec 1, 2025

• 310

published an article 6 months ago

Article

Continuous batching from first principles

+1

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 379

published an article 8 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 187

published an article 12 months ago

Article

The Transformers Library: standardizing model definitions

+2

lysandre, ArthurZ, pcuenq, julien-c

•

May 15, 2025

• 121

published an article over 1 year ago

Article

Fixing Gradient Accumulation

+4

lysandre, ArthurZ, muellerzr, ydshieh, BenjaminB, pcuenq

•

Oct 16, 2024

• 66

published an article over 1 year ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

+4

RQlee, ArthurZ, achikundu, lwtr, rganti, mayank-mishra

•

Aug 21, 2024

• 41

published an article about 2 years ago

Article

Fine-Tuning Gemma Models in Hugging Face

+2

svaibhav, alanwaketan, ybelkada, ArthurZ

•

Feb 23, 2024

• 46

published an article over 2 years ago

Article

Code Llama: Llama 2 learns to code

+6

philschmid, osanseviero, pcuenq, lewtun, lvwerra, loubnabnl, ArthurZ, joaogante

•

Aug 25, 2023

• 10

published an article over 2 years ago

Article

Code Llama: Llama 2 learns to code

+6

philschmid, osanseviero, pcuenq, lewtun, lvwerra, loubnabnl, ArthurZ, joaogante

•

Aug 25, 2023

• 10