Thomas Bouvier

tbouvier

https://thomas-bouvier.io

AI & ML interests

HPC for ML, large-scale pretraining, ML4Science

Organizations

None yet

Recent Activity

liked a dataset about 2 months ago

ILSVRC/imagenet-1k

Viewer • Updated Sep 17, 2025 • 1.43M • 115k • 755

liked a dataset 9 months ago

LEAP/ClimSim_high-res

Updated Sep 29, 2023 • 42.5k • 12

upvoted an article 9 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024 • 737

liked a dataset 10 months ago

mcherukara/PtychoNN_data

Updated Mar 18, 2025 • 126 • 2

liked 2 models 11 months ago

allenai/ACE2-ERA5

Updated 15 days ago • 56 • 16

microsoft/aurora

Updated Jun 20, 2025 • 50

upvoted an article 12 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024

liked 3 Spaces about 1 year ago

Memory Viz 🧠

Predict Memory 🧮 • 106 • Calculate and visualize memory usage for model training

The Ultra-Scale Playbook 🌌 • 3.75k • The ultimate guide to training LLM on large GPU Clusters

upvoted an article about 1 year ago

Article

Open-R1: Update #1

Feb 2, 2025 • 305

liked 2 datasets about 1 year ago

PleIAs/common_corpus

Viewer • Updated Feb 19 • 69.9k • 183k • 387

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 281k • 998

liked 3 models about 1 year ago

upvoted a collection about 1 year ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 159

liked a model about 1 year ago

answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15, 2025 • 5.36M • 1.01k

liked 2 Spaces about 1 year ago

TheWell 🌍 • Visualization of data from the Well

FineWeb: decanting the web for the finest text data at scale 🍷 • 1.32k • Read a detailed overview of the FineWeb web‑scale text dataset
