2 14

do

cocodd

AI & ML interests

None yet

Recent Activity

liked a dataset 25 days ago

HuggingFaceTB/smoltalk2

liked a dataset 2 months ago

ConvLab/kvret

liked a model 2 months ago

BAAI/CCI3-HQ-Classifier

View all activity

Organizations

None yet

liked a dataset 25 days ago

HuggingFaceTB/smoltalk2

Viewer • Updated Oct 31, 2025 • 8.61M • 10.6k • 145

liked a dataset 2 months ago

ConvLab/kvret

Preview • Updated Nov 25, 2022 • 92 • 4

liked a model 2 months ago

BAAI/CCI3-HQ-Classifier

0.6B • Updated Oct 28, 2024 • 25 • 11

liked a dataset 3 months ago

liwu/MNBVC

Updated Dec 3, 2025 • 202k • 592

liked a Space 4 months ago

The Smol Training Playbook

📚

3.06k

The secrets to building world-class LLMs

liked a Space 5 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

Visualize on-policy distillation for any model family

liked a dataset 6 months ago

withmartian/routerbench

Updated Mar 27, 2024 • 496 • 23

liked a model 7 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 224k • • 2.43k

liked a Space 8 months ago

The Ultra-Scale Playbook

🌌

3.75k

The ultimate guide to training LLM on large GPU Clusters

liked 2 datasets 8 months ago

POLARIS-Project/Polaris-Dataset-53K

Viewer • Updated Jun 18, 2025 • 53.3k • 790 • 34

pe-nlp/DeepScaleR-40k-Prompt

Viewer • Updated Feb 17, 2025 • 40.3k • 7 • 1

liked 2 datasets 9 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21, 2025 • 110k • 780 • 731

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated Feb 19, 2025 • 110k • 294 • 221

liked a Space about 1 year ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Read a detailed overview of the FineWeb web‑scale text dataset

do