
小明

xiaoming

xiaominghero

AI & ML interests

nlp

Organizations

None yet

Recent Activity

liked a dataset 18 days ago

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated 19 days ago • 1.62M • 52.8k • 293

liked a dataset 22 days ago

nvidia/Nemotron-Pretraining-Code-v1

Viewer • Updated Dec 23, 2025 • 936M • 10.1k • 64

upvoted an article 22 days ago

Article

Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds

22 days ago


liked 2 models about 1 month ago

stepfun-ai/Step-3.5-Flash-Base-Midtrain

Text Generation • 198B • Updated 25 days ago • 212 • 40

stepfun-ai/Step-3.5-Flash-Base

Text Generation • 198B • Updated 25 days ago • 770 • 82

upvoted a paper about 2 months ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 193

upvoted a collection about 2 months ago

UltraData

Collection

Ultra Scale, Ultra Quality, Ultra Coverage • 9 items • Updated Feb 9 • 80

liked 3 models about 2 months ago

upvoted 2 papers 3 months ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Paper • 2601.05593 • Published Jan 9 • 86

upvoted an article 3 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025


upvoted a paper 3 months ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 87

upvoted a paper 4 months ago

Step-GUI Technical Report

Paper • 2512.15431 • Published Dec 17, 2025 • 133

upvoted an article 4 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025 • 617

liked a Space 5 months ago

The Smol Training Playbook

📚

3.08k

The secrets to building world-class LLMs

liked a dataset 6 months ago

allenai/CoSyn-400K

Viewer • Updated Feb 28, 2025 • 408k • 2.12k • 47

upvoted a collection 7 months ago

MobileLLM-R1

Collection

MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21, 2025 • 28

liked a dataset 7 months ago

allenai/WildChat-4.8M

Viewer • Updated Aug 11, 2025 • 3.2M • 6.04k • 131
