MiscSpaces - a ldwang Collection

ldwang 's Collections

MiscR1

MiscSpaces

updated Nov 6, 2025

Running

593

Scaling test-time compute

📈

593

Run advanced search strategies to boost LLM problem solving
Running

Featured

1.29k

FineWeb: decanting the web for the finest text data at scale

🍷

1.29k

Explore the FineWeb dataset and its creation process
Running

3.7k

The Ultra-Scale Playbook

🌌

3.7k

The ultimate guide to training LLM on large GPU Clusters
Running

218

FineVision: Open Data is All You Need

📝

218

A new open-source dataset for training VLMs
Running

19

Megatron Memory Estimator

👁

19

Estimate GPU memory usage for Megatron models
Running on Zero

19

Smol2Operator Demo

🐢

19

Smol2Operator Demo: GUI Agent Model
Running on CPU Upgrade

Featured

3k

The Smol Training Playbook

📚

3k

The secrets to building world-class LLMs
Running

86

Unlocking On-Policy Distillation for Any Model Family

📝

86

Visualize on-policy distillation for any model family