Updated • 6.73k
• 196
• Updated • 170M • 22.7k
• 90
• Updated • 621M • 11.9k
• 87
Locutusque/UltraTextbooks
• Updated • 5.52M • 676
• 198
PrimeIntellect/StackV1-popular
• Updated • 93M • 904
• 2
• Updated • 11.7M • 48
• 5
EleutherAI/the_pile_deduplicated
• Updated • 134M • 22.1k
• 110
HIT-TMG/KaLM-embedding-pretrain-data
• Updated • 23.7M • 1.84k
• 20
suriyagunasekar/stackoverflow-with-meta-data
• Updated • 19.9M • 221
• 12
• Updated • 13.6M • 1.12k
• 5
• Updated • 3.71M • 1.02M
• 650
• Updated • 474M • 65
• 4
EleutherAI/deep-ignorance-annealing-mix
• Updated • 89M • 74
• 1
• Updated • 10.2M • 45
• 5
• Updated • 1.76M • 23.9k
• 403
• Updated • 167M • 3.86k
• 68
Locutusque/deeplm-training-data
• Updated • 2.17M • 130
• 3
nvidia/Llama-Nemotron-Post-Training-Dataset
• Updated • 3.91M • 4.09k
• 645
Updated • 51.9k
• 248
EssentialAI/essential-web-v1.0
• Updated • 47.6k
• 219