common-dataset
updated
HuggingFaceH4/ultrachat_200k
Viewer
• Updated • 515k • 47.2k
• 692
Text Generation
• 7B • Updated • 5.48k
• 321
shareAI/ShareGPT-Chinese-English-90k
Preview
• Updated • 1.33k
• 279
Viewer
• Updated • 207M • 31k
• 497
lmsys/chatbot_arena_conversations
Viewer
• Updated • 33k • 68.6k
• 455
Viewer
• Updated • 968M • 39.8k
• 905
WizardLMTeam/WizardLM_evol_instruct_70k
Viewer
• Updated • 70k • 1.75k
• 196
LargeWorldModel/LWM-Text-Chat-1M
Text Generation
• Updated • 1.6k
• 173
Updated • 1.29k
• 123
microsoft/orca-math-word-problems-200k
Viewer
• Updated • 200k • 11.2k
• 479
Preview
• Updated • 138
• 27
Viewer
• Updated • 52.5B • 650k
• 2.77k
Yukang/LongAlpaca-16k-length
Viewer
• Updated • 6.28k • 26
• 25
Viewer
• Updated • 51.8k • 31.7k
• 813
Viewer
• Updated • 343M • 566
• 11
NousResearch/json-mode-eval
Viewer
• Updated • 100 • 374
• 43
NousResearch/func-calling-eval-singleturn
Viewer
• Updated • 112 • 20
• 8
NousResearch/func-calling-eval-glaive
Viewer
• Updated • 100 • 17
• 9
legacy-datasets/wikipedia
Updated • 99.6k
• 624
Viewer
• Updated • 10.4B • 722k
• 557
open-web-math/open-web-math
Viewer
• Updated • 6.32M • 22.4k
• 333
codeparrot/github-code-clean
Viewer
• Updated • 11M • 13.3k
• 137
HuggingFaceFW/fineweb-edu-score-2
Viewer
• Updated • 13.9B • 19.5k
• 85
HuggingFaceFW/fineweb-edu
Viewer
• Updated • 3.5B • 358k
• 1.04k
Viewer
• Updated • 52k • 89.3k
• 948
Viewer
• Updated • 772k • 41
• 27
YeungNLP/WizardLM_evol_instruct_V2_143k
Viewer
• Updated • 143k • 6
• 11
Viewer
• Updated • 2.94M • 33.8k
• 1.52k
WizardLMTeam/WizardLM_evol_instruct_V2_196k
Viewer
• Updated • 143k • 4.29k
• 249
timdettmers/openassistant-guanaco
Viewer
• Updated • 10.4k • 12.3k
• 441
garage-bAInd/Open-Platypus
Viewer
• Updated • 24.9k • 11.1k
• 416
Viewer
• Updated • 3.71M • 1.23M
• 672
Updated • 268
• 225
Salesforce/xlam-function-calling-60k
Viewer
• Updated • 60k • 10.1k
• 605
HuggingFaceTB/smollm-corpus
Viewer
• Updated • 237M • 48.1k
• 450
glaiveai/glaive-function-calling-v2
Viewer
• Updated • 113k • 17k
• 501
mlfoundations/dclm-baseline-1.0-parquet
Viewer
• Updated • 2.73B • 8.21k
• 38
mlfoundations/dclm-baseline-1.0
Preview
• Updated • 118k
• 263
ruslanmv/ai-medical-chatbot
Viewer
• Updated • 257k • 1.08k
• 247
Viewer
• Updated • 100k • 10.9k
• 266
Viewer
• Updated • 69.9k • 189k
• 393
xzuyn/manythings-translations-alpaca
Viewer
• Updated • 6.33M • 31
• 8
Viewer
• Updated • 21.9M • 4.69k
• 712
Viewer
• Updated • 1.75M • 194
• 105
mlabonne/open-perfectblend
Viewer
• Updated • 1.42M • 1.77k
• 72
mlabonne/orca-agentinstruct-1M-v1-cleaned
Viewer
• Updated • 1.05M • 150
• 67
allenai/tulu-3-sft-mixture
Viewer
• Updated • 939k • 16.2k
• 235
NovaSky-AI/Sky-T1_data_17k
Viewer
• Updated • 16.4k • 3.11k
• 186
Viewer
• Updated • 552M • 712
• 3
Viewer
• Updated • 78.1M • 733
• 6
Viewer
• Updated • 1.13M • 246
• 11
Viewer
• Updated • 16.2M • 231
• 1
Viewer
• Updated • 172k • 50
• 2
Viewer
• Updated • 62.3k • 48
• 2
Viewer
• Updated • 72.1k • 21
• 1
lianghsun/tw-instruct-500k
Viewer
• Updated • 500k • 204
• 25