Collection dedicated to all the datasets, checkpoints and any additional artifacts for Tiny Think
Bojan Jakimovski
Shekswess
AI & ML interests
AWS Ambassador | Machine Learning Lead | College Professor | GenAI | MLOps
Recent Activity
liked
a Space
3 days ago
eliebak/sparsity-viz
updated
a dataset
7 days ago
Shekswess/fineweb-edu-700m
published
a dataset
7 days ago
Shekswess/fineweb-edu-700m
Organizations
Tiny Reasoning Language Model
Collection dedicated to the development of the Tiny Reasoning Language Model (trlm)
-
Shekswess/trlm-135m
Text Generation • 0.1B • Updated • 61 • 46 -
Shekswess/trlm-stage-3-dpo-final-2
Text Generation • 0.1B • Updated • 1 • 1 -
Shekswess/trlm-stage-2-sft-final-2
Text Generation • 0.1B • Updated • 3 • 1 -
Shekswess/trlm-stage-1-sft-final-2
Text Generation • 0.1B • Updated • 2 • 1
Tiny Think
Collection dedicated to all the datasets, checkpoints and any additional artifacts for Tiny Think
Tiny Reasoning Language Model
Collection dedicated to the development of the Tiny Reasoning Language Model (trlm)
-
Shekswess/trlm-135m
Text Generation • 0.1B • Updated • 61 • 46 -
Shekswess/trlm-stage-3-dpo-final-2
Text Generation • 0.1B • Updated • 1 • 1 -
Shekswess/trlm-stage-2-sft-final-2
Text Generation • 0.1B • Updated • 3 • 1 -
Shekswess/trlm-stage-1-sft-final-2
Text Generation • 0.1B • Updated • 2 • 1
models
31
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_3-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
75
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta1-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
70
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_5-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
69
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
148
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr5e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
75
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
73
Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
69
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
69
Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
125
Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e2-bs8
Text Generation
•
0.1B
•
Updated
•
63
datasets
35
Shekswess/fineweb-edu-700m
Viewer
•
Updated
•
681k
•
26
Shekswess/tiny-think-sft-math-n-stem
Viewer
•
Updated
•
29.1k
•
56
Shekswess/tiny-think-dpo-math-n-stem
Viewer
•
Updated
•
2.86k
•
80
Shekswess/trlm-sft-stage-1-final-2
Viewer
•
Updated
•
58k
•
6
Shekswess/trlm-sft-stage-2-final-2
Viewer
•
Updated
•
78k
•
185
Shekswess/trlm-dpo-stage-3-final-2
Viewer
•
Updated
•
50k
•
33
Shekswess/customer-support
Viewer
•
Updated
•
1k
•
24
•
1
Shekswess/scientific-research
Viewer
•
Updated
•
1k
•
9
•
4
Shekswess/technical-manuals
Viewer
•
Updated
•
1k
•
17
•
4
Shekswess/legal-documents
Viewer
•
Updated
•
1k
•
53
•
5