Systematic SFT for Qwen3-4B. We explore diverse dataset compositions and training recipes to benchmark and improve performance across tasks.
AI & ML interests
Pioneering the Next Era of AI with Vector Intelligence
Recent Activity
models
35
dnotitia/Qwen3-0.6B-Base
Text Generation
•
0.6B
•
Updated
•
14
dnotitia/Qwen3-0.6B
Text Generation
•
0.8B
•
Updated
•
67
dnotitia/Qwen3-1.7B-Base
Text Generation
•
2B
•
Updated
•
5
dnotitia/Qwen3-1.7B
Text Generation
•
2B
•
Updated
•
22
dnotitia/Qwen3-4B-Base
Text Generation
•
4B
•
Updated
•
16
dnotitia/Qwen3-4B
Text Generation
•
4B
•
Updated
•
41
dnotitia/Qwen3-4B-Instruct-2507
Text Generation
•
4B
•
Updated
•
81
dnotitia/Qwen3-4B-Thinking-2507
Text Generation
•
4B
•
Updated
•
235
dnotitia/DNA-2.1-14B
Text Generation
•
15B
•
Updated
•
6
•
1
dnotitia/DNA-2.0-14B
Text Generation
•
15B
•
Updated
•
113
•
11