·
AI & ML interests
None yet
Organizations
None yet
ou474747/Qwen2.5-0.5B-Instruct-sft-rl_cost-20260126_113619
ou474747/Qwen2.5-0.5B-Instruct-sft-rl_cost-20260126_094043
ou474747/Qwen2.5-0.5B-Instruct-sft-rl_cost-20260126_074539
ou474747/L3.2-1B-Ins-sft-cost-251216_025906
Updated
ou474747/L3.2-1B-Ins-sft-cost-251215_231340-ckpt-7500-sft-cost-251218_010241
Updated
ou474747/L3.2-1B-Ins-sft-cost-251215_231340-ckpt-7500-sft-cost-251217_214905
Updated
ou474747/L3.2-1B-Ins-sft-cost-251215_231340
Updated
ou474747/L3.2-1B-Ins-sft-cost-251215_123032
Updated
ou474747/L3.2-1B-Ins-pre_rl-251214_232231
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_strong
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-sft-cot-lr3e-5_sched-linear_ep3_bs8_gs4_multi
2B • Updated • 1
ou474747/DSR1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_baselineB_multi
Updated
ou474747/DSR1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_baselineB_single
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-sft-cot-lr3e-5_sched-linear_ep3_bs8_gs4_single
2B • Updated • 1
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_codedowB
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr5.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_high_lrB
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-linear_ep3_bs8_gs4_linear_schedB
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_code_upB
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_baselineB
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr1.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_low_lrB
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_strong
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr5.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_high_lr
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-linear_ep3_bs8_gs4_linear_sched
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_code_up
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_code_down
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr2.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_baseline
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-rl-sft-cot-lr1.0e-6_sched-cosine_with_min_lr_ep3_bs8_gs4_low_lr
Updated
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-sft-cot-lr3e-5_sched-linear_ep3_bs8_gs4
2B • Updated • 1
ou474747/DeepSeek-R1-Distill-Qwen-1.5B-sft-cot-lr3e-5_sched-cosine_ep3_bs8_gs4
2B • Updated • 1