AI & ML interests
None yet
Organizations
None yet
ShikangWang/mistral_12b_sft_dpo
12B
•
Updated
ShikangWang/mistral_12b_sft_300k
12B
•
Updated
ShikangWang/pk_family_0.3_grpo_sft_filter_kl_0.001
12B
•
Updated
ShikangWang/pk_grpo_sft_filter_kl_0.001
12B
•
Updated
ShikangWang/mistral_12b_sft_roleplay
12B
•
Updated
•
3
•
1
ShikangWang/smo-family-v2-0.3-filter-1127
2B
•
Updated
ShikangWang/mistral_12b_sft_1125
12B
•
Updated
•
1
ShikangWang/smo-family-v2-0.3-filter-1126
2B
•
Updated
ShikangWang/pk_grpo_sft_filter_kl_0.002_en_0.005
12B
•
Updated
ShikangWang/smo-family-0.3-filter_ep1
2B
•
Updated
ShikangWang/pk_family_0.3_grpo_sft_filter_kl_0.02_en_0.01
12B
•
Updated
ShikangWang/pk_family_0.3_grpo_sft_filter
12B
•
Updated
ShikangWang/mistral_12b_sft
12B
•
Updated
•
1
ShikangWang/smo-family-0.3-filter
2B
•
Updated
•
2
ShikangWang/pk_family_0.3_grpo_src
12B
•
Updated
ShikangWang/pk_family_0.0_grpo
12B
•
Updated
ShikangWang/pk_family_0.3_grpo
12B
•
Updated
•
1
2B
•
Updated
ShikangWang/smo-pk-family
2B
•
Updated
ShikangWang/mistral_12b_grpo_safe20k
12B
•
Updated
•
169
0.4B
•
Updated
ShikangWang/model110_grpo_safe_20kv2
12B
•
Updated
•
2
ShikangWang/model110_grpo_safe_20k
12B
•
Updated
ShikangWang/model110_grpo_50k
12B
•
Updated
•
3
ShikangWang/model110_grpo_10k
12B
•
Updated
ShikangWang/model110_dpo_ftx_10_filter20_step7500
12B
•
Updated
ShikangWang/model110_dpo_ftx_5_filter20
12B
•
Updated