·
AI & ML interests
None yet
Organizations
None yet
models 16
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_4
Text Generation
• 2B • Updated
• 3
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_2
Text Generation
• 2B • Updated
• 2
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_6
Text Generation
• 2B • Updated
• 2
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_2_epoch_2
Text Generation
• 2B • Updated
• 2
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_1
Updated
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_12
2B • Updated
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_3
Text Generation
• 2B • Updated
• 2
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-3-24-1300
Updated
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-3-24
Updated
EricLabile/Qwen-2.5-7B-Simple-RL
Updated