CompassioninMachineLearning/PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch Text Generation • 8B • Updated 11 days ago • 110
CompassioninMachineLearning/PretrainingBasellama3kv3_plus3kcodingGRPO1epoch Text Generation • 8B • Updated 14 days ago • 231
CompassioninMachineLearning/Instruct8b_constitutitutionfinetune_step200 Text Generation • 8B • Updated 25 days ago • 48