felixZzz/c0105-8b-binary_KLmask_mu-cp2-shuffle-clip0.05-step_1460 Text Generation • 8B • Updated Jan 26
felixZzz/c0105-8b-binary_TVmask_mu-cp2-shuffle-clip100000-step_1670 Text Generation • 8B • Updated Jan 26 • 1
felixZzz/c0105-8b-binary_KLmask_mu-cp2-shuffle-clip0.05-step_1200 Text Generation • 8B • Updated Jan 26
felixZzz/c0105-8b-binary_TVmask_mu-cp2-shuffle-clip100000-step_1200 Text Generation • 8B • Updated Jan 26 • 3
felixZzz/np_4b_len16k_custom_teacher_custom_student_acc_rolloutY_principle_mix Viewer • Updated Nov 18, 2025 • 33.4k • 18
felixZzz/np_4b_len16k_custom_teacher_custom_student_acc_rolloutY_principle Viewer • Updated Nov 18, 2025 • 16.7k • 13
felixZzz/np_4b_len16k_custom_teacher_response-3-custom_8b_student_logps Viewer • Updated Nov 17, 2025 • 16.7k • 14
felixZzz/np_4b_len16k_custom_teacher_response-6-custom_8b_student_logps Viewer • Updated Nov 17, 2025 • 16.7k • 19
felixZzz/np_4b_len16k_custom_teacher_response-7-custom_8b_student_logps Viewer • Updated Nov 17, 2025 • 16.7k • 10
felixZzz/np_4b_len16k_custom_teacher_response-5-custom_8b_student_logps Viewer • Updated Nov 17, 2025 • 16.7k • 11
felixZzz/np_4b_len16k_custom_teacher_response-4-custom_8b_student_logps Viewer • Updated Nov 17, 2025 • 16.7k • 18