·
AI & ML interests
None yet
Organizations
LLucass/Tanh_PRESS_GRPO_1.0_beta_0.01_n_generations_12
2B • Updated • 1
LLucass/Tanh_PRESS_GRPO_4.0_beta_0.01_n_generations_12
2B • Updated LLucass/Tanh_PRESS_GRPO_0.5_beta_0.01_n_generations_12
2B • Updated LLucass/PRESS_GRPO_2.0_beta_0.01_n_generation_12
2B • Updated LLucass/GRPO_beta_0.01_n_generation_12
2B • Updated LLucass/Tanh_PRESS_GRPO_2.0_beta_0.04
2B • Updated • 1
LLucass/Tanh_PRESS_GRPO_1.0_beta_0.04
2B • Updated LLucass/Tanh_PRESS_GRPO_2.0_beta_0.01
2B • Updated LLucass/ACC_GRPO_beta_0.01
2B • Updated • 1
LLucass/ACC_PRESS_GRPO_2.0_beta_0.01
2B • Updated LLucass/PRESS_GRPO_4.0_beta_0.01
2B • Updated LLucass/PRESS_GRPO_2.0_beta_0.01
2B • Updated 2B • Updated LLucass/PRESS_GRPO_2.0_beta_0.001
2B • Updated LLucass/PRESS_GRPO_1.0_beta_0.001
2B • Updated LLucass/PRESS_GRPO_0.5_beta_0.001
2B • Updated 2B • Updated 2B • Updated 2B • Updated 2B • Updated 2B • Updated • 1
LLucass/qwen-math-7b-entropy-top1k
Updated
LLucass/Entropy-Maximization-All-Step2
8B • Updated LLucass/Entropy-Minimization-All-Step2
8B • Updated LLucass/Entropy-Maximization-Bot20-Step2
8B • Updated LLucass/FF_L0.2_H0.2_grpo
Text Generation
• 2B • Updated