Cheng Wang's picture

Cheng Wang

LLucass

·

https://wangcheng0116.github.io/

WangCheng0116

AI & ML interests

None yet

Organizations

LLucass 's models 114

LLucass/Tanh_PRESS_GRPO_1.0_beta_0.01_n_generations_12

2B • Updated Jun 22, 2025 • 1

LLucass/Tanh_PRESS_GRPO_4.0_beta_0.01_n_generations_12

2B • Updated Jun 22, 2025

LLucass/Tanh_PRESS_GRPO_0.5_beta_0.01_n_generations_12

2B • Updated Jun 22, 2025

LLucass/PRESS_GRPO_2.0_beta_0.01_n_generation_12

2B • Updated Jun 22, 2025

LLucass/GRPO_beta_0.01_n_generation_12

2B • Updated Jun 22, 2025

LLucass/Tanh_PRESS_GRPO_2.0_beta_0.04

2B • Updated Jun 22, 2025 • 1

LLucass/Tanh_PRESS_GRPO_1.0_beta_0.04

2B • Updated Jun 22, 2025

LLucass/Tanh_PRESS_GRPO_2.0_beta_0.01

2B • Updated Jun 22, 2025

LLucass/ACC_GRPO_beta_0.01

2B • Updated Jun 22, 2025 • 1

LLucass/ACC_PRESS_GRPO_2.0_beta_0.01

2B • Updated Jun 22, 2025

LLucass/PRESS_GRPO_4.0_beta_0.01

2B • Updated Jun 22, 2025

LLucass/PRESS_GRPO_2.0_beta_0.01

2B • Updated Jun 22, 2025

LLucass/GRPO_beta_0.01

2B • Updated Jun 22, 2025

LLucass/PRESS_GRPO_2.0_beta_0.001

2B • Updated Jun 21, 2025

LLucass/PRESS_GRPO_1.0_beta_0.001

2B • Updated Jun 21, 2025

LLucass/GRPO_beta

Updated Jun 21, 2025

LLucass/PRESS_GRPO_0.5_beta_0.001

2B • Updated Jun 21, 2025

LLucass/GRPO_beta_0.001

2B • Updated Jun 21, 2025

LLucass/PRESS_GRPO_0.2

2B • Updated Jun 21, 2025

LLucass/PRESS_GRPO_4.0

Updated Jun 21, 2025

LLucass/PRESS_GRPO_2.0

2B • Updated Jun 21, 2025

LLucass/PRESS_GRPO_1.5

Updated Jun 21, 2025

LLucass/PRESS_GRPO_1.0

2B • Updated Jun 21, 2025

LLucass/PRESS_GRPO_0.5

2B • Updated Jun 21, 2025 • 1

LLucass/DR_GRPO

Updated Jun 21, 2025

LLucass/qwen-math-7b-entropy-top1k

Updated Jun 17, 2025

LLucass/Entropy-Maximization-All-Step2

8B • Updated Jun 14, 2025

LLucass/Entropy-Minimization-All-Step2

8B • Updated Jun 14, 2025

LLucass/Entropy-Maximization-Bot20-Step2

8B • Updated Jun 14, 2025

LLucass/FF_L0.2_H0.2_grpo

Text Generation • 2B • Updated Jun 13, 2025