arxiv:2502.13943
Junjie Lu
Lux0926
AI & ML interests
None yet
Organizations
models 18
Lux0926/Qwen1.5-32B-SFT-CGPO
33B • Updated
Lux0926/DeepSeekMath-Base-7B-SFT-CGPO
7B • Updated
Lux0926/Qwen2-7B-SFT-CGPO
8B • Updated • 1
Lux0926/MetaMath-Llama-8B-CGPO
8B • Updated • 1
Lux0926/MetaMath-Mistral-7B-CGPO
7B • Updated
Lux0926/MetaMath-Mistral-7B-Step-DPO
7B • Updated
Lux0926/MetaMath-Llama-8B-Step-DPO
8B • Updated
Lux0926/Deepseek-Coder-7B-Instruct-v1.5-CGPO
7B • Updated
Lux0926/ASPRM-D-ORM
7B • Updated
Lux0926/LCD-DS
7B • Updated
datasets 16
Lux0926/Qwen2-7B-SFT-CGPO-10k
Viewer • Updated • 10.8k • 6
Lux0926/Qwen1.5-32B-SFT-CGPO-10k
Viewer • Updated • 10.8k • 5
Lux0926/DeepSeekMath-Base-7B-SFT-CGPO-10k
Viewer • Updated • 10.8k • 4
Lux0926/Deepseek-Coder-7B-Instruct-v1.5-CGPO-10k
Viewer • Updated • 10.6k • 7
Lux0926/MetaMath-Llama-8B-CGPO-10k
Viewer • Updated • 10.8k • 4
Lux0926/MetaMath-Mistral-7B-CGPO-10k
Viewer • Updated • 10.8k • 8
Lux0926/ASPRM-BON-Evaluation-Dataset-Code
Preview • Updated • 115
Lux0926/ASPRM-BON-Evaluation-Dataset-Math
Preview • Updated • 133
Lux0926/ASPRM-Math-Rollout-Result
Viewer • Updated • 215k • 10
Lux0926/ASPRM-MATHCODE-DeepSeek-Training-Dataset
Viewer • Updated • 99.8k • 12