·
AI & ML interests
None yet
Organizations
Lux0926/Qwen2-7B-SFT-CGPO-10k
Viewer
• Updated • 10.8k • 6
Lux0926/Qwen1.5-32B-SFT-CGPO-10k
Viewer
• Updated • 10.8k • 5
Lux0926/DeepSeekMath-Base-7B-SFT-CGPO-10k
Viewer
• Updated • 10.8k • 4
Lux0926/Deepseek-Coder-7B-Instruct-v1.5-CGPO-10k
Viewer
• Updated • 10.6k • 7
Lux0926/MetaMath-Llama-8B-CGPO-10k
Viewer
• Updated • 10.8k • 4
Lux0926/MetaMath-Mistral-7B-CGPO-10k
Viewer
• Updated • 10.8k • 7
Lux0926/ASPRM-BON-Evaluation-Dataset-Code
Preview
• Updated • 114
Lux0926/ASPRM-BON-Evaluation-Dataset-Math
Preview
• Updated • 125
Lux0926/ASPRM-Math-Rollout-Result
Viewer
• Updated • 215k • 9
Lux0926/ASPRM-MATHCODE-DeepSeek-Training-Dataset
Viewer
• Updated • 99.8k • 11
Lux0926/ASPRM-MATHCODE-Mistral-Training-Dataset
Viewer
• Updated • 438k • 11
Lux0926/ASPRM-D-Training-Dataset
Viewer
• Updated • 49.9k • 2
Lux0926/ASPRM-L-Training-Dataset
Viewer
• Updated • 372k • 17
Lux0926/ASPRM-D-Training-Dataset-ORM
Viewer
• Updated • 49.9k • 7
Lux0926/ASPRM-M-Training-Dataset
Viewer
• Updated • 388k • 6
Lux0926/ASPRM-Code-Rollout-Result