Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
4
wang
wzx111
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
6 days ago
wzx111/14B-Aggressive-OPO-Delta-LR2e-6-G32
published
a model
6 days ago
wzx111/14B-Aggressive-OPO-Delta-LR2e-6-G32
updated
a model
6 days ago
wzx111/14B-Aggressive-GSPO-LR2e-6-G32
View all activity
Organizations
None yet
wzx111
's models
10
Sort: Recently updated
wzx111/14B-Aggressive-OPO-Delta-LR2e-6-G32
Updated
6 days ago
wzx111/14B-Aggressive-GSPO-LR2e-6-G32
Updated
6 days ago
wzx111/Qwen3-1.7B-GRPO-math
Updated
Nov 29, 2025
wzx111/Qwen3-1.7B-Open-R1-ADPO
Text Generation
•
2B
•
Updated
Nov 23, 2025
•
1
wzx111/Qwen3-1.7B-Open-R1-GRPO-Baseline
Text Generation
•
2B
•
Updated
Nov 22, 2025
•
1
wzx111/Qwen3-1.7B-Open-R1-GRPO
2B
•
Updated
May 14, 2025
wzx111/Qwen3-1.7B-Open-R1-GDPO-epcoh_
Text Generation
•
2B
•
Updated
May 14, 2025
wzx111/Qwen3-1.7B-MATH-GDPO-EPOCH2
Text Generation
•
2B
•
Updated
May 2, 2025
•
1
wzx111/Qwen3-1.7B-MATH-GDPO
Text Generation
•
2B
•
Updated
May 1, 2025
•
5
•
1
wzx111/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
Apr 28, 2025