arxiv:2407.18418
Wen
Byanka
AI & ML interests
None yet
Organizations
models 30
Byanka/vgrpo-hotpot1_1.5B-Instruct • Text Generation • 2B • Updated • 2
Byanka/confgrpo-hotpot_1-1.5B-Instruct_new • Updated
Byanka/RLVR-hotpot1_1.5B-Instruct • Text Generation • 2B • Updated • 2
Byanka/RLVR-hotpot_3b • Text Generation • 3B • Updated • 2
Byanka/RLCR-hotpot_1-1.5B-Instruct • Text Generation • 2B • Updated • 2
Byanka/RLVR-hotpot1_1.5B • Text Generation • 2B • Updated • 2
Byanka/RLCR-hotpot_1-1.5B • Text Generation • 2B • Updated • 2
Byanka/RLCR-hotpot_3b • Updated
Byanka/RLCR-hotpot • Text Generation • 8B • Updated • 1
Byanka/RLVR-hotpot • Text Generation • 8B • Updated • 1