·
AI & ML interests
None yet
Organizations
None yet
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-Simple-step-150
8B • Updated
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-Bal-step-260
8B • Updated
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-Bal-step-90
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-step-100
8B • Updated
• 1
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-step-150
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-step-200
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-step-50
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-LR-step-50
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-RL-step-50
8B • Updated
• 3
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-Photo-All-step-300
8B • Updated
Allen-UQ/Qwen2.5-7B-Instruct-GRPO-Photo-Mixed-step-300
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Photo-Meaningless-Direct-step-550
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Photo-Meaningless-Direct-step-200
8B • Updated
• 1
Allen-UQ/Qwen2.5-7B-GRPO-Photo-Meaningless-step-100
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-Nei-step-60
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-Count-step-60
Allen-UQ/Qwen2.5-7B-GRPO-Cora-LD-step-40
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-Fruit-step-60
Allen-UQ/Qwen2.5-7B-GRPO-Cora-notarget-step-200
Allen-UQ/Qwen2.5-7B-GRPO-Cora-homo-step-120
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-homo-step-120
Allen-UQ/Qwen2.5-7B-Simple-Pubmed-step-10
8B • Updated
• 1
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-homo-step-90
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-homo-step-60
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-homo-step-30
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-label-id-step-40
8B • Updated
• 1
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-label-id-step-20
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-step-70
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-step-65
8B • Updated
Allen-UQ/Qwen2.5-7B-GRPO-Pubmed-step-60
8B • Updated