Qiyuan Zhang's picture

Qiyuan Zhang PRO

DonJoey

·

AI & ML interests

None yet

Recent Activity

upvoted a collection 3 days ago

updated a collection 3 days ago

updated a collection 3 days ago

View all activity

Organizations

None yet

upvoted a collection 3 days ago

RubricBench

2 items • Updated 3 days ago • 2

updated a collection 3 days ago

RubricBench

2 items • Updated 3 days ago • 2

upvoted a collection 3 days ago

Mix-GRM

We provide a collection about ``Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models'', including data, models, and paper • 5 items • Updated 3 days ago • 1

upvoted a paper 3 days ago

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Paper • 2603.01571 • Published 5 days ago • 32

updated a collection 3 days ago

Mix-GRM

We provide a collection about ``Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models'', including data, models, and paper • 5 items • Updated 3 days ago • 1

submitted a paper to Daily Papers 3 days ago

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Paper • 2603.01571 • Published 5 days ago • 32

authored 2 papers 4 days ago

From Verifiable Dot to Reward Chain: Harnessing Verifiable Reference-based Rewards for Reinforcement Learning of Open-ended Generation

Paper • 2601.18533 • Published Jan 26

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published 5 days ago • 51

updated a model 4 days ago

DonJoey/mix-grm-qwen3-8b-rl

8B • Updated 4 days ago • 57

upvoted a paper 4 days ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published 5 days ago • 51

submitted a paper to Daily Papers 4 days ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published 5 days ago • 51

published 2 datasets 6 days ago

DonJoey/mix-grm-sft-9k

Viewer • Updated 8 days ago • 8.99k • 3

DonJoey/mix-grm-rl-21k

Viewer • Updated 8 days ago • 21.9k • 3

updated a collection 6 days ago

Mix-GRM

We provide a collection about ``Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models'', including data, models, and paper • 5 items • Updated 3 days ago • 1

updated a model 6 days ago

DonJoey/mix-grm-qwen3-8b-sft

Updated 6 days ago • 19

published a model 6 days ago

DonJoey/mix-grm-qwen3-8b-sft

Updated 6 days ago • 19

updated a collection 6 days ago

Mix-GRM

We provide a collection about ``Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models'', including data, models, and paper • 5 items • Updated 3 days ago • 1

published a model 6 days ago

DonJoey/mix-grm-qwen3-8b-rl

8B • Updated 4 days ago • 57

published a dataset 6 days ago

DonJoey/rubricbench

Viewer • Updated 6 days ago • 1.15k • 28 • 2