Jiajie Zhang's picture

Jiajie Zhang

NeoZ123

·

Neo-Zhangjiajie

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

submitted a paper about 1 month ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

published a dataset about 1 month ago

THU-KEG/CaRR-DeepDive

View all activity

Organizations

NeoZ123 's models 2

NeoZ123/LongReward-llama3.1-8b-SFT

Text Generation • 9B • Updated Oct 29, 2024 • 9 • 1

NeoZ123/LongReward-glm4-9b-SFT

Text Generation • 9B • Updated Oct 29, 2024 • 4