Jiajie Zhang
NeoZ123
13 followers · 2 following
Neo-Zhangjiajie
AI & ML interests
None yet
Recent Activity
Upvoted a paper (about 1 month ago): Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
Submitted a paper (about 1 month ago): Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
Published a dataset (about 1 month ago): THU-KEG/CaRR-DeepDive
Organizations
NeoZ123's models: 2
NeoZ123/LongReward-llama3.1-8b-SFT • Text Generation • 9B • Updated Oct 29, 2024 • 9 • 1
NeoZ123/LongReward-glm4-9b-SFT • Text Generation • 9B • Updated Oct 29, 2024 • 4