9

Mashiro

AlexMashiro

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

upvoted a paper about 1 month ago

RM-R1: Reward Modeling as Reasoning

upvoted a paper about 2 months ago

Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling

Organizations

None yet

models 0

None public yet

datasets 0

None public yet