daqi

Sunshine8393

AI & ML interests

None yet

Organizations

None yet

Recent Activity

authored a paper 1 day ago

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Paper • 2603.15600 • Published 3 days ago • 5

upvoted a paper 2 days ago

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Paper • 2603.15600 • Published 3 days ago • 5

upvoted a collection 2 days ago

PRIMO R1

Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning, featuring RL-optimized models, SFT/RL datasets, and a cross-domain benchmark • 3 items • Updated 2 days ago • 3

updated a dataset 4 months ago

Sunshine8393/RoboTwinQA_new

Updated Nov 7, 2025 • 6

published a dataset 4 months ago

Sunshine8393/RoboTwinQA_new

Updated Nov 7, 2025 • 6
