In a Training Loop 🔄

Sijia Cui

cuisijia

https://github.com/SijiaCui

AI & ML interests

None yet

Recent Activity

authored a paper 8 days ago

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

upvoted a collection 15 days ago

liked a dataset 20 days ago

rafaelpadilla/coco2017

Organizations

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

Paper • 2603.10101 • Published 21 days ago • 5