arxiv:2601.22975
Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted
a
paper
about 17 hours ago
PhyCritic: Multimodal Critic Models for Physical AI
updated
a dataset
7 days ago
OpenRLHF/aime-2024
updated
a dataset
7 days ago
OpenRLHF/dapo-math-17k