Perceptual-Evidence Anchored Reinforced Learning for Multimodal Reasoning
Zhangchi
Rex1090
AI & ML interests
None yet
Recent Activity
upvoted a paper about 18 hours ago
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence
updated a model about 2 months ago
Rex1090/PEARL-8B
upvoted a paper about 2 months ago
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR
Organizations
None yet