RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback Paper โข 2507.15024 โข Published Jul 20, 2025 โข 14