Qwen Pilot

QwenPilot · qwenpilot

AI & ML interests

None yet

Organizations

None yet

upvoted 3 papers about 20 hours ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 120

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 9 days ago • 28

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 9 days ago • 7

upvoted a paper 1 day ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 13 days ago • 288