Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Qwen Pilot's picture
4

Qwen Pilot

QwenPilot
eac123's profile picture Enigrand's profile picture philipp-zettl's profile picture
·
  • qwenpilot

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago
Quantile Advantage Estimation for Entropy-Safe Reasoning
upvoted a paper about 20 hours ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
upvoted a paper about 20 hours ago
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs
View all activity

Organizations

None yet

upvoted 3 papers about 20 hours ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 120

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 9 days ago • 28

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 9 days ago • 7
upvoted a paper 1 day ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 13 days ago • 288
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs