Yifan Wang's picture

Yifan Wang

AmberYifan

·

AI & ML interests

None yet

Recent Activity

authored a paper about 7 hours ago

Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control

upvoted a paper about 18 hours ago

Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control

published a model 4 months ago

AmberYifan/Qwen2.5-3B-MATH-MARL-structure-only

View all activity

Organizations

authored a paper about 7 hours ago

Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control

Paper • 2604.26326 • Published 4 days ago • 11

authored 4 papers 7 months ago

LLMs Can Get "Brain Rot"!

Paper • 2510.13928 • Published Oct 15, 2025 • 23

Cascade Reward Sampling for Efficient Decoding-Time Alignment

Paper • 2406.16306 • Published Jun 24, 2024 • 1

DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning

Paper • 2510.02341 • Published Sep 27, 2025 • 4

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment

Paper • 2504.02193 • Published Apr 3, 2025 • 1