Bo Wang's picture

Bo Wang

Musicode

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

upvoted a paper about 4 hours ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

upvoted a paper about 22 hours ago

Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections

View all activity

Organizations

None yet

submitted a paper to Daily Papers about 22 hours ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 2 days ago • 87

authored 2 papers 9 months ago

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

Paper • 2412.14135 • Published Dec 18, 2024

In-Memory Learning: A Declarative Learning Framework for Large Language Models

Paper • 2403.02757 • Published Mar 5, 2024