Bo Wang

Musicode

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

upvoted a paper about 4 hours ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

upvoted a paper about 22 hours ago

Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections

Organizations

None yet

upvoted 2 papers about 4 hours ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 2 days ago • 87

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published 2 days ago • 61

upvoted a paper about 22 hours ago

Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections

Paper • 2507.00018 • Published Jun 15, 2025 • 1

submitted a paper to Daily Papers about 22 hours ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 2 days ago • 87

upvoted a collection 4 days ago

MOSS-VL

Collection

2 items • Updated 7 days ago • 49

upvoted a paper about 1 month ago

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published Mar 5 • 56

liked 2 models 2 months ago

OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 29.8k • 211

OpenMOSS-Team/MOSS-TTS

Text-to-Speech • 8B • Updated 26 days ago • 46.9k • 376

upvoted a paper 6 months ago

Sparser Block-Sparse Attention via Token Permutation

Paper • 2510.21270 • Published Oct 24, 2025 • 25

liked a model 7 months ago

OpenMOSS-Team/MOSS-Speech

9B • Updated Sep 30, 2025 • 37 • 19

authored 2 papers 9 months ago

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

Paper • 2412.14135 • Published Dec 18, 2024

In-Memory Learning: A Declarative Learning Framework for Large Language Models

Paper • 2403.02757 • Published Mar 5, 2024

upvoted a paper 11 months ago

REARANK: Reasoning Re-ranking Agent via Reinforcement Learning

Paper • 2505.20046 • Published May 26, 2025 • 18
