Shaobai Jiang

shaobaij

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

upvoted a paper about 21 hours ago

SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning

upvoted a paper about 21 hours ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Organizations

None yet

upvoted 3 papers about 21 hours ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Paper • 2603.25744 • Published 19 days ago • 13

SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning

Paper • 2603.22057 • Published 22 days ago • 46

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published 28 days ago • 109

upvoted 2 papers about 23 hours ago

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Paper • 2603.21289 • Published 23 days ago • 35

Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought

Paper • 2603.22847 • Published 21 days ago • 26

upvoted 8 papers 1 day ago

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Paper • 2603.24844 • Published 20 days ago • 10

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 22 days ago • 9

mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT

Paper • 2603.21606 • Published 22 days ago • 39

Scalable Prompt Routing via Fine-Grained Latent Task Discovery

Paper • 2603.19415 • Published 26 days ago • 7

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 21 days ago • 62

Understanding the Challenges in Iterative Generative Optimization with LLMs

Paper • 2603.23994 • Published 20 days ago • 28

Learning to Commit: Generating Organic Pull Requests via Online Repository Memory

Paper • 2603.26664 • Published 18 days ago • 9

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 20 days ago • 130

upvoted 7 papers 2 days ago

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Paper • 2603.22386 • Published 22 days ago • 55

ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

Paper • 2603.24621 • Published 21 days ago • 2

Agentic AI and the next intelligence explosion

Paper • 2603.20639 • Published 25 days ago • 10

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published 13 days ago • 31

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published 13 days ago • 93

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

Paper • 2604.00917 • Published 14 days ago • 18

AgentSocialBench: Evaluating Privacy Risks in Human-Centered Agentic Social Networks

Paper • 2604.01487 • Published 14 days ago • 10
