1 34 14

Ruihan Yang

rhyang2021

https://github.com/rhyang2021

rhyang2021

AI & ML interests

NLP, Agent Learning, Uncertainty

Recent Activity

upvoted a paper 4 days ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

upvoted a collection 4 days ago

AI Lab

upvoted a paper about 1 month ago

WideSeek: Advancing Wide Research via Multi-Agent Scaling

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published 7 days ago • 102

upvoted a collection 4 days ago

AI Lab

Collection

4 items • Updated 3 days ago • 10

upvoted 3 papers about 1 month ago

WideSeek: Advancing Wide Research via Multi-Agent Scaling

Paper • 2602.02636 • Published Feb 2 • 15

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Paper • 2601.21037 • Published Jan 28 • 15

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 255

upvoted a paper 2 months ago

Confidence Estimation for LLMs in Multi-turn Interactions

Paper • 2601.02179 • Published Jan 5 • 17

liked a dataset 5 months ago

rhyang2021/UNCLE

Viewer • Updated Oct 9, 2025 • 1.07k • 15 • 1

updated a dataset 5 months ago

rhyang2021/UNCLE

Viewer • Updated Oct 9, 2025 • 1.07k • 15 • 1

published a dataset 5 months ago

rhyang2021/UNCLE

Viewer • Updated Oct 9, 2025 • 1.07k • 15 • 1

upvoted a paper 5 months ago

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26, 2025 • 70

upvoted a paper 6 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 82

liked a model 7 months ago

Alibaba-NLP/WebDancer-32B

Text Generation • Updated Jun 26, 2025 • 15 • • 57

liked a model 8 months ago

MASWorks/MAS-GPT-32B

Text Generation • 33B • Updated Jul 14, 2025 • 4 • 4

upvoted 2 papers 8 months ago

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published Feb 12, 2025 • 59

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16, 2025 • 20

upvoted 2 papers 9 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 338

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

upvoted a collection 9 months ago

MiniMax-M1

Collection

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 28 days ago • 118

upvoted 2 papers 9 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

ATLaS: Agent Tuning via Learning Critical Steps

Paper • 2503.02197 • Published Mar 4, 2025 • 9

Ruihan Yang

AI & ML interests

Recent Activity

Organizations

rhyang2021's activity