13 6

Shiyu Feng

mziyiw

AI & ML interests

None yet

Recent Activity

upvoted a paper about 11 hours ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

upvoted a paper about 22 hours ago

Adam's Law: Textual Frequency Law on Large Language Models

upvoted a paper 1 day ago

AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper about 11 hours ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 4 days ago • 226

upvoted a paper about 22 hours ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 10 days ago • 347

upvoted a paper 1 day ago

AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning

Paper • 2604.05846 • Published 5 days ago • 5

liked a model 3 days ago

kairawal/Qwen3-0.6B-PT-SynthDolly-1A-E3

Text Generation • 0.6B • Updated 3 days ago • 191 • 1

liked a model 8 days ago

alkav/fsake-checkpoints

Updated 8 days ago • 1

upvoted a paper 11 days ago

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published 18 days ago • 182

liked a dataset 11 days ago

NerveGear/tst

Updated 6 days ago • 74 • 1

liked a model 11 days ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 237k • • 2.46k

upvoted 2 papers 23 days ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published about 1 month ago • 144

Demystifing Video Reasoning

Paper • 2603.16870 • Published 25 days ago • 367

upvoted a paper about 1 month ago

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published Mar 6 • 92

liked a dataset about 1 month ago

LeeXiangNO1/DyNativeGaussian_sequence

Preview • Updated 19 days ago • 6.04k • 53

upvoted 2 papers about 1 month ago

The Trinity of Consistency as a Defining Principle for General World Models

Paper • 2602.23152 • Published Feb 26 • 201

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 151

upvoted a paper about 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

liked a model about 2 months ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 28 days ago • 815k • • 1.43k

upvoted 3 papers about 2 months ago

Shiyu Feng

AI & ML interests

Recent Activity

Organizations

mziyiw's activity