
Zhenxiong Tan PRO

Yuanshi

AI & ML interests

Reinforcement Learning; Large Language Models; Multimodality; AI Infrastructure

Recent Activity

upvoted a paper 12 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

upvoted a paper 13 days ago

Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models

authored a paper 13 days ago

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer


Organizations

upvoted a paper 12 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published 14 days ago • 183

upvoted 2 papers 13 days ago

Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models

Paper • 2603.15557 • Published 14 days ago • 28

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Paper • 2603.15478 • Published 14 days ago • 24

upvoted a collection 19 days ago

MiroThinker-1.7

Collection

2 items • Updated 19 days ago • 54

upvoted a paper about 2 months ago

dVoting: Fast Voting for dLLMs

Paper • 2602.12153 • Published Feb 12 • 21

upvoted 4 papers 3 months ago

upvoted 4 papers 4 months ago

Vision Bridge Transformer at Scale

Paper • 2511.23199 • Published Nov 28, 2025 • 46

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 54

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published Nov 24, 2025 • 32

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 194

upvoted a paper 5 months ago

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17, 2025 • 51

upvoted 2 papers 6 months ago

MixReasoning: Switching Modes to Think

Paper • 2510.06052 • Published Oct 7, 2025 • 22

dParallel: Learnable Parallel Decoding for dLLMs

Paper • 2509.26488 • Published Sep 30, 2025 • 19

upvoted a paper 9 months ago

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published Jul 7, 2025 • 67

upvoted 3 papers 10 months ago

Test3R: Learning to Reconstruct 3D at Test Time

Paper • 2506.13750 • Published Jun 16, 2025 • 27

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16, 2025 • 43

Image Editing As Programs with Diffusion Models

Paper • 2506.04158 • Published Jun 4, 2025 • 24

