1 13 3

Tong He

tonghe90

http://tonghe90.github.io

AI & ML interests

SII is an institution dedicated to innovation in education and research in the field of AI

Recent Activity

upvoted a paper 4 days ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

upvoted a paper 10 days ago

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

upvoted a paper 3 months ago

VINO: A Unified Visual Generator with Interleaved OmniModal Context

View all activity

Organizations

upvoted a paper 4 days ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published 8 days ago • 48

upvoted a paper 10 days ago

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Paper • 2603.23497 • Published 10 days ago • 90

upvoted 2 papers 3 months ago

VINO: A Unified Visual Generator with Interleaved OmniModal Context

Paper • 2601.02358 • Published Jan 5 • 30

Yume-1.5: A Text-Controlled Interactive World Generation Model

Paper • 2512.22096 • Published Dec 26, 2025 • 61

upvoted a paper 6 months ago

BRIDGE - Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation

Paper • 2509.25077 • Published Sep 29, 2025 • 15

upvoted a paper 7 months ago

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Paper • 2509.15185 • Published Sep 18, 2025 • 29

authored 5 papers 7 months ago

upvoted a paper 7 months ago

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Paper • 2509.12201 • Published Sep 15, 2025 • 107

liked a dataset 7 months ago

InternRobotics/OmniWorld

Viewer • Updated 13 days ago • 6.35B • 31.5k • 89

liked a model 7 months ago

facebook/MobileLLM-R1-950M

Text Generation • 0.9B • Updated Sep 30, 2025 • 199 • 282

upvoted a paper 7 months ago

WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool

Paper • 2509.05296 • Published Sep 5, 2025 • 8

upvoted 2 papers 8 months ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4, 2025 • 138

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published Jul 23, 2025 • 92

authored 2 papers 9 months ago

Aether: Geometric-Aware Unified World Modeling

Paper • 2503.18945 • Published Mar 24, 2025 • 28

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17, 2025 • 67

upvoted a paper 9 months ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17, 2025 • 67

Tong He

AI & ML interests

Recent Activity

Organizations

tonghe90's activity