4 124 6

Donghao Zhou

donghao-zhou

https://correr-zhou.github.io

AI & ML interests

Generative AI

Recent Activity

upvoted a paper about 13 hours ago

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

upvoted a paper 1 day ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

upvoted a paper 5 days ago

Context Unrolling in Omni Models

View all activity

Organizations

upvoted a paper about 13 hours ago

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

Paper • 2604.24625 • Published 3 days ago • 23

upvoted a paper 1 day ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 3 days ago • 110

upvoted a paper 5 days ago

Context Unrolling in Omni Models

Paper • 2604.21921 • Published 7 days ago • 12

upvoted a paper 7 days ago

CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation

Paper • 2604.19636 • Published 9 days ago • 86

upvoted a paper 12 days ago

OneHOI: Unifying Human-Object Interaction Generation and Editing

Paper • 2604.14062 • Published 15 days ago • 8

upvoted a paper 14 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 15 days ago • 153

upvoted a paper 16 days ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published 17 days ago • 70

upvoted a paper 18 days ago

LPM 1.0: Video-based Character Performance Model

Paper • 2604.07823 • Published 21 days ago • 77

upvoted 2 papers 22 days ago

AURA: Always-On Understanding and Real-Time Assistance via Video Streams

Paper • 2604.04184 • Published 25 days ago • 50

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 24 days ago • 203

upvoted a paper 28 days ago

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published Mar 29 • 68

upvoted a paper 29 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 145

upvoted 3 papers about 1 month ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published Mar 24 • 62

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Paper • 2603.15030 • Published Mar 16 • 21

MosaicMem: Hybrid Spatial Memory for Controllable Video World Models

Paper • 2603.17117 • Published Mar 17 • 87

upvoted 4 papers about 2 months ago

HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Paper • 2603.02210 • Published Mar 2 • 29

upvoted a paper 2 months ago

World Action Models are Zero-shot Policies

Paper • 2602.15922 • Published Feb 17 • 18

Donghao Zhou

AI & ML interests

Recent Activity

Organizations

donghao-zhou's activity