Yingjie Lei's picture

Yingjie Lei

ChaceLei2004

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

upvoted a paper about 2 hours ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

upvoted a paper about 8 hours ago

QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation

View all activity

Organizations

None yet

upvoted 2 papers about 2 hours ago

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Paper • 2604.14268 • Published 5 days ago • 98

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published 12 days ago • 112

upvoted a paper about 8 hours ago

QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation

Paper • 2604.08570 • Published 26 days ago • 123

upvoted a paper 3 days ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 11 days ago • 238

upvoted a paper 12 days ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 14 days ago • 200

upvoted a paper 16 days ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 21 days ago • 340

upvoted a paper about 2 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 144

upvoted 11 papers 2 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 195

UI-Venus-1.5 Technical Report

Paper • 2602.09082 • Published Feb 9 • 157

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 202

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Paper • 2602.07845 • Published Feb 8 • 71

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Paper • 2602.06949 • Published Feb 6 • 37

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Paper • 2602.03392 • Published Feb 3 • 59

RISE-Video: Can Video Generators Decode Implicit World Rules?

Paper • 2602.05986 • Published Feb 5 • 27

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published Feb 4 • 99

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 268

upvoted 2 papers 3 months ago

3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Paper • 2602.03796 • Published Feb 3 • 64

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

Paper • 2602.00919 • Published Jan 31 • 323