Wei Liu's picture

Wei Liu

lefutonku

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Memento-Skills: Let Agents Design Agents

upvoted a paper 1 day ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

upvoted a paper 1 day ago

Position: Agentic Evolution is the Path to Evolving LLMs

View all activity

Organizations

None yet

upvoted 3 papers 1 day ago

Memento-Skills: Let Agents Design Agents

Paper • 2603.18743 • Published 15 days ago • 56

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published 4 days ago • 74

Position: Agentic Evolution is the Path to Evolving LLMs

Paper • 2602.00359 • Published Jan 30 • 7

upvoted 7 papers 2 days ago

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Paper • 2603.19228 • Published 15 days ago • 67

Make Geometry Matter for Spatial Reasoning

Paper • 2603.26639 • Published 7 days ago • 29

A Matter of Time: Revealing the Structure of Time in Vision-Language Models

Paper • 2510.19559 • Published Oct 22, 2025 • 1

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published 4 days ago • 51

Agent READMEs: An Empirical Study of Context Files for Agentic Coding

Paper • 2511.12884 • Published Nov 17, 2025 • 28

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published 28 days ago • 44

LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Paper • 2603.19312 • Published 20 days ago • 18

upvoted 2 papers 3 days ago

RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Paper • 2603.25502 • Published 8 days ago • 55

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 23 days ago • 147

upvoted 3 papers 8 days ago

Repurposing Geometric Foundation Models for Multi-view Diffusion

Paper • 2603.22275 • Published 11 days ago • 46

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Paper • 2603.23497 • Published 10 days ago • 90

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 11 days ago • 120

upvoted 2 papers 11 days ago

F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions

Paper • 2509.06951 • Published Sep 8, 2025 • 33

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Paper • 2603.19235 • Published 15 days ago • 94

upvoted a paper 14 days ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published 18 days ago • 152

upvoted a paper 16 days ago

Attention Residuals

Paper • 2603.15031 • Published 18 days ago • 171

upvoted a paper 25 days ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published Jan 15 • 32