Wei Huang

AaronHuangWei

AI & ML interests

Deep Learning; Efficient AI; Quantization; LLMs

Recent Activity

updated a model 6 days ago

AaronHuangWei/streamingomni-sft-stage1-multimodal_compression_streamingonly-ckpt2400-final

published a model 6 days ago

AaronHuangWei/streamingomni-sft-stage1-multimodal_compression_streamingonly-ckpt2400-final

updated a model 7 days ago

AaronHuangWei/streamingomni-sft-stage1-multimodal_compression_streamingonly-ckpt1600-final

upvoted 2 papers 19 days ago

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Paper • 2604.04911 • Published 20 days ago • 35

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 20 days ago • 110

upvoted a paper 4 months ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 51

upvoted a paper 5 months ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published Dec 9, 2025 • 134

upvoted 4 papers 6 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 182

upvoted a paper 7 months ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 189

upvoted a paper 9 months ago

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Paper • 2508.05635 • Published Aug 7, 2025 • 73

upvoted 3 papers 10 months ago

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

Paper • 2507.10548 • Published Jul 14, 2025 • 37

Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Paper • 2507.08441 • Published Jul 11, 2025 • 62

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 162

upvoted 4 papers over 1 year ago

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Paper • 2411.14794 • Published Nov 22, 2024 • 13

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

Paper • 2410.10814 • Published Oct 14, 2024 • 51

MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More

Paper • 2410.06270 • Published Oct 8, 2024 • 1

Can OOD Object Detectors Learn from Foundation Models?

Paper • 2409.05162 • Published Sep 8, 2024 • 9

upvoted 3 papers almost 2 years ago

SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix

Paper • 2407.00367 • Published Jun 29, 2024 • 11

What Matters in Detecting AI-Generated Videos like Sora?

Paper • 2406.19568 • Published Jun 27, 2024 • 15

SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models

Paper • 2405.14917 • Published May 23, 2024 • 1
