PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 219
Moonshine: Speech Recognition for Live Transcription and Voice Commands Paper • 2410.15608 • Published Oct 21, 2024 • 12
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published Dec 12, 2025 • 39
VGG-T^3: Offline Feed-Forward 3D Reconstruction at Scale Paper • 2602.23361 • Published 22 days ago • 14
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models Paper • 2601.22060 • Published Jan 29 • 155
VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval Paper • 2602.08099 • Published Feb 8 • 124