Vatsal Agarwal's picture

Vatsal Agarwal

vatsalag

·

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory

authored a paper 1 day ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

upvoted a paper 1 day ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

View all activity

Organizations

authored 2 papers 1 day ago

Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory

Paper • 2602.18434 • Published 26 days ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published 4 days ago • 9

authored 4 papers 8 months ago

Do text-free diffusion models learn discriminative visual representations?

Paper • 2311.17921 • Published Nov 29, 2023 • 1

Diffusion Models Beat GANs on Image Classification

Paper • 2307.08702 • Published Jul 17, 2023 • 19

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Paper • 2409.06703 • Published Sep 10, 2024 • 3

Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor

Paper • 2507.07106 • Published Jul 9, 2025 • 2