VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval Paper • 2602.08099 • Published Feb 8 • 124
VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval Paper • 2602.08099 • Published Feb 8 • 124
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization Paper • 2503.07038 • Published Mar 10, 2025
EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition Paper • 2405.18065 • Published May 28, 2024
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention Paper • 2602.01801 • Published Feb 2 • 28