facebook/dinov3-vitb16-pretrain-lvd1689m Image Feature Extraction • 85.7M • Updated Aug 19, 2025 • 1.5M • 115
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated Aug 19, 2025 • 14.7k • 222
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability Paper • 2504.07416 • Published Apr 10, 2025 • 3
sentence-transformers/all-mpnet-base-v2 Sentence Similarity • 0.1B • Updated Aug 19, 2025 • 30.2M • • 1.28k
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 16 items • Updated Mar 2 • 83