jackw
hg2wzh
AI & ML interests
None yet
Recent Activity
liked a model 2 days ago
Qwen/Qwen3-VL-Embedding-2B updated a collection 2 days ago
Embedding liked a Space 10 days ago
TIGER-Lab/MMEB-LeaderboardOrganizations
None yet
Datasets
Embedding
-
nvidia/MM-Embed
8B • Updated • 542 • 65 -
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Paper • 2412.08802 • Published • 5 -
nvidia/NV-Embed-v2
Feature Extraction • 8B • Updated • 55.9k • 506 -
Qwen/Qwen3-VL-Embedding-2B
Feature Extraction • 2B • Updated • 1.2M • • 350
VLMs
-
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Paper • 2409.12191 • Published • 79 -
Multimodal Latent Language Modeling with Next-Token Diffusion
Paper • 2412.08635 • Published • 49 -
AIDC-AI/Ovis2-2B
Image-Text-to-Text • Updated • 2.16k • 60 -
DAMO-NLP-SG/VideoLLaMA3-2B
Video-Text-to-Text • 2B • Updated • 7.73k • 20
Text-to-Image
Datasets
Reasoning
Embedding
-
nvidia/MM-Embed
8B • Updated • 542 • 65 -
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Paper • 2412.08802 • Published • 5 -
nvidia/NV-Embed-v2
Feature Extraction • 8B • Updated • 55.9k • 506 -
Qwen/Qwen3-VL-Embedding-2B
Feature Extraction • 2B • Updated • 1.2M • • 350
CLIP series
VLMs
-
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Paper • 2409.12191 • Published • 79 -
Multimodal Latent Language Modeling with Next-Token Diffusion
Paper • 2412.08635 • Published • 49 -
AIDC-AI/Ovis2-2B
Image-Text-to-Text • Updated • 2.16k • 60 -
DAMO-NLP-SG/VideoLLaMA3-2B
Video-Text-to-Text • 2B • Updated • 7.73k • 20
LLMs