VL-Thinking-Data PJMixers-Images/bghira_pseudo-camera-10k-gemini-2.0-flash-thinking-exp-1219-CustomShareGPT Viewer • Updated Jan 17, 2025 • 2.96k • 16 • 1
PJMixers-Images/bghira_pseudo-camera-10k-gemini-2.0-flash-thinking-exp-1219-CustomShareGPT Viewer • Updated Jan 17, 2025 • 2.96k • 16 • 1
MLLMDataV1 zrchen03/math_data_ocr Viewer • Updated Mar 10, 2025 • 16.1k • 22 • 3 pengshuai-rin/multimath-300k Viewer • Updated Aug 20, 2024 • 1.19M • 128 • 11 OpenFace-CQUPT/HumanCaption-HQ-311K Viewer • Updated Jun 9, 2025 • 313k • 76 • 17 remyxai/vqasynth_spacellava Viewer • Updated Oct 24, 2024 • 28k • 73 • 14
MLLM-08 Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133 zrchen03/math_data_ocr Viewer • Updated Mar 10, 2025 • 16.1k • 22 • 3
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133
MLLMDataV2 DAMO-NLP-SG/multimodal_textbook Updated Mar 17, 2025 • 1.1k • 154 zwq2018/Multi-modal-Self-instruct Viewer • Updated Jan 27, 2025 • 76k • 686 • 31 taesiri/GameplayCaptions-Gemini-pro-vision Viewer • Updated Apr 7, 2024 • 70.7k • 163 • 6 5CD-AI/LLaVA-CoT-o1-Instruct Viewer • Updated Nov 27, 2024 • 58.5k • 43 • 109
EfficientLLM Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 56
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 56
VL-Thinking-Data PJMixers-Images/bghira_pseudo-camera-10k-gemini-2.0-flash-thinking-exp-1219-CustomShareGPT Viewer • Updated Jan 17, 2025 • 2.96k • 16 • 1
PJMixers-Images/bghira_pseudo-camera-10k-gemini-2.0-flash-thinking-exp-1219-CustomShareGPT Viewer • Updated Jan 17, 2025 • 2.96k • 16 • 1
MLLMDataV2 DAMO-NLP-SG/multimodal_textbook Updated Mar 17, 2025 • 1.1k • 154 zwq2018/Multi-modal-Self-instruct Viewer • Updated Jan 27, 2025 • 76k • 686 • 31 taesiri/GameplayCaptions-Gemini-pro-vision Viewer • Updated Apr 7, 2024 • 70.7k • 163 • 6 5CD-AI/LLaVA-CoT-o1-Instruct Viewer • Updated Nov 27, 2024 • 58.5k • 43 • 109
MLLMDataV1 zrchen03/math_data_ocr Viewer • Updated Mar 10, 2025 • 16.1k • 22 • 3 pengshuai-rin/multimath-300k Viewer • Updated Aug 20, 2024 • 1.19M • 128 • 11 OpenFace-CQUPT/HumanCaption-HQ-311K Viewer • Updated Jun 9, 2025 • 313k • 76 • 17 remyxai/vqasynth_spacellava Viewer • Updated Oct 24, 2024 • 28k • 73 • 14
EfficientLLM Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 56
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29, 2024 • 56
MLLM-08 Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133 zrchen03/math_data_ocr Viewer • Updated Mar 10, 2025 • 16.1k • 22 • 3
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 133