Granite Vision

updated 4 days ago

Multimodal models built for visual document analysis and image understanding.

  • Multimodal RAG with Granite Vision

    Running on Zero • Agents • 40

    RAG example using Granite [vision, embedding, instruct]

  • ibm-granite/granite-vision-4.1-4b

    Image-Text-to-Text • 4B • Updated 4 days ago • 6.76k • 47

  • ibm-granite/granite-vision-3.3-2b-embedding

    Feature Extraction • 3B • Updated Aug 16, 2025 • 69 • 27

  • ibm-granite/granite-vision-3.1-2b-preview

    Image-Text-to-Text • Updated Jun 12, 2025 • 1.18k • 113

  • ibm-granite/granite-vision-3.3-2b

    Image-to-Text • 3B • Updated Apr 2 • 107k • 83

  • ibm-granite/granite-4.0-3b-vision

    Image-Text-to-Text • 4B • Updated 3 days ago • 159k • 109

  • ibm-granite/granite-vision-3.2-2b

    Image-Text-to-Text • 3B • Updated Apr 2 • 4.66k • 122