Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
afondiel 's Collections
Computer Vision Challenge
Edge-AI
Vision
Audio
Video
Autonomous Systems
Cultural AI
Language
Multimodality

Vision

updated Oct 23, 2024
Upvote
-

  • An-619/FastSAM

    Updated Jun 22, 2023 • 60

  • black-forest-labs/FLUX.1-dev

    Text-to-Image • Updated Jun 27, 2025 • 766k • • 12.5k

  • black-forest-labs/FLUX.1-schnell

    Text-to-Image • Updated Aug 16, 2024 • 725k • • 4.71k

  • google/owlvit-base-patch32

    Zero-Shot Object Detection • 0.2B • Updated Dec 12, 2023 • 157k • 146

  • openai/clip-vit-base-patch32

    Zero-Shot Image Classification • Updated Feb 29, 2024 • 20.3M • 889

  • llava-hf/vip-llava-7b-hf

    Image-Text-to-Text • 7B • Updated Jan 27, 2025 • 1.15k • 16

  • mistral-community/pixtral-12b-240910

    Image-Text-to-Text • Updated Oct 1, 2024 • 3.29k • 381

  • microsoft/Phi-3-vision-128k-instruct

    Text Generation • Updated Dec 10, 2025 • 93k • 970
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs