Skier8402's Collections

Computer vision

updated Mar 25, 2025

Image and video models

  • Runtime error
    Featured

    Better Florence 2

    🔥
    198

    Analyze images to detect objects, generate captions, or perform OCR


  • Runtime error
    EfficientSAM vs SAM

    ⚔
    34


  • Runtime error
    Llava Interleave

    🌋
    31

    Generate answers by uploading images or videos


  • Running on Zero
    DALLE 3 XL v2

    🔥
    1.78k

    Generate high‑resolution images from text prompts


  • Running on Zero
    Segment Anything 2

    🔥
    140

    Generate object masks on images with SAM2


  • Runtime error
    Featured

    Florence2 + SAM2

    🔥
    515

    Segment and caption objects in images and videos


  • Running on T4
    RF-DETR

    🔥
    125

    State-of-the-art (SOTA) real-time object detection model
