fancyfeast/llama-joycaption-beta-one-hf-llava
Image-Text-to-Text • 8B • Updated • 60.9k • 329
Generate detailed captions for any image
Generate captions for images using text prompts
Generate synchronized audio for videos from text prompts
Generate a depth map from your photo
Generate creative prompts for Stable Diffusion images
Generate detailed AI prompts and tags from an image
A unified multimodal understanding and generation model.
Launch an interactive demo interface
Chat with Gemini 2.5 to get detailed responses