PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 7.15k • 1.58k
Generate art prompts and style tags from any image
Chat with an image using Phi-3 Vision model
Extract text from images using OCR technology
Upscale low‑resolution images to higher resolution
Remove image backgrounds and get transparent PNGs
Generate and convert voice using text and audio inputs