pyc66 (pengyuchao)

liked a model 2 months ago

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 13 days ago • 7.15k • 1.58k

liked 7 models 3 months ago

liked a Space 3 months ago

Qwen Image Layered

🚀

486

Decompose images into editable layers and download them

liked 5 models 3 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.22M • 3.2k

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated Jan 12, 2025 • 351k • 526

microsoft/layoutlm-base-uncased

0.1B • Updated Apr 16, 2024 • 137k • 62

openbmb/MiniCPM-V-2

Visual Question Answering • 3B • Updated Jan 15, 2025 • 77.2k • 495

zai-org/GLM-4.7

Text Generation • 358B • Updated Jan 29 • 136k • 1.95k

liked 6 Spaces 7 months ago

CLIP Interrogator

🕵

2.97k

Generate art prompts and style tags from any image

Microsoft Phi-3-Vision-128k

😻

219

Chat with an image using Phi-3 Vision model

OCR Image To Text

📸

182

Extract text from images using OCR technology

Flux.1-dev Upscaler

🔎

1.68k

Upscale low‑resolution images to higher resolution

Background Removal

🌘

2.79k

Remove image backgrounds and get transparent PNGs

Moe TTS

😊

667

Generate and convert voice using text and audio inputs

pengyuchao

AI & ML interests

Organizations

PaddlePaddle/PaddleOCR-VL

Qwen/Qwen-Image

hakurei/waifu-diffusion

deepseek-ai/DeepSeek-V3.2

apple/DepthPro

deepseek-ai/Janus-Pro-7B

impira/layoutlm-document-qa

google/deplot

pyc66's activity
