tencent/HunyuanImage-3.0
Text-to-Image β’ Updated β’ 115k β’ β’ 655
Generate expressive speech audio from text with custom voice
OmniParser, turn your LLM into GUI agent
Generate depth video from input video
Audio Conditioned LipSync with Latent Diffusion Models