Generate Vietnamese speech from text
Generate vivid images from text prompts in seconds
OmniParser, turn your LLM into GUI agent