Generate descriptions from images and text prompts
Ask questions about images and get instant answers
A Foundation Action Model For Generalist GUI Agents