Transcribe audio files into text
Generate images from text with Stable Diffusion
Generate images from text descriptions