Generate depth map from an image
Transcribe or translate audio and YouTube videos to text
Generate depth maps and 3D views from photos