Transform video frames using text instructions
Train a custom video model
Image to 3D with DPT + 3D Point Cloud
Generate 3D depth map visualization from an image