Generate 3D models from 2D images with mask control
Upload and process videos with LLM commands
Generate sound effects from a description