Separate audio into stems using various models
Generate audio from text using voice prompts
Generate and convert voice using text and audio inputs