lemm-test-100 / README.md
Gamahea's picture
Add ZeroGPU authentication requirements
e1e9d05 verified
---
title: Music Generation Studio
emoji: 🎵
colorFrom: purple
colorTo: pink
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
---
# 🎵 Music Generation Studio
Create AI-powered music with intelligent prompt analysis and context-aware generation using DiffRhythm2 and LyricMind AI.
**⚠️ Important:**
- This Space requires ZeroGPU to run
- **You must be logged in** to HuggingFace to use GPU features
- Free users get daily ZeroGPU quota - check your usage at https://huggingface.co/settings/billing
- If you see quota errors while logged in, try duplicating this Space to your account
## Features
- **Intelligent Music Generation**: DiffRhythm2 model for high-quality music with vocals
- **Smart Lyrics Generation**: LyricMind AI for context-aware lyric creation
- **Prompt Analysis**: Automatically detects genre, BPM, and mood from your description
- **Flexible Vocal Modes**:
- Instrumental: Pure music without vocals
- User Lyrics: Provide your own lyrics
- Auto Lyrics: AI-generated lyrics based on prompt
- **Timeline Management**: Build complete songs clip-by-clip
- **Export**: Download your creations in WAV, MP3, or FLAC formats
## How to Use
1. **Generate Music**:
- Enter a descriptive prompt (e.g., "energetic rock song with electric guitar at 140 BPM")
- Choose vocal mode (Instrumental, User Lyrics, or Auto Lyrics)
- Set duration (10-120 seconds)
- Click "Generate Music Clip"
2. **Manage Timeline**:
- View all generated clips in the timeline
- Remove specific clips or clear all
- Clips are arranged sequentially
3. **Export**:
- Enter a filename
- Choose format (WAV recommended for best quality)
- Download your complete song
## Models
- **DiffRhythm2**: Music generation with integrated vocals ([ASLP-lab/DiffRhythm2](https://huggingface.co/ASLP-lab/DiffRhythm2))
- **MuQ-MuLan**: Music style encoding ([OpenMuQ/MuQ-MuLan-large](https://huggingface.co/OpenMuQ/MuQ-MuLan-large))
## Performance
⏱️ Generation time: ~2-4 minutes per 30-second clip on CPU (HuggingFace Spaces free tier)
💡 Tip: Start with shorter durations (10-20 seconds) for faster results
## Technical Details
- Built with Gradio and PyTorch
- Uses DiffRhythm2 for music generation with vocals
- Employs flow-matching techniques for high-quality audio synthesis
- Supports multiple languages for lyrics (English, Chinese, Japanese)
## Credits
- DiffRhythm2 by ASLP-lab
- MuQ-MuLan by OpenMuQ
- Application interface and integration by Music Generation App Team
## License
MIT License - See LICENSE file for details