Spaces:

Gamahea
/

lemm-test-100

Running on Zero

App Files Files Community

lemm-test-100 / README.md

Gamahea

Add ZeroGPU authentication requirements

e1e9d05 verified about 3 hours ago

preview code

raw

history blame contribute delete

2.68 kB

	---
	title: Music Generation Studio
	emoji: 🎵
	colorFrom: purple
	colorTo: pink
	sdk: gradio
	sdk_version: 4.44.0
	app_file: app.py
	pinned: false
	license: mit
	---

	# 🎵 Music Generation Studio

	Create AI-powered music with intelligent prompt analysis and context-aware generation using DiffRhythm2 and LyricMind AI.

	⚠️ Important:
	- This Space requires ZeroGPU to run
	- You must be logged in to HuggingFace to use GPU features
	- Free users get daily ZeroGPU quota - check your usage at https://huggingface.co/settings/billing
	- If you see quota errors while logged in, try duplicating this Space to your account

	## Features

	- Intelligent Music Generation: DiffRhythm2 model for high-quality music with vocals
	- Smart Lyrics Generation: LyricMind AI for context-aware lyric creation
	- Prompt Analysis: Automatically detects genre, BPM, and mood from your description
	- Flexible Vocal Modes:
	- Instrumental: Pure music without vocals
	- User Lyrics: Provide your own lyrics
	- Auto Lyrics: AI-generated lyrics based on prompt
	- Timeline Management: Build complete songs clip-by-clip
	- Export: Download your creations in WAV, MP3, or FLAC formats

	## How to Use

	1. Generate Music:
	- Enter a descriptive prompt (e.g., "energetic rock song with electric guitar at 140 BPM")
	- Choose vocal mode (Instrumental, User Lyrics, or Auto Lyrics)
	- Set duration (10-120 seconds)
	- Click "Generate Music Clip"

	2. Manage Timeline:
	- View all generated clips in the timeline
	- Remove specific clips or clear all
	- Clips are arranged sequentially

	3. Export:
	- Enter a filename
	- Choose format (WAV recommended for best quality)
	- Download your complete song

	## Models

	- DiffRhythm2: Music generation with integrated vocals ([ASLP-lab/DiffRhythm2](https://huggingface.co/ASLP-lab/DiffRhythm2))
	- MuQ-MuLan: Music style encoding ([OpenMuQ/MuQ-MuLan-large](https://huggingface.co/OpenMuQ/MuQ-MuLan-large))

	## Performance

	⏱️ Generation time: ~2-4 minutes per 30-second clip on CPU (HuggingFace Spaces free tier)

	💡 Tip: Start with shorter durations (10-20 seconds) for faster results

	## Technical Details

	- Built with Gradio and PyTorch
	- Uses DiffRhythm2 for music generation with vocals
	- Employs flow-matching techniques for high-quality audio synthesis
	- Supports multiple languages for lyrics (English, Chinese, Japanese)

	## Credits

	- DiffRhythm2 by ASLP-lab
	- MuQ-MuLan by OpenMuQ
	- Application interface and integration by Music Generation App Team

	## License

	MIT License - See LICENSE file for details