---
base_model:
- Alpha-VLLM/Lumina-Image-2.0
license: other
license_name: fair-ai-public-license-1.0-sd
license_link: https://freedevproject.org/faipl-1.0-sd/
---
# Introduction

**Neta Lumina** is a high‑quality anime‑style image‑generation model developed by Neta.art Lab.
Building on the open‑source **Lumina‑Image‑2.0** released by the Alpha‑VLLM team at Shanghai AI Laboratory, we fine‑tuned the model on a large corpus of high‑quality anime images and multilingual tag data. The preliminary result is a model with strong prompt comprehension and interpretation (thanks to the Gemma text encoder), well suited to illustration, posters, storyboards, character design, and more.
## Key Features

- Optimized for diverse creative scenarios such as Furry, Guofeng (traditional‑Chinese aesthetics), pets, etc.
- Wide coverage of characters and styles, from popular to niche concepts. (Danbooru tags are still supported!)
- Accurate natural‑language understanding with excellent adherence to complex prompts.
- Native multilingual support, with Chinese, English, and Japanese as the recommended languages.
## Model Versions

### Base Model

Request access at https://huggingface.co/neta-art/NetaLumina_Alpha if you are interested.

- **Primary Goal**: General knowledge and anime‑style optimization
- **Data Set**: >13 million anime‑style images
- **Training Compute**: >46,000 A100 hours
### neta-lumina-beta-0624

- First beta release candidate
- **Primary Goal**: Enhanced aesthetics, pose accuracy, and scene detail
- **Data Set**: Hundreds of thousands of handpicked high‑quality anime images (fine‑tuned on the Base)
## How to Use

### HuggingFace Playground

[Try it here](https://huggingface.co/spaces/neta-art/NetaLumina_T2I_Playground)
### ComfyUI

Neta Lumina is built on the **Lumina2 Diffusion Transformer (DiT)** framework, so please follow these steps precisely.

#### Environment Requirements

Currently, Neta Lumina runs only on ComfyUI:
- Latest ComfyUI installation
- ≥ 8 GB VRAM
#### Downloads & Installation

**Original (component) release**

1. **Neta Lumina-Beta**
   - Download link: https://huggingface.co/neta-art/Neta-Lumina/blob/main/neta-lumina-beta-0624.pth
   - Save path: `ComfyUI/models/unet/`
2. **Text Encoder (Gemma-2B)**
   - Download link: https://huggingface.co/neta-art/Neta-Lumina/resolve/main/gemma_2_2b_fp16.safetensors
   - Save path: `ComfyUI/models/text_encoders/`
3. **VAE Model (16-Channel FLUX VAE)**
   - Download link: https://huggingface.co/neta-art/Neta-Lumina/resolve/main/ae.safetensors
   - Save path: `ComfyUI/models/vae/`
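
If you prefer scripting the three downloads, the sketch below shows one possible way to do it with the `huggingface_hub` package. It is not part of the official instructions: the helper names `plan_downloads` and `download_all` are hypothetical, and the script assumes the file names match those in the links above and that ComfyUI lives in a local `ComfyUI` folder.

```python
from pathlib import Path

REPO_ID = "neta-art/Neta-Lumina"

# File name in the repository -> ComfyUI subfolder, as listed above.
COMPONENTS = {
    "neta-lumina-beta-0624.pth": "models/unet",
    "gemma_2_2b_fp16.safetensors": "models/text_encoders",
    "ae.safetensors": "models/vae",
}


def plan_downloads(comfyui_root):
    """Return (file name, target directory) pairs without touching the network."""
    root = Path(comfyui_root)
    return [(name, root / subdir) for name, subdir in COMPONENTS.items()]


def download_all(comfyui_root):
    """Download every component into its ComfyUI folder (hypothetical helper)."""
    # Imported lazily so the plan can be inspected without the package installed.
    from huggingface_hub import hf_hub_download

    for name, target_dir in plan_downloads(comfyui_root):
        target_dir.mkdir(parents=True, exist_ok=True)
        hf_hub_download(repo_id=REPO_ID, filename=name, local_dir=target_dir)


# Example (assumes ComfyUI is installed in ./ComfyUI):
# download_all("ComfyUI")
```

Calling `plan_downloads("ComfyUI")` first is a cheap way to confirm the target folders before starting the large downloads.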
**Workflow**: load [`lumina_workflow.json`](https://huggingface.co/neta-art/NetaLumina_Alpha/blob/main/lumina_workflow.json) in ComfyUI.
- `UNETLoader` – loads the `.pth`
- `VAELoader` – loads `ae.safetensors`
- `CLIPLoader` – loads `gemma_2_2b_fp16.safetensors`
- `Text Encoder` – connects positive/negative prompts to the KSampler
**Simple merged release**

Download [`neta-lumina-beta-0624.safetensors`](https://huggingface.co/neta-art/Neta-Lumina/tree/main) (`md5sum = dca54fef3c64e942c1a62a741c4f9d8a`) and load it with ComfyUI’s simple checkpoint loader workflow.
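
To check a downloaded merged checkpoint against the MD5 value quoted above, a small script like the following may help. It is a minimal sketch using only the Python standard library; the function names `file_md5` and `verify_checkpoint` are illustrative, not part of any release.

```python
import hashlib

# MD5 of neta-lumina-beta-0624.safetensors as quoted in this README.
EXPECTED_MD5 = "dca54fef3c64e942c1a62a741c4f9d8a"


def file_md5(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_checkpoint(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 matches the expected digest."""
    return file_md5(path) == expected.lower()
```

Reading in chunks keeps memory use flat even for multi-gigabyte checkpoints. A `False` result from `verify_checkpoint` most likely indicates an incomplete or corrupted download.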
### Recommended Settings

- **Sampler**: `res_multistep`
- **Scheduler**: `linear_quadratic`
- **Steps**: 30
- **CFG (guidance)**: 4 – 5.5
- **EmptySD3LatentImage resolution**: 1024 × 1024, 768 × 1532, or 968 × 1322
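
For scripted or batch runs, the recommended values above can be kept in one place. The sketch below is only an illustration: the helper names are hypothetical, the dictionary keys are assumed to mirror the input names of ComfyUI's `KSampler` and `EmptySD3LatentImage` nodes, and the resolutions are assumed to be written as width × height.

```python
# Recommended values from the list above.
RECOMMENDED_RESOLUTIONS = [(1024, 1024), (768, 1532), (968, 1322)]
CFG_RANGE = (4.0, 5.5)


def within_recommendations(cfg, resolution):
    """Return True if cfg and (width, height) fall inside the recommended values."""
    return CFG_RANGE[0] <= cfg <= CFG_RANGE[1] and resolution in RECOMMENDED_RESOLUTIONS


def sampler_settings(cfg=4.5, resolution=(1024, 1024), seed=0):
    """Build settings for the sampler and latent nodes (key names are assumptions)."""
    width, height = resolution
    return {
        "ksampler": {
            "sampler_name": "res_multistep",
            "scheduler": "linear_quadratic",
            "steps": 30,
            "cfg": cfg,
            "seed": seed,
        },
        "latent": {"width": width, "height": height, "batch_size": 1},
    }
```

Values outside the recommended ranges are not rejected here, since the list above gives recommendations rather than hard limits; `within_recommendations` only reports whether a choice stays inside them.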
## Prompt Book

Detailed prompt guidelines: [**Neta Lumina Prompt Book**](https://nieta-art.feishu.cn/wiki/RVBgwvzBqiCvQ7kOMm1cM6NdnNc)
## Community

- Discord: https://discord.com/invite/TTTGccjbEa
- QQ group: 785779037
## Roadmap

### Model

- Continuous base‑model training to raise reasoning capability.
- Aesthetic‑dataset iteration to improve anatomy, background richness, and overall appeal.
- Smarter, more versatile tagging tools to lower the creative barrier.

### Ecosystem

- LoRA training tutorials and components
  - Experienced users can already fine‑tune using Lumina‑Image‑2.0’s open code.
- Development of advanced control / style‑consistency features (e.g., [Omini Control](https://arxiv.org/pdf/2411.15098)). [**Call for Collaboration!**](https://discord.com/invite/TTTGccjbEa)
## License & Disclaimer

- Neta Lumina is released under the [**Fair AI Public License 1.0‑SD**](https://freedevproject.org/faipl-1.0-sd/).
- Any modifications, merges, or derivative models must themselves be open‑sourced.
## Participants & Contributors

- Special thanks to the **Alpha‑VLLM** team for open‑sourcing **Lumina‑Image‑2.0**
- Model development: **Neta.art Lab (Civitai)**
- Core trainer: **li_li** ([Civitai](https://civitai.com/user/li_li) ・ [Hugging Face](https://huggingface.co/heziiiii))
- **Partners**
  - **nebulae**: [Civitai](https://civitai.com/user/kitarz) ・ [Hugging Face](https://huggingface.co/NebulaeWis)
  - [**narugo1992**](https://github.com/narugo1992) & [**deepghs**](https://huggingface.co/deepghs): open datasets, processing tools, and models
  - [**Naifu**](https://github.com/Mikubill/naifu) trainer by [Mikubill](https://github.com/Mikubill)
### Community Contributors

**Evaluators & developers**: 二小姐, spawner, Rnglg2

**Other contributors**: 沉迷摸鱼, poi氵, ashan, 十分无奈, GHOSTLXH, wenaka, iiiiii, 年糕特工队, 恩匹希, 奶冻美宣集, mumu, yizyin, smile
## Appendix & Resources

- **TeaCache**: https://github.com/spawner1145/sd-samplers.git
- **Advanced samplers & TeaCache guide (by spawner)**: https://docs.qq.com/doc/DZEFKb1ZrZVZiUmxw?nlc=1
- **Neta Lumina ComfyUI Manual (in Chinese)**: https://docs.qq.com/doc/DZEVQZFdtaERPdXVh