Z-Image-Turbo-AIO / README.md
SeeSee21's picture
Update README.md
a4f2dc1 verified
|
raw
history blame
5.1 kB
---
license: apache-2.0
language:
- en
- zh
base_model:
- Tongyi-MAI/Z-Image-Turbo
pipeline_tag: text-to-image
library_name: diffusers
tags:
- text-to-image
- image-generation
- diffusion
- comfyui
- photorealistic
- bilingual
- chinese
- english
- 8-step
- fast-generation
---
# πŸš€ Z-Image-Turbo-AIO | 8-Step Photorealistic Generation
<div align="center">
**Ultra-Fast β€’ Bilingual Text Rendering β€’ All-in-One β€’ FP8 & BF16**
[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
[![ComfyUI](https://img.shields.io/badge/ComfyUI-Compatible-green.svg)](https://github.com/comfyanonymous/ComfyUI)
</div>
## ✨ What is Z-Image-Turbo-AIO?
Z-Image-Turbo-AIO is an **All-in-One repackage** of Alibaba Tongyi Lab's 6B parameter photorealistic image generator, optimized for lightning-fast 8-step generation. This version includes **integrated VAE and Text Encoder** for maximum convenience - just download and generate!
### Available Versions
| Version | Size | Best For |
|---------|------|----------|
| 🟑 **FP8-AIO** | ~10GB | Most users, testing, everyday use |
| 🌟 **BF16-AIO** | ~20GB | Maximum quality, professional work |
## 🎯 Key Features
- ⚑ **8-step generation** - 10-40 seconds per image
- πŸ“¦ **All-in-One** - No separate VAE/Text Encoder downloads needed
- πŸ“Έ **Photorealistic** - Professional quality output
- πŸ“– **Bilingual** - English & Chinese text rendering
- 🎯 **8GB VRAM** - Works on RTX 4060 and similar
- 🌐 **Apache 2.0** - Open license for any use
## πŸ”„ Which Version Should I Choose?
### 🟑 FP8-AIO (Recommended for most users)
- βœ… Half the file size
- βœ… Faster downloads
- βœ… Excellent quality
- βœ… Perfect for 8GB VRAM
- βœ… Great for testing & everyday use
### 🌟 BF16-AIO (Maximum precision)
- βœ… BFloat16 full precision
- βœ… Absolute best quality
- βœ… Professional/commercial grade
- βœ… Still works on 8GB VRAM
## πŸ“₯ Quick Start (ComfyUI)
### Installation
1. Download your preferred version (FP8 or BF16)
2. Place in `ComfyUI/models/checkpoints/`
3. Load with "Load Checkpoint" node
4. Generate!
### Recommended Settings
| Parameter | Value |
|-----------|-------|
| Steps | 8 |
| CFG | 1.0 |
| Sampler | res_multistep |
| Scheduler | simple |
| Resolution | 1920Γ—1088 |
**That's it! No separate VAE or Text Encoder needed!**
## πŸ“Š Performance
All tests on **RTX 4060 (8GB VRAM)** β€’ FP8 β€’ 1920Γ—1088 β€’ 8 steps
| Test | Generation Time |
|------|-----------------|
| Urban Interior | ~32s |
| Architecture | ~32-34s |
| Food Photography | ~32s |
| Bilingual Signage | ~32s |
## πŸ’‘ Prompting Guide
### βœ… Natural Language Works Best!
**Good Example:**
```
A cozy bookstore with floor-to-ceiling wooden shelves filled with
colorful books, comfortable reading nooks with cushions near large
windows, warm pendant lighting, peaceful afternoon atmosphere,
professional interior photography
```
**Bad Example:**
```
bookstore, books, chairs, window, cozy, warm light, interior
```
### πŸ“– Bilingual Text Rendering
**English Text:**
```
Neon sign reading "OPEN 24/7" in bright blue letters above entrance.
Modern sans-serif font, glowing effect against brick wall.
```
**Chinese Text:**
```
Traditional tea house entrance with sign reading "叀韡茢坊" in elegant
gold Chinese calligraphy on red wooden board with ornate carved border.
```
**Both Languages:**
```
Modern cafe exterior with bilingual sign. "Morning Brew Coffee" in
white elegant script above, "晨曦咖啑" in matching Chinese characters
below. Both glowing warmly at dusk.
```
### πŸ“ Prompting Tips
| Do βœ… | Don't ❌ |
|------|---------|
| Use natural language descriptions | Use tag-style prompts (tag1, tag2) |
| Be detailed (100-300 words optimal) | Write very short prompts (<50 words) |
| Include lighting and mood | Add negative prompts (not used) |
| Describe camera angle and style | Include conflicting instructions |
| Specify materials and colors | |
## πŸ™ Credits & Acknowledgments
### Original Model
- **Developer:** Tongyi Lab (Alibaba Group)
- **Architecture:** Single-Stream Diffusion Transformer (6B parameters)
- **Algorithm:** Decoupled-DMD + DMDR
- **License:** Apache 2.0
### AIO Conversion
- **Created by:** [SeeSee21](https://huggingface.co/SeeSee21)
- **Format:** Integrated VAE + Text Encoder
- **Purpose:** Simplified single-file deployment
### Resources
- πŸ€— [Original HuggingFace](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo)
- πŸ’» [GitHub Repository](https://github.com/Tongyi-MAI/Z-Image)
- 🎨 [ComfyUI Files](https://huggingface.co/Comfy-Org/z_image_turbo)
- πŸ–ΌοΈ [CivitAI Page](https://civitai.com/models/2173571)
## πŸ“ˆ Version History
### v1.0 - Initial AIO Release
- FP8-AIO version (10GB)
- BF16-AIO version (20GB)
- Integrated VAE + Text Encoder
- Single-file deployment
- Based on Tongyi-MAI/Z-Image-Turbo
- Tested on RTX 4060 8GB
- Optimized for 1920Γ—1088
---
<div align="center">
**Download, load with "Load Checkpoint", and generate professional photos in seconds! πŸš€**
</div>