Instructions to use unsloth/Z-Image-Turbo-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Local Apps
- Unsloth Studio new
How to use unsloth/Z-Image-Turbo-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for unsloth/Z-Image-Turbo-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for unsloth/Z-Image-Turbo-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for unsloth/Z-Image-Turbo-GGUF to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="unsloth/Z-Image-Turbo-GGUF", max_seq_length=2048, )
Upload GGUF quantizations of Z-Image-Turbo
Browse files- .gitattributes +26 -0
- README.md +220 -0
- assets/DMDR.webp +3 -0
- assets/Z-Image-Gallery.pdf +3 -0
- assets/architecture.webp +3 -0
- assets/decoupled-dmd.webp +3 -0
- assets/leaderboard.png +3 -0
- assets/leaderboard.webp +0 -0
- assets/reasoning.png +3 -0
- assets/showcase.jpg +3 -0
- assets/showcase_editing.png +3 -0
- assets/showcase_realistic.png +3 -0
- assets/showcase_rendering.png +3 -0
- z-image-turbo-BF16.gguf +3 -0
- z-image-turbo-F16.gguf +3 -0
- z-image-turbo-Q2_K.gguf +3 -0
- z-image-turbo-Q3_K_L.gguf +3 -0
- z-image-turbo-Q3_K_M.gguf +3 -0
- z-image-turbo-Q3_K_S.gguf +3 -0
- z-image-turbo-Q4_0.gguf +3 -0
- z-image-turbo-Q4_1.gguf +3 -0
- z-image-turbo-Q4_K_M.gguf +3 -0
- z-image-turbo-Q4_K_S.gguf +3 -0
- z-image-turbo-Q5_0.gguf +3 -0
- z-image-turbo-Q5_1.gguf +3 -0
- z-image-turbo-Q5_K_M.gguf +3 -0
- z-image-turbo-Q5_K_S.gguf +3 -0
- z-image-turbo-Q6_K.gguf +3 -0
- z-image-turbo-Q8_0.gguf +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,29 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
assets/DMDR.webp filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
assets/Z-Image-Gallery.pdf filter=lfs diff=lfs merge=lfs -text
|
| 38 |
+
assets/architecture.webp filter=lfs diff=lfs merge=lfs -text
|
| 39 |
+
assets/decoupled-dmd.webp filter=lfs diff=lfs merge=lfs -text
|
| 40 |
+
assets/leaderboard.png filter=lfs diff=lfs merge=lfs -text
|
| 41 |
+
assets/reasoning.png filter=lfs diff=lfs merge=lfs -text
|
| 42 |
+
assets/showcase.jpg filter=lfs diff=lfs merge=lfs -text
|
| 43 |
+
assets/showcase_editing.png filter=lfs diff=lfs merge=lfs -text
|
| 44 |
+
assets/showcase_realistic.png filter=lfs diff=lfs merge=lfs -text
|
| 45 |
+
assets/showcase_rendering.png filter=lfs diff=lfs merge=lfs -text
|
| 46 |
+
z-image-turbo-BF16.gguf filter=lfs diff=lfs merge=lfs -text
|
| 47 |
+
z-image-turbo-F16.gguf filter=lfs diff=lfs merge=lfs -text
|
| 48 |
+
z-image-turbo-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
| 49 |
+
z-image-turbo-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
| 50 |
+
z-image-turbo-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 51 |
+
z-image-turbo-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 52 |
+
z-image-turbo-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
| 53 |
+
z-image-turbo-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
| 54 |
+
z-image-turbo-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 55 |
+
z-image-turbo-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 56 |
+
z-image-turbo-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
|
| 57 |
+
z-image-turbo-Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
|
| 58 |
+
z-image-turbo-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
| 59 |
+
z-image-turbo-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
| 60 |
+
z-image-turbo-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
| 61 |
+
z-image-turbo-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,220 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
base_model: Tongyi-MAI/Z-Image-Turbo
|
| 3 |
+
license: apache-2.0
|
| 4 |
+
language:
|
| 5 |
+
- en
|
| 6 |
+
pipeline_tag: text-to-image
|
| 7 |
+
library_name: ggml
|
| 8 |
+
tags:
|
| 9 |
+
- gguf
|
| 10 |
+
- quantized
|
| 11 |
+
---
|
| 12 |
+
|
| 13 |
+
> [!NOTE]
|
| 14 |
+
> This is a GGUF quantized version of [Z-Image-Turbo](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo).
|
| 15 |
+
> unsloth/Z-Image-Turbo-GGUF uses [Unsloth Dynamic 2.0](https://docs.unsloth.ai/basics/unsloth-dynamic-2.0-ggufs) methodology for SOTA performance. Important layers are upcasted to higher precision.
|
| 16 |
+
|
| 17 |
+
|
| 18 |
+
<div>
|
| 19 |
+
<div style="display: flex; gap: 5px; align-items: center; ">
|
| 20 |
+
<a href="https://github.com/unslothai/unsloth/">
|
| 21 |
+
<img src="https://github.com/unslothai/unsloth/raw/main/images/unsloth%20new%20logo.png" width="133">
|
| 22 |
+
</a>
|
| 23 |
+
<a href="https://discord.gg/unsloth">
|
| 24 |
+
<img src="https://github.com/unslothai/unsloth/raw/main/images/Discord%20button.png" width="173">
|
| 25 |
+
</a>
|
| 26 |
+
<a href="https://docs.unsloth.ai/new/ministral-3">
|
| 27 |
+
<img src="https://raw.githubusercontent.com/unslothai/unsloth/refs/heads/main/images/documentation%20green%20button.png" width="143">
|
| 28 |
+
</a>
|
| 29 |
+
</div>
|
| 30 |
+
</div>
|
| 31 |
+
|
| 32 |
+
---
|
| 33 |
+
<h1 align="center">⚡️- Image<br><sub><sup>An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer</sup></sub></h1>
|
| 34 |
+
|
| 35 |
+
<div align="center">
|
| 36 |
+
|
| 37 |
+
[](https://tongyi-mai.github.io/Z-Image-blog/) 
|
| 38 |
+
[](https://github.com/Tongyi-MAI/Z-Image) 
|
| 39 |
+
[](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) 
|
| 40 |
+
[](https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo) 
|
| 41 |
+
[](https://huggingface.co/spaces/akhaliq/Z-Image-Turbo) 
|
| 42 |
+
[](https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo) 
|
| 43 |
+
[](https://www.modelscope.cn/aigc/imageGeneration?tab=advanced&versionId=469191&modelType=Checkpoint&sdVersion=Z_IMAGE_TURBO&modelUrl=modelscope%253A%252F%252FTongyi-MAI%252FZ-Image-Turbo%253Frevision%253Dmaster%7D%7BOnline) 
|
| 44 |
+
[](assets/Z-Image-Gallery.pdf) 
|
| 45 |
+
[](https://modelscope.cn/studios/Tongyi-MAI/Z-Image-Gallery/summary) 
|
| 46 |
+
<a href="https://arxiv.org/abs/2511.22699" target="_blank"><img src="https://img.shields.io/badge/Report-b5212f.svg?logo=arxiv" height="21px"></a>
|
| 47 |
+
|
| 48 |
+
|
| 49 |
+
Welcome to the official repository for the Z-Image(造相)project!
|
| 50 |
+
|
| 51 |
+
</div>
|
| 52 |
+
|
| 53 |
+
|
| 54 |
+
|
| 55 |
+
## ✨ Z-Image
|
| 56 |
+
|
| 57 |
+
Z-Image is a powerful and highly efficient image generation model with **6B** parameters. Currently there are three variants:
|
| 58 |
+
|
| 59 |
+
- 🚀 **Z-Image-Turbo** – A distilled version of Z-Image that matches or exceeds leading competitors with only **8 NFEs** (Number of Function Evaluations). It offers **⚡️sub-second inference latency⚡️** on enterprise-grade H800 GPUs and fits comfortably within **16G VRAM consumer devices**. It excels in photorealistic image generation, bilingual text rendering (English & Chinese), and robust instruction adherence.
|
| 60 |
+
|
| 61 |
+
- 🧱 **Z-Image-Base** – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
|
| 62 |
+
|
| 63 |
+
- ✍️ **Z-Image-Edit** – A variant fine-tuned on Z-Image specifically for image editing tasks. It supports creative image-to-image generation with impressive instruction-following capabilities, allowing for precise edits based on natural language prompts.
|
| 64 |
+
|
| 65 |
+
### 📥 Model Zoo
|
| 66 |
+
|
| 67 |
+
| Model | Hugging Face | ModelScope |
|
| 68 |
+
| :--- |:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
| 69 |
+
| **Z-Image-Turbo** | [](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) <br> [](https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo) | [](https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo) <br> [](https://www.modelscope.cn/aigc/imageGeneration?tab=advanced&versionId=469191&modelType=Checkpoint&sdVersion=Z_IMAGE_TURBO&modelUrl=modelscope%3A%2F%2FTongyi-MAI%2FZ-Image-Turbo%3Frevision%3Dmaster) |
|
| 70 |
+
| **Z-Image-Base** | *To be released* | *To be released* |
|
| 71 |
+
| **Z-Image-Edit** | *To be released* | *To be released* |
|
| 72 |
+
|
| 73 |
+
### 🖼️ Showcase
|
| 74 |
+
|
| 75 |
+
📸 **Photorealistic Quality**: **Z-Image-Turbo** delivers strong photorealistic image generation while maintaining excellent aesthetic quality.
|
| 76 |
+
|
| 77 |
+

|
| 78 |
+
|
| 79 |
+
📖 **Accurate Bilingual Text Rendering**: **Z-Image-Turbo** excels at accurately rendering complex Chinese and English text.
|
| 80 |
+
|
| 81 |
+

|
| 82 |
+
|
| 83 |
+
💡 **Prompt Enhancing & Reasoning**: Prompt Enhancer empowers the model with reasoning capabilities, enabling it to transcend surface-level descriptions and tap into underlying world knowledge.
|
| 84 |
+
|
| 85 |
+

|
| 86 |
+
|
| 87 |
+
🧠 **Creative Image Editing**: **Z-Image-Edit** shows a strong understanding of bilingual editing instructions, enabling imaginative and flexible image transformations.
|
| 88 |
+
|
| 89 |
+

|
| 90 |
+
|
| 91 |
+
### 🏗️ Model Architecture
|
| 92 |
+
We adopt a **Scalable Single-Stream DiT** (S3-DiT) architecture. In this setup, text, visual semantic tokens, and image VAE tokens are concatenated at the sequence level to serve as a unified input stream, maximizing parameter efficiency compared to dual-stream approaches.
|
| 93 |
+
|
| 94 |
+

|
| 95 |
+
|
| 96 |
+
### 📈 Performance
|
| 97 |
+
According to the Elo-based Human Preference Evaluation (on [*Alibaba AI Arena*](https://aiarena.alibaba-inc.com/corpora/arena/leaderboard?arenaType=T2I)), Z-Image-Turbo shows highly competitive performance against other leading models, while achieving state-of-the-art results among open-source models.
|
| 98 |
+
|
| 99 |
+
<p align="center">
|
| 100 |
+
<a href="https://aiarena.alibaba-inc.com/corpora/arena/leaderboard?arenaType=T2I">
|
| 101 |
+
<img src="assets/leaderboard.png" alt="Z-Image Elo Rating on AI Arena"/><br />
|
| 102 |
+
<span style="font-size:1.05em; cursor:pointer; text-decoration:underline;"> Click to view the full leaderboard</span>
|
| 103 |
+
</a>
|
| 104 |
+
</p>
|
| 105 |
+
|
| 106 |
+
### 🚀 Quick Start
|
| 107 |
+
Install the latest version of diffusers, use the following command:
|
| 108 |
+
<details>
|
| 109 |
+
<summary><sup>Click here for details for why you need to install diffusers from source</sup></summary>
|
| 110 |
+
|
| 111 |
+
We have submitted two pull requests ([#12703](https://github.com/huggingface/diffusers/pull/12703) and [#12715](https://github.com/huggingface/diffusers/pull/12715)) to the 🤗 diffusers repository to add support for Z-Image. Both PRs have been merged into the latest official diffusers release.
|
| 112 |
+
Therefore, you need to install diffusers from source for the latest features and Z-Image support.
|
| 113 |
+
|
| 114 |
+
</details>
|
| 115 |
+
|
| 116 |
+
```bash
|
| 117 |
+
pip install git+https://github.com/huggingface/diffusers
|
| 118 |
+
```
|
| 119 |
+
|
| 120 |
+
```python
|
| 121 |
+
import torch
|
| 122 |
+
from diffusers import ZImagePipeline
|
| 123 |
+
|
| 124 |
+
# 1. Load the pipeline
|
| 125 |
+
# Use bfloat16 for optimal performance on supported GPUs
|
| 126 |
+
pipe = ZImagePipeline.from_pretrained(
|
| 127 |
+
"Tongyi-MAI/Z-Image-Turbo",
|
| 128 |
+
torch_dtype=torch.bfloat16,
|
| 129 |
+
low_cpu_mem_usage=False,
|
| 130 |
+
)
|
| 131 |
+
pipe.to("cuda")
|
| 132 |
+
|
| 133 |
+
# [Optional] Attention Backend
|
| 134 |
+
# Diffusers uses SDPA by default. Switch to Flash Attention for better efficiency if supported:
|
| 135 |
+
# pipe.transformer.set_attention_backend("flash") # Enable Flash-Attention-2
|
| 136 |
+
# pipe.transformer.set_attention_backend("_flash_3") # Enable Flash-Attention-3
|
| 137 |
+
|
| 138 |
+
# [Optional] Model Compilation
|
| 139 |
+
# Compiling the DiT model accelerates inference, but the first run will take longer to compile.
|
| 140 |
+
# pipe.transformer.compile()
|
| 141 |
+
|
| 142 |
+
# [Optional] CPU Offloading
|
| 143 |
+
# Enable CPU offloading for memory-constrained devices.
|
| 144 |
+
# pipe.enable_model_cpu_offload()
|
| 145 |
+
|
| 146 |
+
prompt = "Young Chinese woman in red Hanfu, intricate embroidery. Impeccable makeup, red floral forehead pattern. Elaborate high bun, golden phoenix headdress, red flowers, beads. Holds round folding fan with lady, trees, bird. Neon lightning-bolt lamp (⚡️), bright yellow glow, above extended left palm. Soft-lit outdoor night background, silhouetted tiered pagoda (西安大雁塔), blurred colorful distant lights."
|
| 147 |
+
|
| 148 |
+
# 2. Generate Image
|
| 149 |
+
image = pipe(
|
| 150 |
+
prompt=prompt,
|
| 151 |
+
height=1024,
|
| 152 |
+
width=1024,
|
| 153 |
+
num_inference_steps=9, # This actually results in 8 DiT forwards
|
| 154 |
+
guidance_scale=0.0, # Guidance should be 0 for the Turbo models
|
| 155 |
+
generator=torch.Generator("cuda").manual_seed(42),
|
| 156 |
+
).images[0]
|
| 157 |
+
|
| 158 |
+
image.save("example.png")
|
| 159 |
+
```
|
| 160 |
+
|
| 161 |
+
## 🔬 Decoupled-DMD: The Acceleration Magic Behind Z-Image
|
| 162 |
+
|
| 163 |
+
[](https://arxiv.org/abs/2511.22677)
|
| 164 |
+
|
| 165 |
+
Decoupled-DMD is the core few-step distillation algorithm that empowers the 8-step Z-Image model.
|
| 166 |
+
|
| 167 |
+
Our core insight in Decoupled-DMD is that the success of existing DMD (Distributaion Matching Distillation) methods is the result of two independent, collaborating mechanisms:
|
| 168 |
+
|
| 169 |
+
- **CFG Augmentation (CA)**: The primary **engine** 🚀 driving the distillation process, a factor largely overlooked in previous work.
|
| 170 |
+
- **Distribution Matching (DM)**: Acts more as a **regularizer** ⚖️, ensuring the stability and quality of the generated output.
|
| 171 |
+
|
| 172 |
+
By recognizing and decoupling these two mechanisms, we were able to study and optimize them in isolation. This ultimately motivated us to develop an improved distillation process that significantly enhances the performance of few-step generation.
|
| 173 |
+
|
| 174 |
+

|
| 175 |
+
|
| 176 |
+
## 🤖 DMDR: Fusing DMD with Reinforcement Learning
|
| 177 |
+
|
| 178 |
+
[](https://arxiv.org/abs/2511.13649)
|
| 179 |
+
|
| 180 |
+
Building upon the strong foundation of Decoupled-DMD, our 8-step Z-Image model has already demonstrated exceptional capabilities. To achieve further improvements in terms of semantic alignment, aesthetic quality, and structural coherence—while producing images with richer high-frequency details—we present **DMDR**.
|
| 181 |
+
|
| 182 |
+
Our core insight behind DMDR is that Reinforcement Learning (RL) and Distribution Matching Distillation (DMD) can be synergistically integrated during the post-training of few-step models. We demonstrate that:
|
| 183 |
+
|
| 184 |
+
- **RL Unlocks the Performance of DMD** 🚀
|
| 185 |
+
- **DMD Effectively Regularizes RL** ⚖️
|
| 186 |
+
|
| 187 |
+

|
| 188 |
+
|
| 189 |
+
## ⏬ Download
|
| 190 |
+
```bash
|
| 191 |
+
pip install -U huggingface_hub
|
| 192 |
+
HF_XET_HIGH_PERFORMANCE=1 hf download Tongyi-MAI/Z-Image-Turbo
|
| 193 |
+
```
|
| 194 |
+
|
| 195 |
+
## 📜 Citation
|
| 196 |
+
|
| 197 |
+
If you find our work useful in your research, please consider citing:
|
| 198 |
+
|
| 199 |
+
```bibtex
|
| 200 |
+
@article{team2025zimage,
|
| 201 |
+
title={Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer},
|
| 202 |
+
author={Z-Image Team},
|
| 203 |
+
journal={arXiv preprint arXiv:2511.22699},
|
| 204 |
+
year={2025}
|
| 205 |
+
}
|
| 206 |
+
|
| 207 |
+
@article{liu2025decoupled,
|
| 208 |
+
title={Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield},
|
| 209 |
+
author={Dongyang Liu and Peng Gao and David Liu and Ruoyi Du and Zhen Li and Qilong Wu and Xin Jin and Sihan Cao and Shifeng Zhang and Hongsheng Li and Steven Hoi},
|
| 210 |
+
journal={arXiv preprint arXiv:2511.22677},
|
| 211 |
+
year={2025}
|
| 212 |
+
}
|
| 213 |
+
|
| 214 |
+
@article{jiang2025distribution,
|
| 215 |
+
title={Distribution Matching Distillation Meets Reinforcement Learning},
|
| 216 |
+
author={Jiang, Dengyang and Liu, Dongyang and Wang, Zanyi and Wu, Qilong and Jin, Xin and Liu, David and Li, Zhen and Wang, Mengmeng and Gao, Peng and Yang, Harry},
|
| 217 |
+
journal={arXiv preprint arXiv:2511.13649},
|
| 218 |
+
year={2025}
|
| 219 |
+
}
|
| 220 |
+
```
|
assets/DMDR.webp
ADDED
|
Git LFS Details
|
assets/Z-Image-Gallery.pdf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6f9895b3246d2547bac74bbe0be975da500eaae93f2cad4248ad3281786b1ac6
|
| 3 |
+
size 15767436
|
assets/architecture.webp
ADDED
|
Git LFS Details
|
assets/decoupled-dmd.webp
ADDED
|
Git LFS Details
|
assets/leaderboard.png
ADDED
|
Git LFS Details
|
assets/leaderboard.webp
ADDED
|
assets/reasoning.png
ADDED
|
Git LFS Details
|
assets/showcase.jpg
ADDED
|
Git LFS Details
|
assets/showcase_editing.png
ADDED
|
Git LFS Details
|
assets/showcase_realistic.png
ADDED
|
Git LFS Details
|
assets/showcase_rendering.png
ADDED
|
Git LFS Details
|
z-image-turbo-BF16.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9ccc4fb8b84519b8aa201abcffd900d19ee146b409ae9897625d235b104ef9c5
|
| 3 |
+
size 12311939136
|
z-image-turbo-F16.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:996ed5b13fc124053edec41d3ad8fef3fe1dca0ef3f7aa24b4123d84367d3666
|
| 3 |
+
size 12311939136
|
z-image-turbo-Q2_K.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:521621bb89c57a23df35479c32376016e6ee95a7ed6b3690d734506ec312e61b
|
| 3 |
+
size 3409283136
|
z-image-turbo-Q3_K_L.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9d8d87ad4576b1c4eda48e269c03ee0a990948589fcdbc5b329fd7466d1541ba
|
| 3 |
+
size 4322911296
|
z-image-turbo-Q3_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ab92a884969da679353b1af10b1857ed10dabc774ba908f78b7bb96800fcf1d9
|
| 3 |
+
size 4175455296
|
z-image-turbo-Q3_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:52c50c3f28869d754c9e88e6b9eafc51d25fa3a7f1f91fff8abf8dd7f55fc1d8
|
| 3 |
+
size 3886917696
|
z-image-turbo-Q4_0.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:302b9cf7e7ddbeffb472fde1a9e2ea436a5edf1f7119c617a18e5222c8078921
|
| 3 |
+
size 4585244736
|
z-image-turbo-Q4_1.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:63c9a92831a0adfdf3f06b7fef94f2e4faba2578ed2034d8bb04821683305b43
|
| 3 |
+
size 4850665536
|
z-image-turbo-Q4_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e375ea8dbfeeddbcd5fdeb77d3c2ef9dee90fc5773aaad0bc8be78f752e7802b
|
| 3 |
+
size 5030193216
|
z-image-turbo-Q4_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ee432feb7ba952f2b7cba59e2644d862d9c6aae048cb1f457264d4e88e9774ee
|
| 3 |
+
size 4729751616
|
z-image-turbo-Q5_0.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6a83bb03ea9f9c0e8760ef25252a3364310a164608cfaca0d505cf4d7592d8c8
|
| 3 |
+
size 5263542336
|
z-image-turbo-Q5_1.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e84236eaed78b1ff1f05db0dced5a9634c0ffcc2dc9feffcc6a8fb744b430e7b
|
| 3 |
+
size 5528963136
|
z-image-turbo-Q5_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:847e33003ae6e85121c62785d7f7213958d58287c932d0c2189a1c9a2d1f093a
|
| 3 |
+
size 5574444096
|
z-image-turbo-Q5_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:529d8d70efb0df9e9547e6a8560db88cd1c21abcac9c8cc599aece240b578fc6
|
| 3 |
+
size 5285906496
|
z-image-turbo-Q6_K.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fc137d87b49e06fdd5230d67d6c8cfa42a9e1fd38b65ccd355882450c3eb1c82
|
| 3 |
+
size 5910505536
|
z-image-turbo-Q8_0.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f163d60b0eb427469510b8226243d196574a18139a2e40c017409cfbda95ecfe
|
| 3 |
+
size 7224707136
|