Recursos

Explore a coleção completa de recursos do ecossistema Z-Image

Modelos de Código Aberto11

Arquitetura e Visão do Z-Image-Omni-Base

Omni-Base

Pré-treino Omni

Arquitetura Unificada

S3-DiT

Geração e Edição

Z-Image-Omni-Base

Evolução estratégica do Z-Image-Base, apresentando pré-treino omni para geração e edição unificadas de imagens, evitando complexidade e perda de desempenho na troca de tarefas.

O Z-Image-Omni-Base marca uma mudança estratégica do modelo 'Base' original para uma arquitetura de pré-treino 'omni' (omnipotente). Ele unifica as tarefas de geração de imagem e edição/inpainting dentro de uma única estrutura usando o Scalable Single-Stream Diffusion Transformer (S3-DiT). Este pré-treino omni permite transições perfeitas entre a geração de novas imagens e a edição das existentes sem a necessidade de modelos especializados separados, oferecendo maior eficiência de parâmetros e flexibilidade para programadores.

Desempenho do Leaderboard Z-Image

Modelo de Fundação

S3-DiT

6B Parâmetros

Open Source SOTA

Z-Image

Um modelo de fundação de geração de imagem eficiente com Single-Stream Diffusion Transformer de 6B parâmetros.

O Z-Image é um modelo de fundação de geração de imagem poderoso e altamente eficiente com 6B parâmetros. Aproveitando a arquitetura Scalable Single-Stream Diffusion Transformer (S3-DiT), ele processa texto, tokens semânticos visuais e tokens VAE de imagem como um fluxo unificado. O Z-Image serve como núcleo para variantes como Z-Image-Turbo e Z-Image-Omni-Base, oferecendo desempenho de ponta entre modelos de código aberto.

Z-Image Elo Rating on AI Arena

Open Source

Code

Documentation

6B Parameters

Z-Image GitHub Repository

Z-Image open source main repository, containing complete model code and documentation, 6B parameter efficient image generation model

Z-Image is a powerful and efficient image generation model with 6B parameters. Currently there are three variants: Z-Image-Turbo (distilled version, only 8-step inference), Z-Image-Omni-Base (base model) and Z-Image-Edit (image editing variant).

Z-Image Turbo on ModelScope

ModelScope

Online Experience

Turbo

API

Z-Image Turbo on ModelScope

Z-Image Turbo model on ModelScope platform, providing online experience and API interface

Experience Z-Image Turbo model on ModelScope platform, providing online inference service and API interface for quick integration and use by developers.

Z-Image Elo Rating on AI Arena

HuggingFace

Community

Model

110k+ Downloads

Z-Image Turbo on HuggingFace

Z-Image Turbo model on HuggingFace platform, with over 110k monthly downloads

Z-Image Turbo model in HuggingFace community, providing complete model weights, usage examples and community support. Monthly downloads reach 111,244.

Z-Image De-Turbo Generation Sample

De-distilled

LoRA Training

Deep Fine-tuning

ComfyUI

Trainability

Z-Image De-Turbo De-distilled Model

De-distilled version of Z-Image model, breaking turbo distillation limits and restoring trainability and flexibility

Z-Image De-Turbo is a de-distilled version of Tongyi-MAI/Z-Image-Turbo, fine-tuned on images generated by Z-Image-Turbo to break down the turbo distillation limitations. This model is specifically designed for training and deep fine-tuning, offering enhanced trainability and flexibility compared to the original turbo model.

ComfyUI Z-Image workflow interface

ComfyUI

Tutorial

Workflow

Local Deployment

Z-Image-Turbo

8-Step Generation

Chinese Text

Z-Image Turbo ComfyUI Tutorial

Official ComfyUI tutorial for Z-Image Turbo, providing complete workflow setup, model download guide, and parameter configuration for optimal performance

Official ComfyUI documentation for Z-Image Turbo, featuring complete workflow templates, detailed model download instructions, and optimization settings for low VRAM devices. Perfect for users who want local deployment with full control over the generation process.

PrunaAI optimization performance comparison

Replicate

PrunaAI

Optimized

Performance

Speed Boost

Cost Effective

Apache 2.0

8-Step Generation

Z-Image Turbo (PrunaAI Optimized)

PrunaAI-optimized version of Z-Image Turbo with enhanced speed through smart caching, model compilation, and quantization while maintaining photorealistic quality and Chinese text rendering capabilities

PrunaAI's optimized version of Tongyi-MAI's Z-Image Turbo, accelerated through advanced compression techniques. This version applies smart caching, model compilation, and quantization to make image generation even faster while preserving the original model's photorealistic quality and excellent Chinese text rendering capabilities.

GGUF-Org Z-Image GGUF Models

GGUF

Quantization

Low VRAM

ComfyUI

6GB Compatible

Consumer GPU

GGUF-Org Z-Image GGUF Models

Official GGUF quantized versions of Z-Image Turbo models for low-VRAM deployment (6GB+), optimized for ComfyUI with multiple quantization levels

GGUF-Org provides officially converted GGUF quantized versions of Z-Image Turbo, enabling deployment on consumer-grade GPUs with as low as 6GB VRAM. Supports multiple quantization levels including Q3_K_S, IQ4_NL, and IQ4_XS for different VRAM/quality trade-offs.

Qwen3-4B GGUF Text Encoder

Text Encoder

LLM

GGUF

Qwen3

4B Parameters

Bilingual

Thinking Mode

Qwen3-4B GGUF Text Encoder

GGUF quantized version of Qwen3-4B language model, essential text encoder for Z-Image GGUF deployments with thinking mode support

Qwen3-4B-GGUF is the required text encoder for Z-Image GGUF deployments, providing bilingual (Chinese/English) understanding and advanced reasoning capabilities. Supports unique thinking mode for complex logical reasoning tasks.

Jayn7 Z-Image Turbo GGUF Collection

GGUF

Community

Quantization

ComfyUI

Low VRAM

Tutorial

Jayn7 Z-Image Turbo GGUF Collection

Community-maintained GGUF collection of Z-Image Turbo with multiple quantization options and ComfyUI integration guides

Community-maintained GGUF model collection by Jayn7, providing multiple quantization variants of Z-Image Turbo with detailed ComfyUI setup guides. Popular choice with over 200 likes in the community.

ControlNet e LoRA3

Pixel art example 1

LoRA

Pixel Art

Stylization

AI Toolkit

Pixel Art Style LoRA for Z-Image Turbo

Pixel art style LoRA specially designed for Z-Image Turbo, enhancing pixel art generation capabilities

This LoRA model enhances Z-Image's existing pixel art capabilities, making them more detailed and refined. No trigger words required, but using "pixel art" in prompts can achieve better results.

Pose control input

ControlNet

Image Control

Multi-functional

PAI

Z-Image Turbo Fun ControlNet Union

Multi-functional ControlNet released by Alibaba PAI, supporting Canny, HED, Depth, Pose and MLSD controls

This is a ControlNet model with 6 blocks added, trained from scratch on 1 million high-quality image dataset for 10,000 steps, supporting multiple control conditions.

Abstract Vector style input image

Image to LoRA

Style Transfer

LoRA Generation

DiffSynth Studio

Multi-Model System

Qwen-Image-i2L (Image to LoRA)

Revolutionary Image to LoRA model that takes images as input and outputs trained LoRA models, enabling instant style transfer and content preservation

Qwen-Image-i2L is an innovative model that takes images as input and directly outputs LoRA weights trained on those images. The system includes four specialized models: i2L-Style for style transfer, i2L-Coarse and i2L-Fine for content preservation, and i2L-Bias for output alignment with Qwen-Image aesthetics.

Demonstrações de Aplicações5

Z-Image Turbo demo interface

HuggingFace Space

Official Demo

Online Experience

Zero GPU

Z-Image Turbo Official Demo

Z-Image Turbo official online demo application, providing direct experience of Z-Image generation capabilities

Z-Image Turbo official online demo application, maintained by Tongyi-MAI team, providing a platform to directly experience Z-Image generation capabilities. Runs on Zero GPU, no local configuration required.

Z-Image Gallery showcase

ModelScope Studio

Official Gallery

Chinese Interface

Effect Showcase

Z-Image Gallery ModelScope

Z-Image gallery application on ModelScope platform, showcasing model generation effects

Z-Image gallery application on ModelScope platform, showcasing Z-Image model generation effects in gallery format, providing Chinese interface optimization and integrated ModelScope ecosystem.

Smart frame application

HuggingFace Space

Smart Frame

Birthday Project

Creative Application

MCP 1st Birthday Smart Frame

Smart frame application, birthday commemorative project based on Z-Image

Smart frame application based on Z-Image, specially developed for MCP 1st birthday commemorative project, showcasing Z-Image's possibilities in creative application scenarios.

LoRA gallery interface

HuggingFace Space

LoRA Gallery

Art Styles

Identity Models

Z.I.T. LoRAs Gallery

Z-Image custom LoRA model online gallery, supporting multiple radical art styles and identity models

Z-Image custom LoRA model online gallery, hosting multiple custom-trained LoRA models including radical art styles and identity models, supporting online switching and preview functionality.

Mystery game interface

HuggingFace Space

Mystery Game

Interactive Application

Boopster Murder Mystery

Interactive mystery game application based on Z-Image

Interactive mystery game application developed based on Z-Image, combining image generation and gameplay, showcasing Z-Image's innovative use in entertainment application scenarios.

Artigos Académicos1

Paper model architecture diagram

Paper

Research

DMDR

Reinforcement Learning

Distribution Matching Distillation Meets Reinforcement Learning

Z-Image core technical paper, introducing DMDR framework: integrating reinforcement learning into distribution matching distillation process

This paper proposes the DMDR framework, integrating reinforcement learning techniques into the distribution matching distillation process. Research shows that for reinforcement learning of few-step generators, the DMD loss itself is more effective than traditional regularization methods.

Blogue Oficial1

Z-Image generation effect showcase

Official

Blog

Updates

Bilingual

Z-Image Official Blog

Z-Image project official homepage, containing latest updates, technical introductions and community information

Z-Image project official homepage, providing comprehensive project introduction including core features, model architecture, performance evaluation and technical details. Supports bilingual content in Chinese and English.