Back to List

Z-Image

An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer with 6B parameters.

Foundation Model
S3-DiT
6B Parameters
Open Source SOTA

Overview

Z-Image is a powerful and highly efficient image generation foundation model with 6B parameters. Leveraging the Scalable Single-Stream Diffusion Transformer (S3-DiT) architecture, it processes text, visual semantic tokens, and image VAE tokens as a unified stream. Z-Image serves as the core for variants like Z-Image-Turbo and Z-Image-Omni-Base, delivering state-of-the-art performance among open-source models.

Features

  • 6B Parameter Efficient Foundation Model
  • Scalable Single-Stream DiT (S3-DiT) Architecture
  • Ranked #1 Open-Source Model on Artificial Analysis Leaderboard
  • Excellent Photorealistic Image Generation
  • Accurate Bilingual (English/Chinese) Text Rendering
  • Strong Prompt Enhancing and Reasoning Capabilities

Images

Z-Image Leaderboard Performance
Z-Image ranking on Artificial Analysis Text-to-Image Leaderboard

Related Links

Popular Prompts

Discover more creative ideas