Z-Image

An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer with 6B parameters.

Foundation Model

S3-DiT

6B Parameters

Open Source SOTA

Overview

Z-Image is a powerful and highly efficient image generation foundation model with 6B parameters. Leveraging the Scalable Single-Stream Diffusion Transformer (S3-DiT) architecture, it processes text, visual semantic tokens, and image VAE tokens as a unified stream. Z-Image serves as the core for variants like Z-Image-Turbo and Z-Image-Omni-Base, delivering state-of-the-art performance among open-source models.

Features

6B Parameter Efficient Foundation Model
Scalable Single-Stream DiT (S3-DiT) Architecture
Ranked #1 Open-Source Model on Artificial Analysis Leaderboard
Excellent Photorealistic Image Generation
Accurate Bilingual (English/Chinese) Text Rendering
Strong Prompt Enhancing and Reasoning Capabilities

Images

Z-Image Leaderboard Performance

Z-Image ranking on Artificial Analysis Text-to-Image Leaderboard

Related Links

GitHub Repository

Popular Tools

Explore our most popular creative tools

Z-Image Edit

Upload image, transform with one sentence

Creative Engine

One sentence, AI provides infinite prompt creativity.

Image Parse

Upload image, retrieve prompt instantly.

Prompt Library

Discover thousands of high-quality AI prompts.

Z-Image LoRA

Combine multiple LoRA models to create unique AI artwork

Z-Video

Generate creative videos from text or images with AI.

AI Image Generator

Turn your text into stunning images instantly.

Style Library

Explore curated artistic styles for your creations.

Remove Background

Instantly remove backgrounds from images with precision.

Image Upscaler

Enhance image resolution up to 4K/8K.

Image Reframe

Expand images to any aspect ratio with outpainting.