Z-Image
An efficient 6B-parameter image generation foundation model built on a Single-Stream Diffusion Transformer.
Overview
Z-Image is an efficient image generation foundation model with 6B parameters. Built on the Scalable Single-Stream Diffusion Transformer (S3-DiT) architecture, it processes text tokens, visual semantic tokens, and image VAE tokens as a single unified stream. Z-Image serves as the core for variants such as Z-Image-Turbo and Z-Image-Omni-Base, and delivers state-of-the-art performance among open-source models.
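The "unified stream" idea can be illustrated with a toy sketch. It only shows how tokens from three sources might be laid out as one sequence, so that a single stack of transformer blocks could attend over all of them together. The `Token` class, the `build_single_stream` helper, the modality labels, the token counts, and the ordering (text, then semantic, then VAE) are all hypothetical choices made for illustration; they are not taken from the Z-Image implementation, which operates on learned embeddings rather than labeled placeholders.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Token:
    """Placeholder for one sequence element (hypothetical, for illustration)."""
    modality: str  # "text", "semantic", or "vae"
    index: int     # position within its own modality


def build_single_stream(n_text: int, n_semantic: int, n_vae: int) -> List[Token]:
    """Concatenate tokens from all three sources into one sequence.

    The ordering used here is an assumption, not the model's documented layout.
    """
    stream: List[Token] = []
    for modality, count in (("text", n_text), ("semantic", n_semantic), ("vae", n_vae)):
        stream.extend(Token(modality, i) for i in range(count))
    return stream


if __name__ == "__main__":
    stream = build_single_stream(n_text=4, n_semantic=2, n_vae=6)
    # One flat sequence: every token shares the same stream.
    print(len(stream), [t.modality for t in stream])
```

In a single-stream layout like this, the sequence length seen by each block is the sum of the three token counts, which is the main cost to keep in mind when the image resolution (and therefore the number of VAE tokens) grows.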
Features
- 6B Parameter Efficient Foundation Model
- Scalable Single-Stream DiT (S3-DiT) Architecture
- Ranked #1 Open-Source Model on the Artificial Analysis Leaderboard
- Excellent Photorealistic Image Generation
- Accurate Bilingual (English/Chinese) Text Rendering
- Strong Prompt Enhancement and Reasoning Capabilities