Back to List

Z-Image-Omni-Base

Strategic evolution of Z-Image-Base, featuring omni pre-training for unified image generation and editing, avoiding complexity and performance loss of task switching.

Omni-Base
Omni Pre-training
Unified Architecture
S3-DiT
Generation & Editing

Overview

Z-Image-Omni-Base marks a strategic shift from the original 'Base' model towards an 'omni' (omnipotent) pre-training architecture. It unifies image generation and editing/inpainting tasks within a single framework using the Scalable Single-Stream Diffusion Transformer (S3-DiT). This omni pre-training allows for seamless transitions between generating new images and editing existing ones without the need for separate specialized models, offering higher parameter efficiency and flexibility for developers.

Features

  • Omni Pre-training for Unified Generation and Editing
  • Seamless Task Switching without Performance Loss
  • Scalable Single-Stream DiT (S3-DiT) Architecture
  • Supports Cross-Task LoRA Adapters
  • 6B Parameter Efficiency
  • Superior Performance on Complex Multimodal Tasks

Images

Z-Image-Omni-Base Architecture and Vision
The transition from Base to Omni-Base: Unified Architecture for Generation and Editing

Related Links

Popular Prompts

Discover more creative ideas