Back to List

Qwen3-4B GGUF Text Encoder

GGUF quantized version of Qwen3-4B language model, essential text encoder for Z-Image GGUF deployments with thinking mode support

Text Encoder
LLM
GGUF
Qwen3
4B Parameters
Bilingual
Thinking Mode

Overview

Qwen3-4B-GGUF is the required text encoder for Z-Image GGUF deployments, providing bilingual (Chinese/English) understanding and advanced reasoning capabilities. Supports unique thinking mode for complex logical reasoning tasks.

Features

  • 4B parameters with efficient GGUF quantization
  • Seamless switching between thinking and non-thinking modes
  • Excellent bilingual text understanding (Chinese & English)
  • 32K native context length (131K with YaRN)
  • Superior reasoning, coding, and math capabilities
  • 36 layers with GQA architecture (32 Q heads, 8 KV heads)
  • Multiple quantization levels (Q4_K_M recommended)

Installation

Download GGUF file and place in ComfyUI/models/text_encoders/ directory

Usage

Use as text encoder with Z-Image GGUF models in ComfyUI. Supports enable_thinking parameter for activating reasoning mode.

Requirements

  • 4GB+ VRAM for text encoding
  • Compatible with Z-Image GGUF models
  • ComfyUI GGUF extension

Related Links